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^f- (54) Title: HUMAN SECRETED PROTEINS 

s (57) Abstract: The present invention relates to human secreted polypeptides, and isolated nucleic acid molecules encoding said 
^ polypeptides, useful for diagnosing and treating gastrointestinal diseases, disorders, and/or conditions related thereto. Antibodies 
that bind these polypeptides are also encompassed by the present invention. Also encompassed by the invention are vectors, host 
cells, and recombinant and synthetic methods for producing said polynucleotides, polypeptide*, and/or antihodics. The invention 
further encompasses screening methods for identifying agonists and antagonists of polynucleotides and polypeptides of the invention. 
^ The present invention further encompasses methods and compositions for inhibiting or enhancing the production and function of the 
polypeptides of the present invention. 
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Human Secreted Proteins 

5 

Field of the Invention 

The present invention relates to human secreted proteins/polypeptides, and isolated nucleic 
acid molecules encoding said proteins/polypeptides, useful for detecting, preventing, diagnosing, 
prognosticating, treating, and/or ameliorating gastrointestinal diseases and disorders. Antibodies 

10 that bind these polypeptides are also encompassed by the present invention. Also encompassed by 
the invention are vectors, host cells, and recombinant and synthetic methods for producing said 
polynucleotides, polypeptides, and/or antibodies. The invention further encompasses screening 
methods for identifying agonists and antagonists of polynucleotides and polypeptides of the 
invention. The present invention further encompasses methods and compositions for inhibiting or 

15 enhancing the production and function of the polypeptides of the present invention. 

Background of the Invention 
The human digestive system is a collection of specialized organs and body tissues that 
prepare food for use by hundreds of millions of body cells. Food when eaten cannot reach cells 
20 because it cannot pass through the intestinal walls to the bloodstream and, if it could would not be 
in a useful chemical state. The gastrointestinal system modifies food physically and chemically 
and disposes of unusable waste. Physical and chemical modification (digestion) depends on 
exocrine and endocrine secretions and controlled movement of food through the digestive tract. 

The three fundamental processes of the digestive system are: secretion (e.g., delivery of 
25 enzymes, mucus, ions and the like into the lumen, and hormones into blood), absorption (e.g., 
transport of water, ions and nutrients from the lumen, across the epithelium and into blood), and 

♦ 

motility (e.g., contractions of smooth muscle in the wail of the tube that crush, mix and propel its 
contents). Control of digestive function is achieved through a combination of electrical and 
hormonal messages which originate either within the digestive system's own nervous and 
30 endocrine systems, as well as from the central nervous system and from endocrine organs such as 
the adrenal gland. 

The digestive system is composed of the digestive or alimentary tube and accessory 
digestive organs, which include the Mouth (e.g., tongue, taste buds, soft palate pharynx, salivary 
glands, teeth), Esophagus, Stomach, Liver, Gallbladder, Pancreas, Small Intestine (e.g., 
35 duodenum, jejunum, and ileum), and Large Intestine (e.g., caecum). 
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Common digestive system disorders including infections, inflammations, ulcers and 
cancers of the digestive or alimentary tube and above listed accessory digestive organs are 
described in more detail below. 

5 Disorders of the Mouth 

The mouth comprises an area from the lips to the front of the tonsils (fauces) at the start of 
the throat. The mouth contains the gums, teeth, and the tongue, together with salivary glands 
which secrete fluids that lubricate and begin food digestion as it is chewed. The roof of the mouth 

■ 

consists of the hard palate at the front and the soft palate at the back. The floor of the mouth 
10 comprises the tongue (controlled by a number of muscles attached to bones in the neck). At the 
front and sides of the tongue there are a number of taste buds. These respond to different tastes at 
different places (e.g., sweet, salty, sour, and bitter). At the back of the tongue there are some 
swellings which consist of lymphoid tissue. Underneath the tongue there is a midline attachment 
(frenulum) and the opening of several of the salivary ducts. There are other salivary glands (the 
15 parotid glands) lying over the angle of the jaw with a duct opening to the inside of the cheek at 
about the level of the second molar tooth. 

Diseases and disorders of the mouth are vary greatly in manifested symptoms, frequencies, 
severities, and causes. Accordingly, diseases and disorders of the mouth may be caused or 
initiated by viruses, bacteria, genetics (e.g. autoimmune disorders), physical or chemical trauma, 
20 etc. For example, diseases and disorders of the mouth include canker sores (aphthous ulcers), 
herpetic stomatitis leukoplakia, gingivostomatitis, oral cancer, oral lichen lanus, oral thrush, 
histoplasmosis, salivary gland infections, glossitis, Hand, Foot and Mouth disease, salivary duct 
stones, mumps, etc. 

25 Disorders of the Esophagus 

Disorders of the Esophagus include dysphagia (e.g., difficulty in swallowing) and 
odynophagia (e.g., difficulty in swallowing accompanied by pain). Inflammatory disorders of the 
esophagus result from a variety of causes; for example, ingestion of noxious materials (e.g., 
corrosive esophagitis), lodgment of foreign bodies, or a complex of events associated with reflux 

30 of gastric contents from the stomach into the lower esophagus (e.g., peptic esophagitis). 

Disorders of the motility of the esophagus tend to be either precipitated or aggravated at 
times of nervous stress. A disorder commonly due to obesity is gastric reflux. Persisting reflux of 
gastric contents with acid and digesting enzymes leads to chemical inflammation of the lining of 
the esophagus and ultimately to (peptic) ulceration. If inadequately treated, the process leads to 

35 submucosal fibrosis and stricturing, and, besides the symptoms of heartburn and regurgitation, the 
patient experiences pain on eating and swallowing. 
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Further disorders of the esophagus include the formation of diverticula. A serious injury to 
the esophagus is spontaneous rupture. It can occur in patients who have been vomiting or retching 
and in debilitated elderly persons with chronic lung disease. A rupture of this type confined to the 
mucosa only at the junction of the linings of the esophagus and stomach is called a Mallory-Weiss 
5 lesion. 

Benign tumors of the esophagus originate in the submucosal tissues and principally are 
leiomyomas (tumors composed of smooth muscle tissue) or lipomas (tumors composed of adipose, 
or fat, tissues). Malignant tumors are either epidermal cancers, made up of unorganized aggregates 
of cells, or adenocarcinomas, in which there are gland-like formations. Cancers arising from 
10 squamous tissues are found at all levels of the organ, whereas adenocarcinomas are more common 
at the lower end where a number of glands of gastric origin are normally present. The prognosis is 
poor because diagnosis is difficult and the tumor has usually been growing for one or two years 
before symptoms are apparent 

15 Disorders of the stomach 

Any disorder that affects the power of coordination of the stomach muscles is capable of 
producing symptoms ranging from those that are mildly unpleasant (e.g., anorexia and nausea) to 
others that are life-threatening. The intrinsic muscles of the stomach are innervated by branches of 
the vagus nerves, which travel along the esophagus from their point of emergence in the brain . 

20 stem. Severing these nerves or altering their function by the use of anticholinergic medication may 
produce temporary or more prolonged change in the ability of the stomach to empty itself. Gastric 
retention may result from the degeneration of the nerves to the stomach that can result from 
diabetes mellitus. Obstruction due to scarring in the area of the gastric outlet, or to tumors 
encroaching on the lumen, causes the stomach to fill up with its own secretions as well as with 

25 partially digested food. In these circumstances, vomiting leads to dehydration and to electrolyte 
losses, which threaten life if not corrected. 

■ 

Disorders of the stomach include ulcerative diseases, which involve mucosal breakdown 
either confined to the superficial layers of the mucosa (e.g., an erosion) or extending through the 
intrinsic layer of muscle of the mucosa into the tissues below (e.g., an ulcer). The circumstances 

30 that contribute to mucosal injury and ulcer formation include physical and chemical trauma that 
result from hot fluids and food, aspirin and other drugs, irritating spices, and pickling fluids. In 
addition, genetic factors are involved in the development of ulcers. The complications of peptic 
ulcers are hemorrhage, perforation, and obstruction of the outlet of the stomach (pyloric stenosis) 
by scarring of the duodenal bulb or of the pyloric channel. A diffuse inflammation of the stomach 

35 lining, gastritis, is usually an acute process caused by containinated food, alcohol abuse, or by 
bacterial- or viral-induced inflammation of the gastrointestinal tract (gastroenteritis). The other 
form of gastritis is gastric atrophy, in which the thickness of the mucosa is diminished. Diffuse 
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gastric atrophy leads to partial loss of the glands and secreting cells throughout the stomach and 
may be associated with iron-deficiency anemia. 

Malignant tumors of the stomach are common and are probably a result of both genetic 
and environmental factors. Gastric cancer affects men more often than women and accounts for 
5 about 20 percent of all deaths from cancers of the gastrointestinal tract in the United States. Other, 
malignant tumors that involve the stomach are tumors ordinarily made up of lymphoid and 
connective tissue. Benign rumors, especially leiomyomas, are common and may, when large, cause 
massive hemorrhage. Polyps of the stomach are not common except in the presence of gastric 
atrophy. 

10 

Disorders of the Duodenum and Small Intestine 

Primary cancer of the duodenum is an infrequent disease, however, benign tumors of the 
duodenum, particularly polyps and carcinoids, are more frequent. Cancers of the common bile duct 
or of the pancreas are important causes of death. A common disorder of the small intestine, 

15 distension, is caused by lack of coordination of the inner circular and outer longitudinal muscular 
layers of the intestinal wall which usually results in an accumulation of excess contents in the 
lumen. The most common cause of disturbed motility in the small intestine is food that contains an 
unsuitable additive, organism, or component. One of the most serious problems in small intestine 
are motor disturbances which arise from an intestinal obstruction that results from an actual 

20 encroachment on the bowel by an adhesive band or from an internal block produced by a tumor or 
gallstone. In addition, as profound an obstruction results when a portion of the intestine undergoes 
partial necrosis, or death, from failure of its blood supply. 

The extremely common disorder known as the irritable bowel syndrome is probably due to 
a disturbance of the motility of the whole intestinal tract. The symptoms vary from watery diarrhea 

25 to constipation and the passage of stools with difficulty. When the colon is involved, an excess of 
mucus is often observed in the stools. Occasionally the irritable bowel syndrome may be due to an 
allergy to a particular foodstuff. The syndrome may develop following an infection such as 
bacillary dysentery, after which the small intestine remains irritable for many months. 

A further disorder, malabsorption occurs when the small intestine is unable to transport 

30 properly broken down products of digestive materials from the lumen of the intestine into the 
lymphatics or mesenteric veins, where they are distributed to the rest of the body. Defects in 
transport occur either because the absorptive cells of the intestine lack certain enzymes, whether 
by birth defect or by acquired disease, or because they are hindered in their work by other disease 
processes that infiltrate the tissues, disturb motility, permit bacteria to overpopulate the bowel, or 

35 block the pathways over which transport normally proceeds. A malabsorption disorder of unknown 
cause, tropical sprue, is associated with partial atrophy of the mucosa of the small intestine. 
Disorders of the small intestine also include bacterial and parasitic infections. 
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Appendicitis is an inflammation of the vermiform appendix that may be caused by 
infection or partial or total obstruction. Chronic inflainmations of the small intestine include 
tuberculosis and regional enteritis (Crohn's disease). Celiac disease causes damage to the mucosa 
of the small intestine, though it is not clear whether it is caused by an immune reaction, or an 
5 inability to break down a toxic protein, gluten, to smaller peptide fractions. Studies of the immune 
function of those with celiac disease suggest that at least a major part of the process is a delayed 
hypersensitivity reaction and that the morphological changes are correlated with the presence of 
circulating antibodies to gluten. The mucosal reaction results in progressive atrophy, with 
dwarfing, if not complete disappearance, of the microvilli and villi that line the intestinal tract. 

10 

Disorders of the Large Intestine 

A wide variety of diseases and disorders occur in the large intestine. A disease that is 
analogous to achalasia of the esophagus is an idiopathic condition called aganglionic megacolon, 
or Hirschsprung's disease. It is characterized by the absence of ganglion cells and normal nerve 

15 fibres from the distal (or lower) portion of the large intestine, which results in reduced 
neuromuscular transmission and ceased peristalsis. The entire colon slowly becomes more and 
more distended and thick-walled. Abscesses in the perianal area are common complicating features 
of many diseases and disorders of the large intestine. Fungal and bacterial infections are also 
common causes of large intestine disorders. 

20 The most common form of chronic colitis, ulcerative colitis, is idiopathic. It varies from a 

mild inflammation of the mucosa of the rectum, giving rise to excessive mucus and some spotting 
of blood in the stools, to a severe, sudden, intense illness, with destruction of a large part of the 
colonic mucosa, considerable blood loss, toxemia and, less commonly, perforation. The most 
common variety affects only the rectum and sigmoid colon and is characterized by diarrhea and 

25 the passage of mucus. Apart from the greater tendency for fistulas to form and for the wall of the 
intestine to thicken until the channel is obstructed, Crohn's disease is distinguishable from 
ulcerative colitis by microscopic findings. In Crohn's disease, the maximum damage occurs 
beneath the mucosa, and lymphoid conglomerations, known as granuiomata, are formed in the 
submucosa. Crohn's disease attacks the perianal tissues more often than does ulcerative colitis. 

30 Although these two diseases are not common, they are disabling. 

Tumors of the colon are usually polyps or cancers. A peculiar form of polyp is the villous 
adenoma, often a slowly growing, fernlike structure that spreads along the surface of the colon for 
some distance. Cancers compress the colonic lumen to produce obstruction, they attach to 
neighbouring structures to produce pain, and they perforate to give rise to peritonitis. Cancers also 

35 may metastasize to distant organs before local symptoms appear. 

Anorectal disorders related to defecation are more common in the Western world than 
elsewhere. These disorders usually take the form of fissures (cuts or cracks in the skin or mucous 
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membrane) at the junction of the anal mucous membrane with the skin between the thighs. Anal 
fistulas sometimes occur as complications of serious bowel disease, as in tuberculosis or Crohn's 
disease of the bowel, or in certain parasitic diseases. A more general disorder is the enlargement of 
veins of the rectum and anus to form external or internal hemorrhoids. Hemorrhoids protrude, are 
5 associated with anal itching and pain, and bleed, especially when they come in contact with hard 
stools. 

Disorders of the Liver 

A variety of agents, including viruses, drugs, environmental pollutants, genetic disorders, 

10 and systemic diseases, can affect the liver. The resulting disorders usually affect one of the three 
functional components of the liver: the hepatocyte (liver cell) itself, the bile secretory 
(cholangiolar) apparatus, or the blood vascular system. Most acute liver diseases are self-limited, 
and liver functioning returns to normal once the causes are removed or eliminated. In some cases, 
however, the acute disease process destroys massive areas of liver tissue in a short time, leading to 

15 extensive death (necrosis) of hepatic cells and often to death of the patient. Hepatitis may result 
from viral infections or toxic damage from drugs or poisons. When acute hepatitis lasts for six 
months or more, a slow but progressive destruction of the surrounding liver cells and bile ducts 
occurs, a stage called chronic active hepatitis. If hepatocellular damage is severe enough to destroy 
entire acini (clusters of lobules), they are often replaced with fibrous scar tissue. Bile canaliculi 

20 and hepatocytes regenerate in an irregular fashion adjacent to the scar tissue and result in a chronic 
condition called cirrhosis of the liver. Where inflammatory activity continues after the onset of 
cirrhosis, the disorderly regeneration of hepatocytes and cholangites may lead to the development 
of hepatocellular or cholangiolar cancer. 

Although a number of viruses affect the liver, including the cytomegalovirus of infancy 

25 and childhood and the Epstein-Barr virus of infectious mononucleosis, there are three distinctive 
transmissible viruses that are specifically known to cause acute damage to liver cells: hepatitis 
virus A (HAV), hepatitis virus B (HBV), and hepatitis virus non-A, non-B (NANB). The 
symptoms characteristic of the acute hepatitis caused by the HAV, HBV, and NANB viruses are 
essentially indistinguishable from one another. 

30 Acute hepatitis also may be caused by the overconsumption of alcohol or other poisons, 

such as commercial solvents (e.g., carbon tetrachloride), acetaminophen, and certain fungi. Such 
agents are believed to cause hepatitis when the formation of their toxic intermediate metabolites in 
the liver cell (phase I reactions) is beyond the capacity of the hepatocyte to conjugate, or join them 
with another substance for detoxification (phase II reactions) and excretion. Acute canalicular 

35 (cholestatic) hepatitis is most commonly caused by certain drugs, such as chlorpromazine, that 
lead to idiosyncratic reactions or, at times, by hepatitis viruses. Acute congestive liver disease 
usually results from the sudden engorgement of the liver by fluids after congestive heart failure. 
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A prominent autoimmune liver disease is Wilson's disease, which is caused by abnormal 
deposits of large amounts of copper in the liver. Granulomatous hepatitis, a condition in which 
localized areas of inflammation (granulomas) appear in any portion of the liver lobule, is a type of 
inflammatory disorder associated with many systemic diseases, including tuberculosis, sarcoidosis, 
5 schistosomiasis, and certain drug reactions. Granulomatous hepatitis rarely leads to serious 
interference with hepatic function, although it is often chronic. The end result of many forms of 
chronic liver injury is cirrhosis, or scarring of liver tissue in reponse to previous acinar necrosis 
and irregular regeneration of liver nodules and bile ducts. 

Primary biliary cirrhosis, a widespread, though uncommon, autoimmune inflammatory 

10 disease of bile ducts, is a disorder primarily affecting middle-aged and older women. Secondary 
biliary cirrhosis results from chronic obstruction or recurrent infection in the extrahepatic bile 
ducts caused by strictures, gallstones, or tumors. Infestation of the biliary tract with a liver fluke, 
Clonorchis sinensis, is a cause of secondary biliary cirrhosis in Asia. 

Portal hypertension, the increased pressure in the portal vein and its tributaries that is the 

15 result of impediments to venous flow into the liver, is brought about by the scarring characteristic 
of the cirrhotic process. The increased pressure causes feeders of the portal vein to distend 
markedly, producing varices, or dilations of the veins. When varices are located in superficial 
tissues, they may rupture and bleed profusely. Two such locations are the lower esophagus and the 
perianal region. The accumulation of fluid in the abdominal cavity, or ascites, is related to portal 

20 hypertension, significant reduction in serum albumin, and renal retention of sodium. When 
albumin levels in blood are lower than normal, there is a marked reduction in the force that holds 
plasma water within the blood vessels and normally resists the effects of the intravascular pressure. 
The resulting increase in intravascular pressure, coupled with the increased internal pressure 
caused by the portal venous obstruction in the liver, leads to massive losses of plasma water into 

25 the abdominal cavity. The associated reduction of blood flow to the kidneys causes increased 
elaboration of the hormone aldosterone, which, in turn, causes the retention of sodium and water 
and a reduction in urinary output. In addition, because the movement of intestinal lymph into the 
liver is blocked by the cirrhotic process in the liver, the backflow of this fluid into the abdominal 
cavity is gready increased A progressive reduction in kidney function that often occurs in persons 

30 with advanced acute or chronic liver disease, hepatorenal syndrome, probably results from an 
inadequate perfusion of blood through the cortical (outer) portions of the kidneys, where most 
removal of waste products occurs. With advanced hepatocytic dysfunction, a spasm of blood 
vessels in the renal cortex can occur, often with good blood flow to the rest of the kidney. This 
spasm results in progressive failure in kidney function and often leads to death. 

35 Although not uncommon, cancer originating in the liver, usually in hepatocytes and less 

frequently in cells of bile duct origin, is rare in the West and is almost always associated with 
active cirrhosis, particularly the form found in patients with chronic hepatitis. Long exposure to 
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certain environmental poisons, such as vinyl chloride or carbon tetrachloride, has also been shown 
to lead to hepatic cancer. Cancers arising elsewhere in the body, particularly in abdominal organs, 
lungs, and lymphoid tissue, commonly lead to metastatic cancer in the liver and are by far the most 
frequent type of hepatic malignancy. Various benign types of tumors and cysts arise from certain 
5 components of the liver, such as the hepatocytes (adenomas) or blood vessels (hemangiomas). 
While the cause of these lesions is not always clear, hepatic adenomas are associated with the 
prolonged use of female sex hormones (estrogens). Benign cysts in the liver may occur as 
congenital defects or as the result of infections from infestation of the dog tapeworm 
(Echinococcus granulosus). Abscesses on the liver result from the spread of infection from the 
10 biliary tract or from other parts of the body, especially the appendix and the pelvic organs. Specific 
liver abscesses also result from infections with the intestinal parasite Entamoeba histolytica. 

Disorders of the Biliary Tract 

Cholelithiasis, or the formation of gallstones in the gallbladder, is the most common 
15 disease of the biliary tract. There are three types of Gallstones: stones containing primarily calcium 
bilirubinate (pigment stones); stones containing 25 percent or more of cholesterol; and stones 
composed of variable mixtures of both bilirubin and cholesterol (mixed gallstones). Pigment 
stones are the result of an increased amount of bilirubin in the liver (due to hemolytic disease) and 
the consequent secretion into the biliary tract of increased amounts of the water-soluble conjugate, 
20 bilirubin diglucuronide, a pigment that is normally secreted in the urine. Cholesterol and mixed 
cholesterol-bilirubinate stones occur when the proportion of cholesterol in bile exceeds the 
capacity of bile acids and lecithin to contain the total amount of cholesterol in micellar colloidal 
solution. Postcholecystectomy syndrome comprises painful attacks, often resembling preoperative 
symptoms, that occasionally occur following the surgical removal of gallstones and the 
25 gallbladder. These attacks may be related to intermittent muscular spasms of the sphincter of Oddi 
or of the bile ducts. 

Cancer of the biliary tract is rare but may occur in almost any area, including the 
gallbladder, the hepatic ducts, the common bile duct, or the ampulla of Vater. In cancer of the bile 
duct, congenital cysts and parasitic infections, such as liver flukes, seem to lead to increased risks. 
30 Persons with extensive chronic ulcerative colitis also show a greater than normal incidence of bile 
duct carcinoma. 

Jaundice, or yellowing of the skin, scleras, and mucous membranes, occurs whenever the 
level of bilirubin in the blood is significantly above normal. This condition is evident in three 
different types of disorders including, unconjugated, or hemolytic, jaundice; hepatocellular 
35 jaundice; and cholestatic, or obstructive jaundice. Unconjugated jaundice results when the amount 
of bilirubin produced from hemoglobin by the destruction of red blood cells or muscle tissue 
(myoglobin) overwhelms the normal capacity of the liver to transport it or when the ability of the 
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liver to conjugate normal amounts of bilirubin into bilirubin diglucuronide is significantly reduced 
by inadequate intracellular transport or enzyme systems. Hepatocellular jaundice arises when liver 
cells are damaged so severely that their ability to transport bilirubin diglucuronide into the biliary 
system is reduced, allowing some of this yellow pigment to regurgitate into the bloodstream. 
5 Cholestatic jaundice, occurs when essentially normal liver cells are unable to transport bilirubin 
either through the hepatocytic-bile capillary membrane, because of damage in that area, or through 
the biliary tract, because of anatomical obstructions (e.g., atresias, gallstones, cancer). 

Disorders of ilie Pancreas 

10 Inflammation of the pancreas, or pancreatitis, is probably the most common disease of this 

organ. The disorder may be confined to either singular or repeated acute episodes, or it may 
become a chronic disease. There are many factors associated with the onset of pancreatitis, 
including direct injury, certain drugs, viral infections, heredity, hyperlipidemia (increased levels of 
blood fats), and congenital derangements of the ductal system. Localized, severe abdominal and 

15 midback pain resulting from enzyme leakage, tissue damage, and nerve irritation is the most 
common symptom of acute pancreatitis. In severe cases, respiratory failure, shock, and even death 
may occur. Chronic pancreatitis rarely follows repeated acute attacks. It seems instead to be a 
separate disorder that results in mucus plugs and precipitation of calcium salts in the smaller 
pancreatic ducts. Mucous production and plugging of the pancreas in Cystic fibrosis patients 

20 almost invariably causes destruction and scarring of the acinar tissue, usually without damaging 
the islets of Langerhans. A similar process in the hepatic biliary system produces foci of fibrosis 
and bile duct proliferation, a singular form of cirrhosis. 

The discovery of new human digestive system associated polynucleotides, the 
polypeptides encoded by them, and antibodies that immunospecifically bind these polypeptides, 

25 satisfies a need in the art by providing new compositions which are useful in detecting, preventing, 
diagnosing, prognosticating, treating, and/or ameliorating diseases and disorders of the digestive 
system, including, but not limited to, dysphagia, odynophagia, congenital disorders of the 
esophagus, gastric reflux, diverticula, Mallory-Weiss lesions, leiomyomas of the esophagus, 
lipoma, anorexia, nausea, ulcerative disease, pyloric stenosis, gastroenteritis, gastritis, gastric 

30 atropy, gastric cancer, benign tumors of the duodenum (e.g., polyps and carcinoids), pancreatic 
cancer, cancer of the bile duct, distension, irritable bowel syndrome, malabsorption, congenital 
disorders of the small intestine (e.g., Meckel's diverticulum, multiple diverticula), bacterial and 
parasitic infection (e.g., traveler's diarrhea, typhoid, paratyphoid, cholera, roundworms, 
tapeworms, amoebae, hookworms, strongyloides, threadworms, and blood flukes), megacolon 

35 (e.g., Hirschsprung's disease, aganglionic megacolon, acquired megacolon), colitis (e.g., due to 
bacterial, fungal, or parasitic infection, ulcerative colitis), tumors of the colon (e.g., polyps or 
cancers), anorectal disorders (e.g., anal fistulas, hemorrhoids, hepatitis (e.g., acute, chronic, 
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persistent hepatitis, viral (for example, hepatitis caused by hepatitis virus A (HAV), hepatitis virus 
B (HBV), and hepatitis virus non-A, non-B (NANB) infection), congenital disorders of the liver 
(e.g., Wilson's disease, hemochromatosis, cystic fibrosis, biliary atresia, and alphal -antitrypsin 
deficiency), cirrhosis, portal hypertension, cholelithiasis, cancer of the biliary tract, jaundice (e.g., 
5 unconjugated, hemolytic, hepatocellular, cholestatic, or obstructive jaundice). 

The discovery of new human gastromtestinal-associated polynucleotides, the polypeptides 
encoded by them, and antibodies that immunospecifically bind these polypeptides, satisfies a need 
in the art by providing new compositions which are useful in detecting, preventing, diagnosing, 
prognosticating, treating, and/or ameliorating gastrointestinal-specific diseases and disorders 
10 described in more detail below. 



Summary of the Invention 
The present invention encompasses human secreted proteins/polypeptides, and isolated 
nucleic acid molecules encoding said proteins/polypeptides, useful for detecting, preventing, 

15 diagnosing, prognosticating, treating, and/or ameliorating gastrointestinal diseases and disorders. 
Antibodies that bind these polypeptides are also encompassed by the present invention; as are 
vectors, host cells, and recombinant and synthetic methods for producing said polynucleotides, 
polypeptides, and/or antibodies. The invention further encompasses screening methods for 
identifying agonists and antagonists of polynucleotides and polypeptides of the invention. The 

20 present invention also encompasses methods and compositions for inhibiting or enhancing the 
production and function of the polypeptides of the present invention. 
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Polynucleotides and Polypeptides of the Inventii 
Description of Table 1 A 

Table 1A summarizes information concerning certain polypnucleotides and polypeptides 
of the invention. The first column provides the gene number in the application for each clone 

30 identifier. The second column provides a unique clone identifier, "Clone ID:' 1 , for a cDNA clone 
related to each contig sequence disclosed in Table 1 A. Third column, the cDNA Gones identified 
in the second column were deposited as indicated in the third column (i.e. by ATCC Deposit No:Z 
and deposit date). Some of the deposits contain multiple different clones corresponding to the 
same gene. In the fourth column, "Vector" refers to the type of vector contained in the 

35 corresponding cDNA Clone identified in the second column. In the fifth column, the nucleotide 
sequence identified as "NT SEQ ID NO:X" was assembled from partially homologous 
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("overlapping") sequences obtained from the corresponding cDNA clone identified in the second 
column and, in some cases, from additional related cDNA clones. The overlapping sequences 
were assembled into a single contiguous sequence of high redundancy (usually three to five 
overlapping sequences at each nucleotide position), resulting in a final sequence identified as SEQ 
5 ID NO:X. In the sixth column, "Total NT Seq." refers to the total number of nucleotides in the 
contig sequence identified as SEQ ID NO:X." The deposited clone may contain all or most of 
these sequences, reflected by the nucleotide position indicated as "5' NT of Clone Seq." (seventh 
column) and the "3' NT of Clone Seq." (eighth column) of SEQ ID NO:X. In the ninth column, 
the nucleotide position of SEQ ID NO:X of the putative start codon (methionine) is identified as 

10 "5' NT of Start Codon." Similarly , in column ten, the nucleotide position of SEQ ED NO:X of the 
predicted signal sequence is identified as "5' NT of First AA of Signal Pep." In the eleventh 
column, the translated amino acid sequence, beginning with the methionine, is identified as "AA 
SEQ ID NO:Y," although other reading frames can also be routinely translated using known 
molecular biology techniques. The polypeptides produced by these alternative open reading 

15 frames are specifically contemplated by the present invention. 

4 

In the twelfth and thirteenth columns of Table 1A, the first and last amino acid position of 
SEQ ID NO: Y of the predicted signal peptide is identified as "First AA of Sig Pep" and "Last AA 
of Sig Pep." In the fourteenth column, the predicted first amino acid position of SEQ ID NO: Y of 
the secreted portion is identified as "Predicted First AA of Secreted Portion". The amino acid 

20 position of SEQ ID NO: Y of the last amino acid encoded by the open reading frame is identified in 
the fifteenth column as "Last AA of ORF". 

SEQ ID NO:X (where X may be any of the polynucleotide sequences disclosed in the 
sequence listing) and the translated SEQ ID NO:Y (where Y may be any of the polypeptide 
sequences disclosed in the sequence listing) are sufficiently accurate and otherwise suitable for a 

25 variety of uses well known in the art and described further below. For instance, SEQ ID NO:X is 
useful for designing nucleic acid hybridization probes that will detect nucleic acid sequences 
contained in SEQ ID NO:X or the cDNA contained in the deposited clone. These probes will also 
hybridize to nucleic acid molecules in biological samples, thereby enabling a variety of forensic 
and diagnostic methods of the invention. Similarly, polypeptides identified from SEQ ID NO:Y 

30 may be used, for example, to generate antibodies which bind specifically to proteins containing the 
polypeptides and the secreted proteins encoded by the cDNA clones identified in Table 1 A and/or 
elsewhere herein 

Nevertheless, DNA sequences generated by sequencing reactions can contain sequencing 
errors. The errors exist as misidentified nucleotides, or as insertions or deletions of nucleotides in 
35 the generated DNA sequence. The erroneously inserted or deleted nucleotides cause frame shifts 
in the reading frames of the predicted amino acid sequence. In these cases, the predicted amino 
acid sequence diverges from the actual amino acid sequence, even though the generated DNA 
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sequence may be greater than 99.9% identical to the actual DNA sequence (for example, one base 
insertion or deletion in an open reading frame of over 1000 bases). 

Accordingly, for those applications requiring precision in the nucleotide sequence or the 
amino acid sequence, the present invention provides not only the generated nucleotide sequence 
5 identified as SEQ ID NO:X, and the predicted translated amino acid sequence identified as SEQ 
ID NO:Y, but also a sample of plasmid DNA containing a human cDNA of the invention 
deposited with the ATCC, as set forth in Table 1A. The nucleotide sequence of each deposited 
plasmid can readily be determined by sequencing the deposited plasmid in accordance with known 
methods 

10 The predicted amino acid sequence can then be verified from such deposits. Moreover, 

the amino acid sequence of the protein encoded by a particular plasmid can also be directly 
determined by peptide sequencing or by expressing the protein in a suitable host cell containing 
the deposited human cDNA, collecting the protein, and determining its sequence. 

Also provided in Table 1A is the name of the vector which contains the cDNA plasmid. 

15 Each vector is routinely used in the art. The following additional information is provided for 
convenience. 

Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap XR (U.S. 
Patent Nos. 5,128, 256 and 5,286,636), Zap Express (U.S. Patent Nos. 5,128,256 and 5,286,636), 
pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res, 76:7583-7600 (1988); Alting-Mees, M. 

20 A. and Short, J. M., Nucleic Acids Res. 1 7:9494 (1989)) and pBK (Alting-Mees, M. A. et al., 
Strategies 5:58-61 (1992)) are commercially available from Stratagene Cloning Systems, Inc., 
11011 N. Torrey Pines Road, La Jolla, CA, 92037. pBS contains an ampicillin resistance gene and 
pBK contains a neomycin resistance gene. Phagemid pBS may be excised from the Lambda Zap 
and Uni-Zap XR vectors, and phagemid pBK may be excised from the Zap Express vector. Both 

25 phagemids may be transformed into B. coli strain XL-1 Blue, also available from Stratagene 

Vectors pSportl, pCMVSport 1.0, pCMVSport 2.0 and pCMVSport 3.0, were obtained 
from Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors contain 
an ampicillin resistance gene and may be transformed into E. coli strain DH10B, also available 
from Life Technologies. See, for instance, Gruber, C. E., et al., Focus 15:59 (1993). Vector 

30 lafmid BA (Bento Soares, Columbia University, New York, NY) contains an ampicillin resistance 
gene and can be transformed into E. coli strain XL-1 Blue. Vector pCR®2.1, which is available 
from Invitrogen, 1600 Faraday Avenue, Carlsbad, CA 92008, contains an ampicillin resistance 
gene and may be transformed into E. coli strain DH10B, available from Life Technologies. See, 
for instance, Clark, J. M., Nue. Acids Res. i6:9677-9686 (1988) and Mead, D. et al, 

35 Bio/Teclmology 9: (1991). 

The present invention also relates to the genes corresponding to SEQ ID NO:X, SEQ ID 
NO:Y, and/or a deposited cDNA (cDNA Clone ID). The corresponding gene can be isolated in 

12 
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accordance with known methods using the sequence information disclosed herein. Such methods 
include, but are not limited to, preparing probes or primers from the disclosed sequence and 
identifying or amplifying the corresponding gene from appropriate sources of genomic material. 

* 

Also provided in the present invention are allelic variants, orthologs, and/or species 
5 homologs. Procedures known in the art can be used to obtain full-length genes, allelic variants, 
splice variants, full-length coding portions, orthologs, and/or species homologs of genes 
corresponding to SEQ ID NO:X and SEQ ID NO:Y using information from the sequences 
disclosed herein or the clones deposited with the ATCC For example, allelic variants and/or 
species homologs may be isolated and identified by making suitable probes or primers from the 

10 sequences provided herein and screening a suitable nucleic acid source for allelic variants and/or 
the desired homologue. . 

The present invention provides a polynucleotide comprising, or alternatively consisting of, 
the nucleic acid sequence of SEQ ID NO:X and/or a cDNA contained in ATCC Deposit No.Z. 
The present invention also provides a polypeptide comprising, or alternatively, consisting of, the 

15 polypeptide sequence of SEQ ID NO:Y, a polypeptide encoded by SEQ ID NO:X, and/or a 
polypeptide encoded by a cDNA contained in ATCC deposit No.Z. Polynucleotides encoding a 
polypeptide comprising, or alternatively consisting of the polypeptide sequence of SEQ ID NO: Y, 
a polypeptide encoded by SEQ ID NO:X and/or a polypeptide encoded by the cDNA contained in 
ATCC Deposit No.Z, are also encompassed by the invention. The present invention further 

20 encompasses a polynucleotide comprising, or alternatively consisting of the complement of the 
nucleic acid sequence of SEQ ID NO.X, and/or the complement of the coding strand of the cDNA 
contained in ATCC Deposit No.Z. 

Description of Table IB (Comprised of Tables 1B.1 and 1B.2) 

25 Table 1B.1 and Table 1B.2 summarize some of the polynucleotides encompassed by 

the invention (including cDNA clones related to the sequences (Clone ID:), contig sequences 
(contig identifier (Contig ID:) and contig nucleotide sequence identifiers (SEQ ID NO:X)) and 
further summarizes certain characteristics of these polynucleotides and the polypeptides encoded 
thereby. The first column of Tables 1B.1 and 1B.2 provide the gene numbers in the application for 

30 each clone identifier. The second column of Tables 1B.1 and 1B.2 provide unique clone 
identifiers, "Clone ID:", for cDNA clones related to each contig sequence disclosed in Table 1A 
and/or Table IB. The third column of Tables 1B.1 and 1B.2 provide unique contig identifiers, 
"Contig ID:" for each of the contig sequences disclosed in these tables. The fourth column of 
Tables 1B.1 and 1B.2 provide the sequence identifiers, "SEQ ID NO:X", for each of the contig 

35 sequences disclosed in Table 1A and/or IB. 

Table 1B.1 
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The fifth column of Table 1B.1, "ORF (Frorn-To)'\ provides the location (i.e., 
nucleotide position numbers) within the polynucleotide sequence of SEQ ID NO:X that delineates 
the preferred open reading frame (ORF) that encodes the amino acid sequence shown in the 
sequence listing and referenced in Table 1B.1 as SEQ ID NO:Y (column 6). Column 7 of Table 
5 1B.1 lists residues comprising predicted epitopes contained in the polypeptides encoded by each of 
the preferred ORFs (SEQ ID NO:Y). Identification of potential immunogenic regions was 
performed according to the method of Jameson and Wolf (CABIOS, 4; 18M86 (1988)); 
specifically, the Genetics Computer Group (GCG) implementation of this algorithm, embodied in 
the program PEPTIDESTRUCTURE (Wisconsin Package vlO.0, Genetics Computer Group 

10 (GCG), Madison, Wise.)- This method returns a measure of the probability that a given residue is 
found on the surface of the protein. Regions where the antigenic index score is greater than 0.9 
over at least 6 amino acids are indicated in Table 1B.1 as "Predicted Epitopes". In particular 
embodiments, polypeptides of the invention comprise, or alternatively consist of, one, two, three, 
four, five or more of the predicted epitopes described in Table 1B.1. It will be appreciated that 

15 depending on the analytical criteria used to predict antigenic determinants, the exact address of the 
determinant may vary slightly. Column 8 of Table 1B.1 ("Cytologic Band") provides the 
chromosomal location of polynucleotides corresponding to SEQ ID NO.X. Chromosomal location 
was determined by finding exact matches to EST and cDNA sequences contained in the NCBI 
(National Center for Biotechnology Information) UniGene database. Given a presumptive 

20 chromosomal location, disease locus association was determined by comparison with the Morbid 
Map, derived from Online MendeJian Inheritance in Man (Online Mendelian Inheritance in Man, 
OMJM™. McKusick-Nathans Institute for Genetic Medicine, Johns Hopkins University 
(Baltimore, MD) and National Center for Biotechnology Information, National Library of 
Medicine (Bethesda, MD) 2000. World Wide Web URL: http://www.ncbi.riln^nih.gov/omim/). If 

25 the putative chromosomal location of the Query overlaps with the chromosomal location of a 
Morbid Map entry, an OMIM identification number is disclosed in Table 1B.1, column 9 labeled 
"OMIM Disease Reference(s)". A key to the OMIM reference identification numbers is provided 
in Table 5. 

Table 1B.2 

30 Column 5 of Table 1B.2, 'Tissue Distribution" shows the expression profile of tissue, 

cells, and/or cell line libraries which express the polynucleotides of the invention. The first code 
number shown in Table IB. 2 column 5 (preceding the colon), represents the tissue/cell source 
identifier code corresponding to the key provided in Table 4. Expression of these polynucleotides 
was not observed in the other tissues and/or cell libraries tested. The second number in column 5 

35 (following the colon), represents the number of times a sequence corresponding to the reference 
polynucleotide sequence (e.g., SEQ ID NO:X) was identified in the corresponding tissue/cell 
source. Those tissue/cell source identifier codes in which the first two letters are "AR" designate 
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information generated using DNA array technology. Utilizing this technology, cDNAs were 
amplified by PCR and then transferred, in duplicate, onto the array. Gene expression was assayed 
through hybridization of first strand cDNA probes to the DNA array. cDNA probes were generated 
from total RNA extracted from a variety of different tissues and cell lines. Probe synthesis was 
5 performed in the presence of 33 P dCTP, using oligo(dT) to prime reverse transcription. After 
hybridization, high stringency washing conditions were employed to remove non-specific hybrids 
from the array. The remaining signal, emanating from each gene target, was measured using a 
Phosphorimager. Gene expression was reported as Phosphor Stimulating Luminescence (PSL) 
which reflects the level of phosphor signal generated from the probe hybridized to each of the gene 

10 targets represented on the array. A local background signal subtraction was performed before the 
total signal generated from each array was used to normalize gene expression between the 
different hybridizations. The value presented after "[array code]:" represents the mean of the 
duplicate values, following background subtraction and probe normalization. One of skill in the 
art could routinely use this information to identify normal and/or diseased tissue(s) which show a 

15 predominant expression pattern of the corresponding polynucleotide of the invention or to identify 
polynucleotides which show predominant and/or specific tissue and/or cell expression. 

Description of Table 1C 

Table 1C summarizes additional polynucleotides encompassed by the invention 
20 (including cDNA clones related to the sequences (Clone ID:), contig sequences (contig identifier 
(Contig ID:) contig nucleotide sequence identifiers (SEQ ID NO:X)), and genomic sequences 
(SEQ ID NO:B). The first column provides a unique clone identifier, "Clone ID:", for a cDNA 
clone related to each contig sequence. The second column provides the sequence identifier, "SEQ 
ID NO:X", for each contig sequence. The third column provides a unique contig identifier, 
25 "Contig ID:" for each contig sequence. The fourth column, provides a BAC identifier "BAC ID 
NO:A" for the BAC clone referenced in the corresponding row of the table. The fifth column 
provides the nucleotide sequence identifier, "SEQ ID NO:B" for a fragment of the BAC clone 
identified in column four of the corresponding row of the table. The sixth column, "Exon From- 
To", provides the location (i.e., nucleotide position numbers) within the polynucleotide sequence 
30 of SEQ ID NO:B which delineate certain polynucleotides of the invention that are also exemplary 
members of polynucleotide sequences that encode polypeptides of the invention (e.g., polypeptides 
containing amino acid sequences encoded by the polynucleotide sequences delineated in column 
six, and fragments and variants thereof). 

35 Description of Table ID 

Table ID: In preferred embodiments, the present invention encompasses a method of 
detecting, preventing, diagnosing, prognosticating, treating, and/or amehorating gastrointestinal 
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diseases or disorders; comprising administering to a patient in which such treatment, prevention, 
or amelioration is desired a protein, nucleic acid, or antibody of the invention (or fragment or 
variant thereof) represented by Table 1A, Table IB, and Table 1C, in an amount effective to 
detect, prevent, diagnose, prognosticate, treat, and/or ameliorate the disease or disorder. 
5 As indicated in Table ID, the polynucleotides, polypeptides, agonists, or antagonists of the 

present invention (including antibodies) can be used in assays to test for one or more biological 
activities. If these polynucleotides and polypeptides do exhibit activity in a particular assay, it is 
likely that these molecules may be involved in the diseases associated with the biological activity. 
Thus, the polynucleotides or polypeptides, or agonists or antagonists thereof (including antibodies) 

10 could be used to treat the associated disease. 

Table ID provides information related to biological activities for polynucleotides and 
polypeptides of the invention (including antibodies, agonists, and/or antagonists thereof). Table 
ID also provides information related to assays which may be used to test polynucleotides and 
polypeptides of the invention (including antibodies, agonists, and/or antagonists thereof) for the 

15 corresponding biological activities. The first column ("Gene No.") provides the gene number in 
the application for each clone identifier. The second column ("cDNA Clone ID:") provides the 
unique clone identifier for each clone as previously described and indicated in Tables 1 A, IB, and 
1C The third column ("AA SEQ ID NO: Y") indicates the Sequence Listing SEQ ID Number for 
polypeptide sequences encoded by the corresponding cDNA clones (also as indicated in Tables 

20 1A, IB, and 2). The fourth column ("Biological Activity") indicates a biological activity 

i 

corresponding to the indicated polypeptides (or polynucleotides encoding said polypeptides). The 
fifth column ("Exemplary Activity Assay") father describes the corresponding biological activity 
and provides information pertaining to the various types of assays which may be performed to test, 
demonstrate, or quantify the corresponding biological activity. Table ID describes the use of 

25 FMAT technology, inter alia, for testing or demonstrating various biological activities. 
Ruorometric microvolume assay technology (FMAT) is a fluorescence-based system which 
provides a means to perform nonradioactive cell- and bead-based assays to detect activation of cell 
signal transduction pathways. This technology was designed specifically for ligand binding and 
immunological assays. Using this technology, fluorescent cells or beads at the bottom of the well 

30 are detected as localized areas of concentrated fluorescence using a data processing system. 
Unbound flurophore comprising the background signal is ignored, allowing for a wide variety of 
homogeneous assays. FMAT technology may be used for peptide ligand binding assays, 
immunofluorescence, apoptosis, cytotoxicity, and bead-based immunocapture assays. See, 
Miraglia S et al., "Homogeneous cell and bead based assays for highthroughput screening using 

35 flourometric microvolume assay technology," Journal of Biomolecular Screening; 4:193-204 
(1999). In particular, FMAT technology may be used to test, confirm, and/or identify the ability of 
polypeptides (including polypeptide fragments and variants) to activate signal transduction 
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pathways. For example, FMAT technology may be used to test, confirm, and/or identify the 
ability of polypeptides to upregulate production of immunomodulatory proteins (such as, for 
example, interleukins, GM-CSF, Rantes, and Tumor Necrosis factors, as well as other cellular 
regulators (e.g. insulin)). 

5 Table ID also describes the use of kinase assays for testing, demonstrating, or quantifying 

biological activity. In this regard, the phosphorylation and de-phosphorylation of specific amino 
acid residues (e.g. Tyrosine, Serine, Threonine) on cell-signal transduction proteins provides a fast, 
reversible means for activation and de-activation of cellular signal transduction pathways. 
Moreover, cell signal transduction via phosphorylation/de-phosphorylation is crucial to the 

10 regulation of a wide variety of cellular processes (e.g. proliferation, differentiation, migration, 
apoptosis, etc.). Accordingly, kinase assays provide a powerful tool useful for testing, confirming, 
and/or identifying polypeptides (including polypeptide fragments and variants) that mediate cell 
signal transduction events via protein phosphorylation. See e.g M Forrer, P., Tamaskovic R., and 
Jaussi, R. "Enzyme-Linked Immunosorbent Assay for Measurement of JNK, ERK, and p38 Kinase 

15 Activities" Biol. Chem. 379(8-9): 1101-1 110 (1998). 

Description of Table 2 

Table 2 surnmarizes homology and features of some of the polypeptides of the invention. 
The first column provides a unique clone identifier, "Clone ID:", corresponding to a cDNA clone 

20 disclosed in Table 1A or Table IB. The second column provides the unique contig identifier, 
"Contig ID:" corresponding to contigs in Table IB and allowing for correlation with the 
information in Table IB. The third column provides the sequence identifier, "SEQ ID NO:X", for 
the contig polynucleotide sequence. The fourth column provides the analysis method by which the 
homology/identity disclosed in the Table was determined. Comparisons were made between 

25 polypeptides encoded by the polynucleotides of the invention and either a non-redundant protein 
database (herein referred to as "NR"), or a database of protein families (herein referred to as 
"PFAM") as further described below. The fifth column provides a description of the PFAM/NR 
hit having a significant match to a polypeptide of the invention. Column six provides the 
accession number of the PFAM/NR hit disclosed in the fifth column. Column seven, 

30 "Score/Percent Identity", provides a quality score or the percent identity, of the hit disclosed in 
columns five and six. Columns 8 and 9, "NT From" and "NT To" respectively, delineate the 
polynucleotides in "SEQ ID NO:X" that encode a polypeptide having a significant match to the 
PFAM/NR database as disclosed in the fifth and sixth columns. In specific embodiments 
polypeptides of the invention comprise, or alternatively consist of, an amino acid sequence 

35 encoded by a polynucleotide in SEQ ID NO:X as delineated in columns 8 and 9, or fragments or 
variants thereof. 

17 



t 



I 



WO 02/076488 PCT/US02/08276 
Description of Table 3 

Table 3 provides polynucleotide sequences that may be disclaimed according to certain 
embodiments of the invention. The first column provides a unique clone identifier, "Clone ID", 
for a cDNA clone related to contig sequences disclosed in Table IB. The second column provides 
5 the sequence identifier, "SEQ ID NO:X", for contig sequences disclosed in Table 1 A and/or Table 
IB. The third column provides the unique contig identifier, "Contig ID:", for contigs disclosed in 
Table IB. The fourth column provides a unique integer 'a* where V is any integer between 1 and 
the final nucleotide minus 15 of SEQ ID NO:X, and the fifth column provides a unique integer 'b' 
where *b' is any integer between 15 and the final nucleotide of SEQ ID NO:X, where both a and b 

10 correspond to the positions of nucleotide residues shown in SEQ ID NO:X, and where b is greater 
than or equal to a + 14. For each of the polynucleotides shown as SEQ ID NO:X, the uniquely 
defined integers can be substituted into the general formula of a-b, and used to describe 
polynucleotides which may be preferably excluded from the invention. In certain embodiments, 
preferably excluded from the invention are at least one, two, three, four, five, ten, or more of the 

15 polynucleotide sequence(s) having the accession number(s) disclosed in the sixth column of this 
Table (including for example, published sequence in connection with a particular B AC clone). Li 
further embodiments, preferably excluded from the invention are the specific polynucleotide 
sequence(s) contained in the clones corresponding to at least one, two, three, four, five, ten, or 
more of the available material having the accession numbers identified in the sixth column of this 

20 Table (including for example, the actual sequence contained in an identified BAC clone). 

Description of Table 4 

Table 4 provides a key to the tissue/cell source identifier code disclosed in Table 1B.2, 
column 5. Column 1 of Table 4 provides the tissue/cell source identifier code disclosed in Table 

25 1B.2, Column 5. Columns 2-5 provide a description of the tissue or cell source. Note that 
"Description" and 'Tissue" sources (i.e. columns 2 and 3) having the prefix "a_" indicates organs, 
tissues, or cells derived from "adult" sources. Codes corresponding to diseased tissues are 
indicated in column 6 with the word "disease." The use of the word "disease" in column 6 is non- 
limiting. The tissue or cell source may be specific (e.g. a neoplasm), or may be disease-associated 

30 (e.g., a tissue sample from a normal portion of a diseased organ). Furthermore, tissues and/or cells 
lacking the "disease" designation may still be derived from sources direcdy or indirectly involved 
in a disease state or disorder, and therefore may have a further utility in that disease state or 
disorder. In numerous cases where the tissue/cell source is a library, column 7 identifies the vector 
used to generate the library. 

35 

Description of Table 5 

Table 5 provides a key to the OMIM reference identification numbers disclosed in Table 
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1B.1, column 9. OMIM reference identification numbers (Column 1) were derived from Online 
Mendelian Inheritance in Man (Online Mendelian Inheritance in Man, OMIM- McKusick-Nathans 
Institute for Genetic Medicine, Johns Hopkins University (Baltimore, MD) and National Center 
for Biotechnology Information, National Library of Medicine, (Bethesda, MD) 2000. World Wide 
5 Web URL: hh^://www.ncbi.nIm.rim.gov/ornim/). Column 2 provides diseases associated with the 
cytologic band disclosed in Table 1B.1, column 8, as deterrnined using the Morbid Map database. 

Description of Table 6 

Table 6 summarizes some of the ATCC Deposits, Deposit dates, and ATCC designation 
10 numbers of deposits made with the ATCC in connection with the present application. These 
deposits were made in addition to those described in the Table 1 A. 

Description of Table 7 

Table 7 shows the cDNA libraries sequenced, and ATCC designation numbers and vector 
15 information relating to these cDNA libraries. 

The first column shows the first four letters indicating the Library from which each library 
clone was derived. The second column indicates the catalogued tissue description for the 
corresponding libraries. The third column indicates the vector containing the corresponding 
clones. The fourth column shows the ATCC deposit designation for each libray clone as indicated 
20 by the deposit information in Table 6. 

Definitions 

The following definitions are provided to facilitate understanding of certain terms used 
25 throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is altered "by the 
hand of man" from its natural state. For example, an isolated polynucleotide could be part of a 
vector or a composition of matter, or could be contained within a cell, and still be "isolated" 
30 because that vector, composition of matter, or particular cell is not' the original environment of the 
polynucleotide. The term "isolated" does not refer to genomic or cDNA libraries, whole cell total 
or mRNA preparations, genomic DNA preparations (including those separated by electrophoresis 
and transferred onto blots), sheared whole cell genomic DNA preparations or other compositions 
where the art demonstrates no distinguishing features of the polynucleotide/sequences of the 
35 present invention. 

In the present invention, a "secreted" protein refers to those proteins capable of being 
directed to the ER, secretory vesicles, or the extracellular space as a result of a signal sequence, as 
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well as those proteins released into the extracellular space without necessarily containing a signal 
sequence. If the secreted protein is released into the extracellular space, the secreted protein can 
undergo extracellular processing to produce a "mature" protein. Release into the extracellular 
space can occur by many mechanisms, including exocytosis and proteolytic cleavage. 
5 As used herein, a "polynucleotide" refers to a molecule having a nucleic acid sequence 

encoding SEQ ID NO:Y or a fragment or variant thereof (e.g., the polypeptide delinated in 
columns fourteen and fifteen of Table 1 A); a nucleic acid sequence contained in SEQ ID NO:X (as 
described in column 5 of Table 1A and/or column 3 of Table IB) or the complement thereof; a 
cDNA sequence contained in Clone ID: (as described in column 2 of Table 1A and/or Table IB 

10 and contained within a library deposited with the ATCC); a nucleotide sequence encoding the 
polypeptide encoded by a nucleotide sequence in SEQ ID NO:B as defined in column 6 (EXON 
From-To) of Table 1C or a fragment or variant thereof; or a nucleotide coding sequence in SEQ ID 
NO:B as defined in column 6 of Table 1C or the complement thereof. For example, the 
polynucleotide can contain the nucleotide sequence of the full length cDNA sequence, including 

15 the 5' and 3' untranslated sequences, the coding region, as well as fragments, epitopes, domains, 
and variants of the nucleic acid sequence. Moreover, as used herein, a "polypeptide" refers to a 
molecule having an amino acid sequence encoded by a polynucleotide of the invention as broadly 
defined (obviously excluding poly-Phenylalanine or poiy-Lysine peptide sequences which result 
from translation of a polyA tail of a sequence corresponding to a cDNA). 

20 In the present invention, "SEQ ID NO:X" was often generated by overlapping sequences 

contained in multiple clones (contig analysis). A representative clone containing all or most of the 
sequence for SEQ ID NO:X is deposited at Human Genome Sciences, Inc. (HGS) in a catalogued 
and archived library. As shown, for example, in column 2 of Table IB, each clone is identified by 
a cDNA Clone ID (identifier generally referred to herein as Clone ID:). Each Clone ID is unique to 

25 an individual clone and the Clone ID is all the information needed to retrieve a given clone from 
the HGS library. Table 7 provides a list of the deposited cDNA libraries. One can use the Clone 
ID: to determine the library source by reference to Tables 6 and 7. Table 7 lists the deposited 
cDNA libraries by name and links each library to an ATCC Deposit. Library names contain four 
characters, for example, "HTWE." The name of a cDNA clone (Clone ID) isolated from that 

30 library begins with the same four characters, for example "HTWEP07". As mentioned below, 
Table 1A and/or Table IB correlates the Clone ID names with SEQ ID NO:X. Thus, starting with 
an SEQ ID NO:X, one can use Tables 1 A, IB, 6, 7, and 9 to determine the corresponding Clone 
ID, which library it came from and which ATCC deposit the library is contained in. Furthermore, 
it is possible to retrieve a given cDNA clone from the source library by techniques known in the 

35 art and described elsewhere herein. The ATCC is located at 10801 University Boulevard, 

■ 

Manassas, Virginia 201 10-2209, USA. The ATCC deposits were made pursuant to the terms of the 
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Budapest Treaty on the international recognition of the deposit of microorganisms for the purposes 
of patent procedure. 

In specific embodiments, the polynucleotides of the invention are at least 15, at least 30, at 
least 50, at least 100, at least 125, at least 500, or at least 1000 continuous nucleotides but are less 
5 than or equal to 300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, 7.5kb, 5 kb, 2.5 kb, 2.0 kb, or 1 kb, in 
length. In a further embodiment, polynucleotides of the invention comprise a portion of the 
coding sequences, as disclosed herein, but do not comprise all or a portion of any intron. In 
another embodiment, the polynucleotides comprising coding sequences do not contain coding 
sequences of a genomic flanking gene (i.e., 5* or 3* to the gene of interest in the genome). In other 

10 embodiments, the polynucleotides of the invention do not contain the coding sequence of more 
than 1000, 500, 250, 100, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 1 genomic flanking gene(s). 

A "polynucleotide" of the present invention also includes those polynucleotides capable of 
hybridizing, under stringent hybridization conditions, to sequences contained in SEQ ID NO:X, or 
the complement thereof (e.g., the complement of any one, two, three, four, or more of the 

15 polynucleotide fragments described herein), the polynucleotide sequence delineated in columns 7 
and 8 of Table 1 A or the complement thereof, the polynucleotide sequence delineated in columns 
8 and 9 of Table 2 or the complement thereof, and/or cDNA sequences contained in Clone ID: 
(e.g., the complement of any one, two, three, four, or more of the polynucleotide fragments, or the 
cDNA clone within the pool of cDNA clones deposited with the ATCC, described herein), and/or 

20 the polynucleotide sequence delineated in column 6 of Table 1C or the complement thereof. 
"Stringent hybridization conditions" refers to an overnight incubation at 42 degree C in a solution 
comprising 50% formarnide, 5x SSC (750 mM NaCl, 75 mM trisodium citrate), 50 mM sodium 
phosphate (pH 7.6), 5x Denbardt's solution, 10% dextran sulfate, and 20 /ig/ml denatured, sheared 
salmon sperm DNA, followed by washing the fdters in O.lx SSC at about 65 degree C. 

25 Also contemplated are nucleic acid molecules that hybridize to the polynucleotides of the 

present invention at lower stringency hybridization conditions. Changes in the stringency of 
hybridization and signal detection are primarily accomplished through the manipulation of 
formarnide concentration (lower percentages of formarnide result in lowered stringency); salt 
conditions, or temperature. For example, lower stringency conditions include an overnight 

30 incubation at 37 degree C in a solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M 
NaH 2 P0 4 ; 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formarnide, 100 ug/ml salmon sperm blocking 
DNA; followed by washes at 50 degree C with 1XSSPE, 0.1% SDS. In addition, to achieve even 
lower stringency, washes performed following stringent hybridization can be done at higher salt 
concentrations (e.g. 5X SSC). 

35 Note that variations in the above conditions may be accomplished through the inclusion 

and/or substitution of alternate blocking reagents used to suppress background in hybridization 
experiments. Typical blocking reagents include Denhardt's reagent, BLOTTO, heparin, denatured 
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salmon sperm DNA, and commercially available proprietary formulations. The inclusion of 
specific blocking reagents may require modification of the hybridization conditions described 
above, due to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as any 3* 
5 terminal polyA+ tract of a cDNA shown in the sequence listing), or to a complementary stretch of 
T (or U) residues, would not be included in the definition of "polynucleotide," since such a 
polynucleotide would hybridize to any nucleic acid molecule containing a poly (A) stretch or the 
complement thereof (e.g., practically any double-stranded cDNA clone generated using oligo dT 
as a primer). 

10 The polynucleotide of the present invention can be composed of any polyribonucleotide or 

polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. For 
example, polynucleotides can be composed of single- and double-stranded DNA, DNA that is a 
mixture of single- and double-stranded regions, single- and double-stranded RNA, and RNA that 
is mixture of single- and double-stranded regions, hybrid molecules comprising DNA and RNA 

15 that may be single-stranded or, more typically, double-stranded or a mixture of single- and double- 
stranded regions. In addition, the polynucleotide can be composed of triple-stranded regions 
comprising RNA or DNA or both RNA and DNA. A polynucleotide may also contain one or 
more modified bases or DNA or RNA backbones modified for stability or for other reasons. 
"Modified" bases include, for example, tritylated bases and unusual bases such as inosine. A 

20 variety of modifications can be made to DNA and RNA; thus, "polynucleotide" embraces 
chemically, enzymatically, or metabolically modified forms. 

In specific embodiments, the polynucleotides of the invention are at least 15, at least 30, at 
least 50, at least 100, at least 125, at least 500, or at least 1000 continuous nucleotides but are less 
than or equal to 300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, 7.5kb, 5 kb, 2.5 kb, 2.0 kb, or 1 kb, in 

25 length. In a further embodiment, polynucleotides of the invention comprise a portion of the 
coding sequences, as disclosed herein, but do not comprise all or a portion of any intron. In 
another embodiment, the polynucleotides comprising coding sequences do not contain coding 
sequences of a genomic flanking gene (i.e., 5' or 3 1 to the gene of interest in the genome). In other 
embodiments, the polynucleotides of the invention do not contain the coding sequence of more 

30 than 1000, 500, 250, 100, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 1 genomic flanking gene(s). 

"SEQ ID NO:X" refers to a polynucleotide sequence described in column 5 of Table 1A, 
while "SEQ ID NO.Y" refers to a polypeptide sequence described in column 10 of Table 1A. SEQ 
ID NO:X is identified by an integer specified in column 6 of Table 1A The polypeptide sequence 
SEQ ID NO:Y is a translated open reading frame (ORF) encoded by polynucleotide SEQ ID 

35 NO:X. The polynucleotide sequences are shown in the sequence listing immediately followed by 
all of the polypeptide sequences. Thus, a polypeptide sequence corresponding to polynucleotide 
sequence SEQ ID NO:2 is the first polypeptide sequence shown in the sequence listing. The 
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second polypeptide sequence corresponds to the polynucleotide sequence shown as SEQ ID NO:3, 
and so on. 

The polypeptide of the present invention can be composed of amino acids joined to each 
other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may contain amino 
5 acids other than the 20 gene-encoded amino acids. The polypeptides may be modified by either 
natural processes, such as posttranslational processing, or by chemical modification techniques 
which are well known in the art. Such modifications are well described in basic texts and in more 
detailed monographs, as well as in a voluminous research literature. Modifications can occur 
anywhere in a polypeptide, including the peptide backbone, the arnino acid side-chains and the 

10 amino or carboxyl termini. It will be appreciated that the same type of modification may be 
present in the same or varying degrees at several sites in a given polypeptide. Also, a given 
polypeptide may contain many types of modifications. Polypeptides may be branched, for 
example, as a result of ubiquitination, and they may be cyclic, with or without branching. Cyclic, 
branched, and branched cyclic polypeptides may result from posttranslation natural processes or 

15 may be made by synthetic methods. Modifications include acetylation, acylation, ADP- 
ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme moiety, 
covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid 
derivative, covalent attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-links, formation of cysteine, formation of 

20 pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor formation, 
hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation, proteolytic 
processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA 
mediated addition of amino acids to proteins such as arginylation, and ubiquitination. (See, for 
instance, PROTEINS - STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. 

25 Creighton, W. H. Freeman and Company, New York (1993); POSTTRANSLATIONAL 
COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic Press, New 
York, pgs. 1-12 (1983); Seifter et al., Meth. EnzymoL 182:626-646 (1990); Rattan et al. t Ann. 
N.Y. Acad. Sci. 663:48-62 (1992)). 

"SEQ ID NO:X" refers to a polynucleotide sequence described, for example, in Tables 1 A, 

30 Table IB, or Table 2, while "SEQ ID NO:Y" refers to a polypeptide sequence described in column 
1 1 of Table 1A and or column 6 of Table 1B.1. SEQ ID NO:X is identified by an integer specified 
in column 4 of Table IB. The polypeptide sequence SEQ ID NO:Y is a translated open reading 
frame (ORF) encoded by polynucleotide SEQ ID NO:X. "Clone ID:" refers to a cDNA clone 
described in column 2 of Table 1 A and/or IB. 

35 "A polypeptide having functional activity" refers to a polypeptide capable of displaying 

one or more known functional activities associated with a full-length (complete) protein. Such 
functional activities include, but are not limited to, biological activity (e.g. activity useful in 
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treating, preventing and/or ameliorating gastrointestinal diseases and disorders), antigenicity 
(ability to bind [or compete with a polypeptide for binding] to an anti-polypeptide antibody), 
immunogenicity (ability to generate antibody which binds to a specific polypeptide of the 
invention), ability to form multimers with polypeptides of the invention, and ability to bind to a 
5 receptor or ligand for a polypeptide. 

The polypeptides of the invention can be assayed for functional activity (e.g. biological 
activity) using or routinely modifying assays known in the art, as well as assays described herein. 
Specifically, one of skill in the art may routinely assay secreted polypeptides (including fragments 
and variants) of the invention for activity using assays as described in the examples section below. 

10 "A polypeptide having biological activity" refers to a polypeptide exhibiting activity 

similar to, but not necessarily identical to, an activity of a polypeptide of the present invention, 
including mature forms, as measured in a particular biological assay, with or without dose 
dependency. In the case where dose dependency does exist, it need not be identical to that of the 
polypeptide, but rather substantially similar to the dose-dependence in a given activity as 

15 compared to the polypeptide of the present invention (i.e., the candidate polypeptide will exhibit 
greater activity or not more than about 25-fold less and, preferably, not more than about tenfold 
less activity, and most preferably, not more than about three-fold less activity relative to the 
polypeptide of the present invention). 

TABLES: 

Table 1A 

Table 1A summarizes information concerning certain polypnucleotides and polypeptides 
of the invention. The first column provides the gene number in the application for each clone 
identifier. The second column provides a unique clone identifier, "Clone ID:", for a cDNA clone 
related to each contig sequence disclosed in Table 1 A. Third column, the cDNA Clones identified 
in the second column were deposited as indicated in the third column (i.e. by ATCC Deposit No:Z 
and deposit date). Some of the deposits contain multiple different clones corresponding to the 
same gene. In the fourth column, "Vector" refers to the type of vector contained in the 
corresponding cDNA Clone identified in the second column. In the fifth column, the nucleotide 
sequence identified as "NT SEQ ID NO:X" was assembled from partially homologous 
("overlapping") sequences obtained from the corresponding cDNA clone identified in the second 
column and, in some cases, from additional related cDNA clones. The overlapping sequences 
were assembled into a single contiguous sequence of high redundancy (usually three to five 
overlapping sequences at each nucleotide position), resulting in a final sequence identified as SEQ 
ID NO:X. In the sixth column, 'Total NT Seq." refers to the total number of nucleotides in the 
contig sequence identified as SEQ ID NOrX." The deposited clone may contain all or most of 
these sequences, reflected by the nucleotide position indicated as "5' NT of Clone Seq." (seventh 
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column) and the "3* NT of Clone Seq ." (eighth column) of SEQ ID NO:X. In the ninth column, 
the nucleotide position of SEQ ID NO:X of the putative start codon (methionine) is identified as 
"5' NT of Start Codon." Similarly , in column ten, the nucleotide position of SEQ ID NO:X of the 
predicted signal sequence is identified as "5' NT of First AA of Signal Pep." In the eleventh 
5 column, the translated amino acid sequence, beginning with the methionine, is identified as "AA 
SEQ ID NO:Y," although other reading frames can also be routinely translated using known 
molecular biology techniques. The polypeptides produced by these alternative open reading 
frames are specifically contemplated by the present invention. 

In the twelfth and thirteenth columns of Table 1A, the first and last amino acid position of 

10 SEQ ID NO: Y of the predicted signal peptide is identified as "First AA of Sig Pep" and "Last AA 
of Sig Pep." In the fourteenth column, the predicted first amino acid position of SEQ ID NO:Y of 
the secreted portion is identified as "Predicted First AA of Secreted Portion". The amino acid 
position of SEQ ID NO:Y of the last amino acid encoded by the open reading frame is identified in 
the fifteenth column as "Last AA of ORF\ 

15 SEQ ID NO:X (where X may be any of the polynucleotide sequences disclosed in the 

sequence listing) and the translated SEQ ID NO:Y (where Y may be any of the polypeptide 
sequences disclosed in the sequence listing) are sufficiently accurate and otherwise suitable for a 
variety of uses well known in the art and described further below. For instance, SEQ ID NO:X is 
useful for designing nucleic acid hybridization probes that will detect nucleic acid sequences 

20 contained in SEQ ID NO:X or the cDNA contained in the deposited clone. These probes will also 
hybridize to nucleic acid molecules in biological samples, thereby enabling a variety of forensic 
and diagnostic methods of the invention. Similarly, polypeptides identified from SEQ ID NO:Y 
may be used, for example, to generate antibodies which bind specifically to proteins containing the 
polypeptides and the secreted proteins encoded by the cDNA clones identified in Table 1 A and/or 

25 elsewhere herein 

Nevertheless, DNA sequences generated by sequencing reactions can contain sequencing 
errors. The errors exist as misidentified nucleotides, or as insertions or deletions of nucleotides in 
the generated DNA sequence. The erroneously inserted or deleted nucleotides cause frame shifts 
in the reading frames of the predicted amino acid sequence. In these cases, the predicted amino 

30 acid sequence diverges from the actual amino acid sequence, even though the generated DNA 
sequence may be greater than 99.9% identical to the actual DNA sequence (for example, one base 
insertion or deletion in an open reading frame of over 1000 bases). 

Accordingly, for those applications requiring precision in the nucleotide sequence or the 
amino acid sequence, the present invention provides not only the generated nucleotide sequence 

35 identified as SEQ ID NO:X, and the predicted translated amino acid sequence identified as SEQ 
ID NO:Y, but also a sample of plasmid DNA containing a human cDNA of the invention 
deposited with the ATCC, as set forth in Table 1A. The nucleotide sequence of each deposited 
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plasmid can readily be determined by sequencing the deposited plasmid in accordance with known 
methods 

The predicted amino acid sequence can then be verified from such deposits. Moreover, 
the amino acid sequence of the protein encoded by a particular plasmid can also be directly 
5 determined by peptide sequencing or by expressing the protein in a suitable host cell containing 
the deposited human cDNA, collecting the protein, and determining its sequence. 

Also provided in Table 1 A is the name of the vector which contains the cDNA plasmid. 
Each vector is routinely used in the art. The following additional information is provided for 
convenience. 

10 Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap XR (U.S. 

Patent Nos. 5,128, 256 and 5,286,636), Zap Express (U.S. Patent Nos. 5,128,256 and 5,286,636), 
pBluescript (pBS) (Short, J. M. et aL, Nucleic Acids Res. 76:7583-7600 (1988); Alting-Mees, M 
A. and Short, J. M., Nucleic Acids Res. 77:9494 (1989)) and pBK (Alting-Mees, M. A. et aL, 
Strategies 5:58-61 (1992)) are commercially available from Stratagene Cloning Systems, Inc., 

15 1 101 1 N. Torrey Pines Road, La Jolla, CA, 92037. pBS contains an ampicillin resistance gene and 
pBK contains a neomycin resistance gene. Phagemid pBS may be excised from the Lambda Zap 
and Uni-Zap XR vectors, and phagemid pBK may be excised from the Zap Express vector. Both 
phagemids may be transformed into E. coli strain XL-l Blue, also available from Stratagene 

Vectors pSportl, pCMVSport 1.0, pCMVSport 2.0 and pCMVSport 3.0, were obtained 

20 from Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors contain 
an ampicillin resistance gene and may be transformed into E. coli strain DH10B, also available 
from Life Technologies. See, for instance, Gruber, C. E., et al., Focus 75:59 (1993). Vector 
lafmid BA (Bento Soares, Columbia University, New York, NY) contains an ampicillin resistance 
gene and can be transformed into E. coli strain XL-1 Blue. Vector pCR®2.1, which is available 

25 from Invitrogen, 1600 Faraday Avenue, Carlsbad, CA 92008, contains an ampicillin resistance 
gene and may be transformed into E coli strain DH10B, available from Life Technologies. See, 
for instance, Clark, J. M., Nuc. Acids Res. 76:9677-9686 (1988) and Mead, D. et al t 
Bio/Technology 9: (1991). 

The present invention also relates to the genes corresponding to SEQ ID NO.X, SEQ ID 

30 NO:Y, and/or a deposited cDNA (cDNA Clone ID). The corresponding gene can be isolated in 
accordance with known methods using the sequence information disclosed herein. Such methods 
include, but are not limited to, preparing probes or primers from the disclosed sequence and 
identifying or amplifying the corresponding gene from appropriate sources of genomic material. 

Also provided in the present invention are allelic variants, orthologs, and/or species 

35 homologs. Procedures known in the art can be used to obtain full-length genes, allelic variants, 
splice variants, full-length coding portions, orthologs, and/or species homologs of genes 
corresponding to SEQ ID NO:X and SEQ ID NO:Y using information from the sequences 
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disclosed herein or the clones deposited with the ATCC. For example, allelic variants and/or 
species homologs may be isolated and identified by making suitable probes or primers from the 
sequences provided herein and screening a suitable nucleic acid source for allelic variants and/or 
the desired homologue. 

5 The present invention provides a polynucleotide comprising, or alternatively consisting of, 

the nucleic acid sequence of SEQ ID NO:X and/or a cDNA contained in ATCC Deposit No.Z. 
The present invention also provides a polypeptide comprising, or alternatively, consisting of, the 
polypeptide sequence of SEQ ID NO:Y, a polypeptide encoded by SEQ ID NO:X, and/or a 
polypeptide encoded by a cDNA contained in ATCC deposit No.Z. Polynucleotides encoding a 

10 polypeptide comprising, or alternatively consisting of the polypeptide sequence of SEQ ID NO: Y, 
a polypeptide encoded by SEQ ID NO:X and/or a polypeptide encoded by the cDNA contained in 
ATCC Deposit No.Z, are also encompassed by the invention. The present invention further 
encompasses a polynucleotide comprising, or alternatively consisting of the complement of the 
nucleic acid sequence of SEQ ID NO:X, and/or the complement of the coding strand of the cDNA 

1 5 contained in ATCC Deposit No.Z. 
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Table IB (Comprised of Tables 1B.1 and 1B.2) 

The first column in Table 1B.1 and Table IB .2 provides the gene number in the 
application corresponding to the clone identifier. The second column in Table 1B.1 and Table 1B.2 
provides a unique "Clone ID:" for the cDNA clone related to each contig sequence disclosed in Table 

5 1B.1 and Table 1B.2. This clone ID references the cDNA clone which contains at least the 5' most 
sequence of the assembled contig and at least a portion of SEQ ID NO:X as determined by directly 
sequencing the referenced clone. The referenced clone may have more sequence than described in the 
sequence listing or the clone may have less. In the vast majority of cases, however, the clone is 
believed to encode a full-length polypeptide. In the case where a clone is not full-length, a full-length 

L0 cDNA can be obtained by methods described elsewhere herein. The third column in Table 1B.1 and 
Table IB. 2 provides a unique "Contig ID" identification for each contig sequence. The fourth column 
in Table 1B.1 and Table 1B.2 provides the "SEQ ID NO:" identifier for each of the contig 
polynucleotide sequences disclosed in Table IB. 
Table 1B.1 

15 The fifth column in Table 1B.1, "ORF (From-To)", provides the location (i.e., nucleotide 

position numbers) within the polynucleotide sequence "SEQ ID NO:X" that delineate the preferred 
open reading frame (ORF) shown in the sequence listing and referenced in Table 1B.1, column 6, as 
SEQ ID NO:Y. Where the nucleotide position number 'To" is lower than the nucleotide position 
number "From", the preferred ORF is the reverse complement of the referenced polynucleotide 

20 sequence. The sixth column in Table 1B.1 provides the corresponding SEQ ID NO:Y for the 
polypeptide sequence encoded by the preferred ORF delineated in column 5. In one embodiment, the 
invention provides an amino acid sequence comprising, or alternatively consisting of, a polypeptide 
encoded by the portion of SEQ ID NO:X delineated by "ORF (From-To)". Also provided are 
polynucleotides encoding such amino acid sequences and the complementary strand thereto. Column 

25 7 in Table 1B.1 lists residues comprising epitopes contained in the polypeptides encoded by the 
preferred ORF (SEQ ID NO:Y), as predicted using the algorithm of Jameson and Wolf, (1988) Comp. 
Appl. Biosci. 4:181-186. The Jameson-Wolf antigenic analysis was performed using the computer 
program PROTEAN (Version 3.11 for the Power Macintosh, DNASTAR, Inc., 1228 South Park 
Street Madison, WI). In specific embodiments, polypeptides of the invention comprise, or 

30 alternatively consist of, at least one, two, three, four, five or more of the predicted epitopes as 
described in Table IB. It will be appreciated that depending on the analytical criteria used to predict 
antigenic determinants, the exact address of the determinant may vary slightly. 

Column 8 in Table 1B.1 provides a chromosomal map location for certain 
polynucleotides of the invention. Chromosomal location was determined by finding exact matches to 
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EST and cDNA sequences contained in the NCBI (National Center for Biotechnology Information) 
UniGene database. Each sequence in the UniGene database is assigned to a "cluster"; all of the ESTs, 
cDNAs, and STSs in a cluster are believed to be derived from a single gene. Chromosomal mapping 
data is often available for one or more sequence(s) in a UniGene cluster; this data (if consistent) is 

5 then applied to the cluster as a whole. Thus, it is possible to infer the chromosomal location of a new 
polynucleotide sequence by detenriining its identity with a mapped UniGene cluster. 

A modified version of the computer program BLASTN (Altshul, et aL, J. Mol. BioL 
215:403-410 (1990), and Gish, and States, Nat. Genet 3:266-272) (1993) was used to search the 
UniGene database for EST or cDNA sequences that contain exact or near-exact matches to a 

10 polynucleotide sequence of the invention (the 'Query')- A sequence from the UniGene database (the 
Subject') was said to be an exact match if it contained a segment of 50 nucleotides in length such that 
48 of those nucleotides were in the same order as found in the Query sequence. If all of the matches 
that met this criteria were in the same UniGene cluster, and mapping data was available for this 
cluster, it is indicated in Table IB under the heading "Cytologic Band". Where a cluster had been 

15 further localized to a distinct cytologic band, that band is disclosed; where no banding information 
was available, but the gene had been localized to a single chromosome, the chromosome is disclosed. 

Once a presumptive chromosomal location was determined for a polynucleotide of the 
invention, an associated disease locus was identified by comparison with a database of diseases which 
have been experimentally associated with genetic loci. The database used was the Morbid Map, 

20 derived from OMIM™ and National Center for Biotechnology Information, National Library of 
Medicine (Bethesda, MD) 2000;. If the putative chromosomal location of a polynucleotide of the 
invention (Query sequence) was associated with a disease in the Morbid Map database, an OMIM 
reference identification number was noted in column 9, Table 1B.1, labelled "OMIM Disease 
Reference(s). Table 5 is a key to the OMIM reference identification numbers (column 1), and 

25 provides a description of the associated disease in Column 2. 

Table 1B.2 

Column 5, in Table 1B.2, provides an expression profile and library code.count for each of 
the contig sequences (SEQ ID NO:X) disclosed in Table IB, which can routinely be combined with 
the information provided in Table 4 and used to determine the tissues, cells, and/or cell line libraries 
30 which predominantly express the polynucleotides of the invention. The first number in Table 1B.2, 
column 5 (preceding the colon), represents the tissue/cell source identifier code corresponding to the 
code and description provided in Table 4. The second number in column 5 (following the colon) 
represents the number of times a sequence corresponding to the reference polynucleotide sequence 
was identified in the corresponding tissue/cell source. Those tissue/cell source identifier codes, in 
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which the first two letters are "AR" designate information generated using DNA array technology. 
Utilizing this technology, cDNAs were amplified by PCR and then transferred, in duplicate, onto the 
array. Gene expression was assayed through hybridization of first strand cDNA probes to the DNA 
array. cDNA probes were generated from total RNA extracted from a variety of different tissues and 
cell lines. Probe synthesis was performed in the presence of 33 P dCTP, using oligo (dT) to prime 
reverse transcription. After hybridization, high stringency washing conditions were employed to 
remove non-specific hybrids from the array. The remaining signal, emanating from each gene target, 
was measured using a Phosphorimager. Gene expression was reported as Phosphor Stimulating 
Luminescence (PSL) which reflects the level of phosphor signal generated from the probe hybridized 
to each of the gene targets represented on the array. A local background signal subtraction was 
performed before the total signal generated from each array was used to normalize gene expression 
between the different hybridizations. The value presented after "[array code]:" represents the mean of 
the duplicate values, following background subtraction and probe normalization. One of skill in the 
art could routinely use this information to identify normal and/or diseased tissue(s) which show a 
predominant expression pattern of the corresponding polynucleotide of the invention or to identify 
polynucleotides which show predominant and/or specific tissue and/or cell expression. 
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Table 1C summarizes additional polynucleotides encompassed by the invention 
(including cDNA clones related to the sequences (Clone ID:), contig sequences (contig identifier 
(Contig ID:) contig nucleotide sequence identifiers (SEQ ID NO:X)), and genomic sequences 

5 (SEQ ID NO:B). The first column provides a unique clone identifier, "Clone ID:", for a cDNA 
clone related to each contig sequence. The second column provides the sequence identifier, "SEQ 
ID NO:X", for each contig sequence. The third column provides a unique contig identifier, 
"Contig ID:" for each contig sequence. The fourth column, provides a BAC identifier "BAC ID 
NO:A" for the BAC clone referenced in the corresponding row of the table. The fifth column 

10 provides the nucleotide sequence identifier, "SEQ ID NO:B" for a fragment of the BAC clone 
identified in column four of the corresponding row of the table. The sixth column, "Exon From- 
To", provides the location (i.e., nucleotide position numbers) within the polynucleotide sequence 
of SEQ ID NO:B which delineate certain polynucleotides of the invention that are also exemplary 
members of polynucleotide sequences that encode polypeptides of the invention (e.g., polypeptides 

15 containing amino acid sequences encoded by the polynucleotide sequences delineated in column 
six, and fragments and variants thereof). 



Table 1C 



cDNA Clone 


SEQ ID 


CONTIG ID: 


BAC ID: A 


SEQ ED 


EXON 


ID 


NO:X 






NO:B 


From-To 


HAUA183 


22 

r 


639009 


AC010422 

■ 


589 


1-326 
1552-2084 
2162-2261 
2300-2427 
4485-5868 
5948-6362 
7914-8017 
8569-8681 
8765-8875 
8968-9037 
9284-9499 
9742-9910 
10837-11187 
11271-11321 
11554-11707 
11783-12766 
12866-13225 
13256-13827 
14284-14367 
14890-15090 


HAUAI83 


22 


639009 


AC018761 


590 


1-326 
1176-1284 
1552-2084 
2162-2261 
2300-2426 
4485-5868 
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5948-6362 
8569-8681 
8765-8875 
8968-9037 
9284-9499 
9742-9910 
10317-10501 
1083741187 
11271-11321 
11554-11707 
11783-12766 
12866-13225 
13256-13827 
14284-14367 
14890-15090 


HAUAI83 


22 


639009 


AC010422 


591 


1-315 
2004-2289 
2650-2741 
3554-3830 


HAUAI83 


22 


639009 


AC010422 


592 


1-202 
938-1047 
1219-1395 
1758-1956 
2907-3429 
3792-3935 
5366-5485 
6391-6688 
6899-7269 
7890-8316 
8400-8524 
8607-8682 
8824-8999 
9190-9302 
9691-9796 
10106-10177 
10571-11051 
11164-11490 
12565-12696 
13364-13501 
13964-14592 
14740-15540 
15610-15798 
15947-16642 
16717-16832 
16968-17408 
17521-17612 
18331-18579 
19120-19303 
1935849514 
19599-19702 
20003-20245 


HAUAI83 


22 


639009 


AC018761 


593 


1-202 
938-1047 
1219-1395 
1758-1956 



144 



1 



WO 02/076488 PCT/US02/08276 













2907-3429 
3792-3935 
5366-5485 
6391-6688 
6899-7269 
7591-7711 
7890-8316 
8400-8524 
8607-8682 
8749-9073 
9190-9302 
9691-9796 


HAUAI83 


22 


639009 

i 


AC018761 


594 


1-82 
128-293 
1178-1447 
1986-2278 
2457-2711 
3543-3844 


HBINS58 


26 


1352386 


AL096774 


595 


1-1023 
2010-2239 
2581-2962 
3153-3223 
3324-3493 
3973^126 


HBINS58 


26 


1352386 


AL096774 


596 


1-341 


HB1NS58 


26 


1352386 


AL096774 


597 


1-142 


HCE3G69 


29 


728432 


AC068946 


598 


1-108 
1186-1324 
1746-1835 
2138-2284 
2448-2545 
2718-2861 
3091-5889 


HCE3G69 


29 


728432 


ACX)68946 


599 


1-191 


HCE3G69 


29 


728432 


AC068946 


600 


1-686 


HCHFB80 


31 


1143407 


AL022327 


601 


1-2271 
3506-3658 
4643^4810 
9039-9164 
9382-9509 
10587-10720 
11135-11195 
11265-11716 
14644-15466 
17451-17526 
18012-18114 
20530-20632 
20957-21009 
23696-23785 
25338-25575 
25969-26166 


HCNDR47 


34 


1016919 


AL122003 


602 


1-236 
531-696 
787-817 
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863-4508 
5158-5744 
6949-7029 


HCNDR47 


34 


1016919 


AL122003 


603 


1-888 
1304-2003 
2830-3284 
3719-4571 
4618-5268 
6131-6557 

. 9058-9726 
14176-14480 
18456-18915 
18960-19871 
22365-22454 
23082-23248 
28058-28215 


HDPGT01 


44 


771583 


AC020978 


604 


1-180 
2776-2899 
39164077 
42964335 
4436-4632 
4895-5181 

5l5J-824o 

yj4/-yooo 
9907-10007 

LVD /U-IUOlo 

10788-11046 

1 1096-1^49 3 

13764-15689 


HDPGT01 


44 


771583 


AC020978 


605 


1-384 


HDPSB18 


50 


1043263 


AL355512 


606 


1-2572 


HDPSB18 


50 


1043263 


AC006 


176 


607 


1-2571 
3048-3872 


HDPSB18 


50 


1043263 


AL355512 


608 


1-280 


HDJrXiUl 


OJ 


QHCIC\A Q 


AL354000 


■ 


1 mo 
o*f o-h y 1 Z> 

6561-6654 


r\\J±j\. X yJx 


jj 




AL035362 


610 


l lJ iU 

4844-4971 
5225-5596 
6557-6650 


HDPXY01 


55 


879048 


AL354000 


611 


1-460 


HDPXY01 


55 


879048 


AL354 




612 


1-400 


HDPXY01 


55 


879048 


AL035362 


613 


MOO 


HDPXY01 


55 


879048 


AL035362 


614 


M60 


HHGCG53 


80 


340818 


AC024037 


615 


1-518 


HHGCM76 


81 


662329 


AC003665 


616 


1-70 
304-609 
900-1090 
1240-1835 
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2272-2490 
2581-3598 


HHGCM76 


81 


662329 


AC003665 


617 


1-580 
851-995 
1224-1296 
1314-1663 
1930-1973 
2724-2905 

2968-3098 

loci ono 
3Z©3-33zo 

*\101 ^OQH 
31Z1-3Z3U 

333i-3uoy 


TTT A f^CVXCi 


54 




ALA/lo3IZ 


A1 R 


1 OOA 

1-/ /o 


TTT A pnOfl 

riJ A^vj3U 


54 




Amoo^fK 

At^UZZ3 U3 


A1G 

oiy 


1-5 to 


xjt A/TiQn 
ID Al^ljr3U 


54 


5y33U3 


AI-AJUZ3 lo 


aoo 
ozu 


1-1 jU 


TTT TTDfl/l 


11D 


lUO /JJJ 


AvAlll/431 


A0 1 
OZl 


1 1 ooo 

i-izyy 


TTT TTDQ/I 


inc 
1UD 


lUo /333 


ACUU / 4 3 1 


AO 0 
DZZ 


1 Hf\ 
1-33U 


JHJV15JLIL3 / 


11^ 
1 13 


y /3yyo 


AmiOORA 
ALAJ1ZU50 


AO^ 
OZ3 


1-33Z5 




113 


y /3yyo 


Arm cc 1 1 

Al^ulooll 1 


AO/1 
OZ4 


1-3U3 1 


T_r\/f CTYT 1*7 


113 


y /jyyo 


ACUl<S4y4 


AO^ 
OZ3 


1 <*rvoo 
i-3uzy 


rlJYloJLiLi / 


113 


y /3yyo 


AL/U1ZU50 


AO A 
OZO 


1 OO/l 

1-ZZ4 


T_T\yfOT*T T7 

HMoJL>.L3 / 


113 


y foyyo 


A /"Yil OAQA 
ALAJ1ZU50 


AOO 
OZ/ 


1 /1AQ 
1-405 


UTAAOTYT 11 
TlM5L)L,3 / 


1 1 c 

113 


y / jyyo 


A PAl OOI 1 

AUUlooll 


AOC 

OZo 


1 ooo 
1-ZZZ 


XTKAOTM 11 


113 


y /3yyo 


ALAJ15511 


AOO 

ozy 


1 _/t AQ 
1-405 


TTX/TOTYT n 

xiMoJJJUi / 


IK 

113 


y /jyyo 


ALAJ154y4 


A^n 

03 U 


1 OO/l 

1-ZZ4 




113 


y/3yyo 


AL^Ulo4y4 


A^1 
031 


1 1 Q^A 
1-1oj4 


HINljUllZ 


1 OQ 

lzo 


tf\A tine 
1U413 /3 


A /"VMY2 AO 

ALAJU30 /3 


AIO 
03Z 


1 01 OQ 
l-ZIZo 


rilNUUllZ 


1Z5 


i c\A i n ^ 
1U413 /3 


APnni ooq 
A^UUlZZo 


A*}** 
033 


1 O 1 OQ 

i-zizy 


HNGOI12 


128 


1041375 


AC013791 


634 


1-2132 


HNHFM14 


130 


. 664507 


AC020552 


635 


1-290 


HNHFM14 


130 


664507 


AC020552 


636 


1-96 


HPJBK12 


147 


1011467 


AC022033 


637 


1-2649 


HPJBK12 


147 


1011467 


AC013541 


638 


1-2649 


HPJBK12 


147 


1011467 


AC022033 


639 


1-190 


HPJBK12 


147 


1011467 


AC013541 


640 


1-190 


HPRAL78 


149 


1352342 


AC007783 


641 


1-2334 
2508-3084 
3578-3890 
4198-4294 
4376-4623 
4712-5349 
5369-6088 
6527-7107 
7298-7392 
7730-7846 
9147-9476 
10487-10575 
10791-11298 
11485-11601 
11783-13009 
13207-13501 
13540-13772 
14439-14800 
14923-14983 
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15133-15355 
15485-15653 
16750-16805 
16894-17078 
17162-17219 
18003-18089 
21085-21146 
21358-21501 


HPRAL78 


149 


1352342 


AC007783 


642 


1-308 


HPRAL78 


149 


. 1352342 


AC007783 


643 


1-1024 


HRGBL78 


153 


910133 


AL359541 


644 


1-254 
2777-3307 
3670-3823 
4113-4385 
4844-5381 
5995-7365 


HSAWD74 


156 


460527 


AC0Q4951 


645 


1-1651 
1740-2593 


HSAWD74 


156 


460527 


AC004951 


646 


1-149 


HSAWD74 


156 


460527 


AC004951 


647 


1-5057 
5082-8353 
8404-8996 


HTPCS72 


177 


854941 


AL008639 


648 


1-106 
1457-1595 
1666-2484 
2910-3006 
3705-4147 
4768-5141 
5304-5536 
5746-5874 
7114-7241 
7468-7711 
7963-8746 
9438-12408 
12884-14976 


HTPCS72 


177 


854941 


AL008639 


649 


1-720 


HTPIH83 


178 


919916 


AL158821 


650 


1-1862 
1880-3126 



Tables ID: The polynucleotides or polypeptides, or agonists or antagonists of the present 
invention can be used fai assays to test for one or more biological activities. If these 
polynucleotides and polypeptides do exhibit activity in a particular assay, it is likely that these 
molecules may be involved in the diseases associated with the biological activity. Thus, the 
polynucleotides or polypeptides, or agonists or antagonists could be used to treat the associated 
disease. 

The present invention encompasses methods of detecting, preventing, diagnosing, 
prognosticating, treating, and/or ameliorating a disease or disorder. In preferred embodiments, the 
present invention encompasses a method of treating a gastrointestinal disease or disorder 
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comprising administering to a patient in which such detection, treatment, prevention, and/or 
amelioration is desired a protein, nucleic acid, or antibody of the invention (or fragment or variant 
thereof) in an amount effective to detect, prevent, diagnose, prognosticate, treat, and/or ameliorate 
the gastrointestinal disease or disorder. 
5 In another embodiment, the present invention also encompasses methods of detecting, 

preventing, diagnosing, prognosticating, treating, and/or ameliorating a gastrointestinal disease or 
disorder, comprising administering to a patient combinations of the proteins, nucleic acids, or 
antibodies of the invention (or fragments or variants thereof), sharing similar indications as shown 
in the corresponding rows in Column 3 of Table ID. 

10 Table ID provides information related to biological activities for polynucleotides and 

polypeptides of the invention (including antibodies, agonists, and/or antagonists thereof). Table 
ID also provides information related to assays which may be used to test polynucleotides and 
polypeptides of the invention (including antibodies, agonists, and/or antagonists thereof) for the 
corresponding biological activities. The first column ("Gene No.") provides the gene number in 

15 the application for each clone identifier. The second column ("cDNA Clone ID:") provides the 
unique clone identifier for each clone as previously described and indicated in Table 1A through 
Table ID. The third column ("AA SEQ ID NO:Y") indicates the Sequence Listing SEQ ID 
Number for polypeptide sequences encoded by the corresponding cDNA clones (also as indicated 
in Tables 1A, Table IB, and Table 2). The fourth column ("Biological Activity") indicates a 

20 biological activity corresponding to the indicated polypeptides (or polynucleotides encoding said 
polypeptides). The fifth column ("Exemplary Activity Assay") further describes the corresponding 
biological activity and also provides information pertaining to the various types of assays which 
may be performed to test, demonstrate, or quantify the corresponding biological activity. 

Table ID describes the use of, inter alia, FMAT technology for testing or demonstrating 
25 various biological activities. Fluorometric microvolume assay technology (FMAT) is a 
fluorescence-based system which provides a means to perform nonradioactive cell- and bead- 
based assays to detect activation of cell signal transduction pathways. This technology was 
designed specifically for ligand binding and immunological assays. Using this technology, 
fluorescent cells or beads at the bottom of the well are detected as localized areas of concentrated 
30 fluorescence using a data processing system. Unbound flurophore comprising the background 
signal is ignored, allowing for a wide variety of homogeneous assays. FMAT technology may be 
used for peptide ligand binding assays, immunofluorescence, apoptosis, cytotoxicity, and bead- 
based immunocapture assays. See, Miraglia S eL al., "Homogeneous cell and bead based assays 
for highthroughput screening using flourometric microvolume assay technology," Journal of 
35 Biomolecular Screening; 4:193-204(1999). In particular, FMAT technology may be used to test, 
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confirm, and/or identify the ability of polypeptides (including polypeptide fragments and variants) 
to activate signal transduction pathways. For example, FMAT technology may be used to test, 
confirm, and/or identify the ability of polypeptides to upregulate production of immunomodulatory 
proteins (such as, for example, interleukins, GM-CSF, Rantes, and Tumor Necrosis factors, as well 
5 as other cellular regulators (e.g. insulin)). 

Table ID also describes the use of kinase assays for testing, demonstrating, or quantifying 
biological activity. In this regard, the phosphorylation and de-phosphorylation of specific amino 
acid residues (e.g. Tyrosine, Serine, Threonine) on cell-signal transduction proteins provides a fast, 

10 reversible means for activation and de-activation of cellular signal transduction pathways. 
Moreover, cell signal transduction via phosphorylation/de~phosphorylation is crucial to the 
regulation of a wide variety of cellular processes (e.g. proliferation, differentiation, migration, 
apoptosis, etc.). Accordingly, kinase assays provide a powerful tool useful for testing, confirming, 
and/or identifying polypeptides (including polypeptide fragments and variants) that mediate cell 

15 signal transduction events via protein phosphorylation. See e.g., Forrer, P., Tamaskovic R., and 
Jaussi, R. "Enzyme-Linked Immunosorbent Assay for Measurement of JNK, ERK, and p38 Kinase 
Activities" Biol. Chem. 379(8-9): 1101-1110 (1998). 
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Table 2 further characterizes certain encoded polypeptides of the invention, by providing 
the results of comparisons to protein and protein family databases. The first column provides a 
unique clone identifier, "Clone ID NO:", corresponding to a cDNA clone disclosed in Table 1A 
5 and/or Table IB. The second column provides the unique contig identifier, "Contig ID:" which 
allows correlation with the information in Table IB. The third column provides the sequence 
identifier, "SEQ ID NO:", for the contig polynucleotide sequences. The fourth column provides 
the analysis method by which the homology/identity disclosed in the Table was determined. The 
fifth column provides a description of the PFAM/NR hit identified by each analysis. Column six 

10 provides the accession number of the PFAM/NR hit disclosed in the fifth column. Column seven, 
score/percent identity, provides a quality score or the percent identity, of the hit disclosed in 
column five. Comparisons were made between polypeptides encoded by polynucleotides of the 
invention and a non-redundant protein database (herein referred to as "NR")» or a database of 
protein families (herein referred to as *TFAM")> as described below. 

15 The NR database, which comprises the NBRF PIR database, the NCBI GenPept database, 

and the SIB SwissProt and TrEMBL databases, was made non-redundant using the computer 
program nrdb2 (Warren Gish, Washington University in Saint Louis). Each of the polynucleotides 
shown in Table IB, column 3 (e.g., SEQ ID NO:X or the 'Query' sequence) was used to search 
against the NR database. The computer program BLASTX was used to compare a 6-frame 

20 translation of the Query sequence to the NR database (for information about the BLASTX 
algorithm please see Altshul et al., J. MoL Biol. 215:403-410 (1990), and Gish and States, Nat 
Genet. 3:266-272 (1993). A description of the sequence that is most similar to the Query 
sequence (the highest scoring 'Subject') is shown in column five of Table 2 and the database 
accession number for that sequence is provided in column six. The highest scoring 'Subject' is 

25 reported in Table 2 if (a) the estimated probability that the match occurred by chance alone is less 
than l.Oe-07, and (b) the match was not to a known repetitive element BLASTX returns 
alignments of short polypeptide segments of the Query and Subject sequences which share a high 
degree of similarity; these segments are known as High-Scoring Segment Pairs or HSPs. Table 2 
reports the degree of similarity between the Query and the Subject for each HSP as a percent 

30 identity in Column 7. The percent identity is determined by dividing the number of exact matches 
between the two aligned sequences in the HSP, dividing by the number of Query amino acids in 
the HSP and multiplying by 100. The polynucleotides of SEQ ID NO:X which encode the 
polypeptide sequence that generates an HSP are delineated by columns 8 and 9 of Table 2. 

The PFAM database, PFAM version 2.1, (Sonnhammer, Nucl. Acids Res., 26:320-322, 

35 1998))consists of a series of multiple sequence alignments; one alignment for each protein family. 
Each multiple sequence alignment is converted into a probability model called a Hidden Markov 
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Model, or HMM, that represents the position-specific variation among the sequences that make up 
the multiple sequence alignment (see, e.g., Durbin, et al., Biological sequence analysis: 
probabilistic models of proteins and nucleic acids, Cambridge University Press, 1998 for the 
theory of HMMs). The program HMMER version 1.8 (Sean Eddy, Washington University in 
5 Saint Louis) was used to compare the predicted protein sequence for each Query sequence (SEQ 
ED NO:Y in Table 1B.1) to each of the HMMs derived from PFAM version 2.1. A HMM derived 
from PFAM version 2.1 was said to be a significant match to a polypeptide of the invention if the 
score returned by HMMER 1.8 was greater than 0.8 times the HMMER 1.8 score obtained with 
the most distantly related known member of that protein family. Hie description of the PFAM 

10 family which shares a significant match with a polypeptide of the invention is listed in column 5 
of Table 2, and the database accession number of the PFAM hit is provided in column 6. Column 
7 provides the score returned by HMMER version 1.8 for the alignment. Columns 8 and 9 
delineate the polynucleotides of SEQ ID NO:X which encode the polypeptide sequence which 
show a significant match to a PFAM protein family. 

15 As mentioned, columns 8 and 9 in Table 2, "NT From" and "NT To", delineate the 

polynucleotides of "SEQ ID NO:X" that encode a polypeptide having a significant match to the 
PFAM/NR database as disclosed in the fifth column. In one embodiment, the invention provides a 
protein comprising, or alternatively consisting of, a polypeptide encoded by the polynucleotides 
of SEQ ID NO:X delineated in columns 8 and 9 of Table 2. Also provided are polynucleotides 

20 encoding such proteins, and the complementary strand thereto. 

The nucleotide sequence SEQ ID NO:X and the translated SEQ ID NO: Y are sufficiently 
accurate and otherwise suitable for a variety of uses well known in the art and described further 
below. For instance, the nucleotide sequences of SEQ ID NO:X are useful for designing nucleic 
acid hybridization probes that will detect nucleic acid sequences contained in SEQ ID NO:X or 

25 the cDNA contained in ATCC Deposit No:Z. These probes will also hybridize to nucleic acid 
molecules in biological samples, thereby enabling immediate applications in chromosome 
mapping, linkage analysis, tissue identification and/or typing, and a variety of forensic and 
diagnostic methods of the invention. Similarly, polypeptides identified from SEQ K> NO:Y may 
be used to generate antibodies which bind specifically to these polypeptides, or fragments thereof, 

30 and/or to the polypeptides encoded by the cDNA clones identified in, for example, Table 1A 
and/or IB. 

Nevertheless, DNA sequences generated by sequencing reactions can contain sequencing 
errors. The errors exist as misidentified nucleotides, or as insertions or deletions of nucleotides in 
the generated DNA sequence. The erroneously inserted or deleted nucleotides cause frame shifts 
35 in the reading frames of the predicted amino acid sequence. In these cases, the predicted amino 
acid sequence diverges from the actual amino acid sequence, even though the generated DNA 
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sequence may be greater than 99.9% identical to the actual DNA sequence (for example, one base 
insertion or deletion in an open reading frame of over 1000 bases). 

Accordingly, for those applications requiring precision in the nucleotide sequence or the 
amino acid sequence, the present invention provides not only the generated nucleotide sequence 
identified as SEQ ID NO:X, and a predicted translated amino acid sequence identified as SEQ E) 
NO:Y, but also a sample of plasmid DNA containing cDNA ATCC Deposit No:Z (e.g., as set 
forth in columns 2 and 3 of Table 1A and/or as set forth, for example, in Table IB, 6, and 7). 
The nucleotide sequence of each deposited clone can readily be determined by sequencing the 
deposited clone in accordance with known methods. Further, techniques known in the art can be 
used to verify the nucleotide sequences of SEQ ID NO;X. The predicted amino acid sequence 
can then be verified from such deposits. Moreover, the amino acid sequence of the protein 
encoded by a particular clone can also be directly determined by peptide sequencing or by 
expressing the protein in a suitable host cell containing the deposited human cDNA, collecting the 
protein, and determining its sequence. 
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(Q9WVT0) SEVEN TRANSMEMBRANE RECEPTOR, 


(Q9BQM3) DJ842G6.1.1 (NOVEL PROTEIN) 
(FRAGMENT). 


(Q9QZH5) PUTATIVE 
PHOSPHATE/PHOSPHOENOLPYRUVATE 

TRANSLOCATOR. 


(09HBN2) HYPOTHETICAL 15.8 KDA PROTEIN. 
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(Q9NYC6) NEURONAL SPECIFIC TRANSCRIPTION 
FACTOR D ATI. 


membrane glycoprotein M6 - mouse 


(AAK55521) PRO0764. 
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WEAKLY SIMILAR TO 3-0 
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(Q9DBD3) 1300017C12RIK PROTEIN. 


(Q96FV2) Unknown (protein for MAGE: 39457 15) 
(Fragment). 


(Q96FV2) Unknown (protein for IMAGE: 39457 15) 
(Fragment). 


PFAM: Peptidase family M20/M25/M40 


(Q96KN2) Glutamate carboxypeptidase-like protein 2. 


(Q9BWV3) PROTEIN KINASE NYD-SP15. 


(Q9UM21) UDP-GLCNAC.A-U-D-MANNOSIDE B-1,4- 
N-ACETYLGLUCOSAMINYLTRANS 


hypothetical protein DKFZp586J0619.1 - human (fragment) 


(Q9H5F8) CDNA: FLJ23476 FIS, CLONE HSI14935. 


(Q96LB9) Peptidoglycan recognition protein-I-alpha 
precursor. 


(Q96LB9) Peptidoglycan recognition protein-I-alpha 
precursor. 


PFAM: Kazal-type serine protease inhibitor domain 


(POlOOl) ACROS1N INHIBITORS HA AND IBS (BUSI-H). 


(Q96BH2) Hypothetical 34.4 kDa protein. 


(Q9NVA4) CDNA FLJ 10846 FIS, CLONE NT2RP400 1373. 
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(AAL32175) Chromosome 17 open reading frame 26. 


(Q9H651) CDNA: FLJ22604 FIS, CLONE HSI04630 (BBP- 
LIKE PROTEIN 2). 


(09H743) CDNA: FU21394 FIS, CLONE COL03536. 
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(Q96F65) Similar to RIKEN cDNA 0610031J06 gene 
(Fragment). 


(Q96F65) Similar to RIKEN cDNA 0610031J06 gene 
(Fragment)'. 


(Q96AA3) Putative endoplasmic reticulum multispan 
transmembrane prote 


PFAM: 7 transmembrane receDtor frhodoDsin family) 
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(Q9H1Y3) DJ317G22.2 (ENCEPHALOPSIN) (PANOPSIN). 


PFAM: 7 transmembrane receptor (rhodopsin family) 


(Q9H1Y3) DJ317G22.2 (ENCEPHALOPSIN) (PANOPSIN). 


(Q9H1S5) BA1 10H4.2 (SIMILAR TO MEMBRANE 
PROTEIN). 


(015232) MATRILIN-3 PRECURSOR. 


PFAM: von Willebrand factor tvt>e A domain 




(015232) MATRILIN-3 PRECURSOR. 
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PFAM: von Willebrand factor type A domain 


PFAM: Papain family cysteine protease 


(BAB22302) Adult male kidney cDNA, RIKEN full-lengt 


(CAC37795) H-l(3)mbt-like protein. 


PFAM: SET domain 


(Q9D8Y9) 18 10018L05RIK PROTEIN. 


(Q96NR6) CDNA FLJ30278 fis, clone BRACE2002755. 
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(Q9CQS3) 1110018M03RIK PROTEIN. 


(060448) NEURONAL THREAD PROTEIN AD7C-NTP. 


(Q9NP77) CDNA FU 10947 FIS, CLONE PLACE1000066, 
WEAKLY SIMILAR TO SSU 
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Q9H728 


Q9NX85 


PF00047 


Q9Y279 


(Q9NP77) CDNA FLJ 10947 FIS, CLONE PLACE1000066, 
WEAKLY SIMILAR TO SSU 


(AAH08720) Unknown (protein for MGG8447). 


(Q91XD7) Unknown (protein for MGC: 18896). 


1 (Q9HA75) CDNA FLJ12122 FIS, CLONE 
MAMMA1000129. 


(Q9HA75) CDNA ELJ12122 FIS, CLONE 
MAMMA1000129. 


(AAH08084) Hypothetical 50.4 kDa protein. 


(Q9Y5X6) BLOOD PLASMA GLUTAMATE 
CARBOXYPEPTIDASE PRECURSOR (EC 3.4.17 


(Q9Y5X6) BLOOD PLASMA GLUTAMATE 
CARBOXYPEPTIDASE PRECURSOR (EC 3.4.17 


AMINOPEPTIDASE. 


PFAM: Immunoglobulin domain 


(AAL58111)FREB. 


(Q96ES0) Unknown (protein for MGC: 16944). 


(Q9H728) CDNA: FLJ21463 FIS, CLONE COL04765. 


(Q9NX85) CDNA FLJ20378 HS, CLONE KAIA0536. 


PFAM: Immunoglobulin domain 


(Q9Y279) Z39IG PROTEIN PRECURSOR. 
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dbj|BAB397 
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Q9H728 
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18.1[ 
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AAG49022 
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Q9H728 


(Q9BYJ0) KSP37. 


(AB021123) Ksp37 [Homo sapiens] 


(060245) PCDH7 (BH-PCDH)A. 


(Q9H728) CDNA: FLJ21463 HS, CLONE COL04765. 


(Q9CRM1) 2610001E17RIK PROTEIN (FRAGMENT). 


(AK020169) putative [Mxis muscuius] 


(Q9CZY7) 2610307O08RIK PROTEIN. 


(Q9P195) PR01722. 


(Q95LL0) Hypothetical 11.3 kDa protein. 


cytoplasmic linker protein CLDM 15 - rat 


(Q9D4I2) 4932408F18RIK PROTEIN. 


PFAM: Immunoglobulin domain 


(AAG49022) Junctional adhesion molecule 2. 


(AAH20029) Hypothetical 39.4 kDa protein. 


.(Q9JB3) EPCS26 (PLACl) (PLACENTAL SPECIFIC 
■PROTEIN 1). 


(Q9H728) CDNA: FLJ21463 FIS; CLONE COL04765. 
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RACE Protocol For Recovery of Full-Length Genes 

Partial cDNA clones can be made full-length by utilizing the rapid amplification of cDNA 
ends (RACE) procedure described in Frohman, M.A., et aL, Proc. Natl. Acad. Sci. USA, 85:8998- 
9002 (1988). A cDNA clone missing either the 5' or 3' end can be reconstructed to include the 
5 absent base pairs extending to the translational start or stop codon, respectively. In some cases, 
cDNAs are missing the start codon of translation, therefor. The following briefly describes a 
modification of this original 5' RACE procedure. Poly A+ or total RNA is reverse transcribed with 
Superscript II (Gibco/BRL) and an antisense or complementary primer specific to the cDNA 
sequence. The primer is removed from the reaction with a Microcon Concentrator (Amicon). The 

10 first-strand cDNA is then tailed with dATP and terminal deoxynucleotide transferase 
(Gibco/BRL). Thus, an anchor sequence is produced which is needed for PCR amplification. The 
second strand is synthesized from the dA-tail in PCR buffer, Taq DNA polymerase (Peririn-Elrner 
Cetus), an oligo-dT primer containing three adjacent restriction sites (Xhol, Sail and Clal) at the 5' 
end and a primer containing just these restriction sites. This double-stranded cDNA is PCR 

15 amplified for 40 cycles with the same primers as well as a nested cDNA-specific antisense primer. 
The PCR products are size-separated on an ethidium bromide-agarose gel and the region of gel 
containing cDNA products the predicted size of missing protein-coding DNA is removed. cDNA 
is purified from the agarose with the Magic PCR Prep kit (Promega), restriction digested with 
Xhol or Sail, and ligated to a plasmid such as pBluescript SKII (Stratagene) at Xhol and EcoRV 

20 sites. This DNA is transformed into bacteria and the plasmid clones sequenced to identify the 
correct protein-coding inserts. Correct 5' ends are confirmed by comparing this sequence with the 
putatively identified homologue and overlap with the partial cDNA clone. Similar methods known 
in the art and/or commercial kits are used to amplify and recover 3' ends. 

Several quality-controlled kits are commercially available for purchase. Similar reagents 

25 and methods to those above are supplied in kit form from Gibco/BRL for both 5' and 3' RACE for 
recovery of full length genes. A second kit is available from Clontech which is a modification of a 
related technique, SLIC (single-stranded ligation to single-stranded cDNA), developed by Dumas 
et aL, Nucleic Acids Res., 19:5227-32 (1991). The major differences in procedure are that the 
RNA is alkaline hydrolyzed after reverse transcription and RNA ligase is used to join a restriction 

30 site-containing anchor primer to the first-strand cDNA. This obviates the necessity for the dA- 
tailing reaction which results in a polyT stretch that is difficult to sequence past 

An alternative to generating 5' or 3' cDNA from RNA is to use cDNA library double- 
stranded DNA. An asymmetric PCR-amplified antisense cDNA strand is synthesized with an 
antisense cDNA-specific primer and a plasmid-anchored primer. These primers are removed and a 

35 symmetric PCR reaction is performed with a nested cDNA-specific antisense primer and the 
plasmid-anchored primer. 
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RNA Ligase Protocol For Generating The 5' or 3' End Sequences To Obtain Full 
Length Genes 

Once a gene of interest is identified, several methods are available for the identification of 
the 5' or 3' portions of the gene which may not be present in the original cDNA plasmid. These 
5 methods include, but are not limited to, filter probing, clone enrichment using specific probes and 
protocols similar and identical to 5' and 3* RACE. While the full length gene may be present in the 
library and can be identified by probing, a useful method for generating the 5' or 3' end is to use 
the existing sequence information from the original cDNA to generate the missing information. A 
method similar to 5' RACE is available for generating the missing 5' end of a desired full-length 

10 gene. (This method was published by Fromont-Racine et al., Nucleic Acids Res., 21(7): 1683-1684 
(1993)). Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a population of RNA 
presumably containing full-length gene RNA transcript and a primer set containing a primer 
specific to the ligated RNA oligonucleotide and a primer specific to a known sequence of the gene 
of interest, is used to PCR amplify the 5' portion of the desired full length gene which may then be 

15 sequenced and used to generate the full length gene. This method starts with total RNA isolated 
from the desired source, poly A RNA may be used but is not a prerequisite for this procedure. The 
RNA preparation may then be treated with phosphatase if necessary to eliminate 5* phosphate 
groups on degraded or damaged RNA which may interfere with the later RNA ligase step. The 
phosphatase if used is then inactivated and the RNA is treated with tobacco acid pyrophosphatase 

20 in order to remove the cap structure present at the 5' ends of messenger RNAs. This reaction 
leaves a 5' phosphate group at the 5' end of the cap cleaved RNA which can then be ligated to an 
RNA oligonucleotide using T4 RNA ligase. This modified RNA preparation can then be used as a 
template for first strand cDNA synthesis using a gene specific oligonucleotide. The first strand 
synthesis reaction can then be used as a template for PCR amplification of the desired 5' end using 

25 a primer specific to the ligated RNA oligonucleotide and a primer specific to the known sequence 
of the gene of interest. The resultant product is then sequenced and analyzed to confirm that the 5' 
end sequence belongs to the relevant gene. 

The present invention also relates to vectors or plasmids which include such DNA 
sequences, as well as the use of the DNA sequences. The material deposited with the ATCC (e.g., 

30 as described in columns 2 and 3 of Table 1 A, and/or as set forth in Table IB, Table 6, or Table 7) 
is a mixture of cDNA clones derived from a variety of human tissue and cloned in either a plasmid 
vector or a phage vector, as described, for example, in Table 1 A and Table 7. These deposits are 
referred to as "the deposits" herein. The tissues from which some of the clones were derived are 
listed in Table 7, and the vector in which the corresponding cDNA is contained is also indicated in 

35 Table 7. The deposited material includes cDNA clones corresponding to SEQ ID NO:X described, 
for example, in Table 1A and/or Table IB (ATCC Deposit No:Z). A clone which is isolatable 
from the ATCC Deposits by use of a sequence listed as SEQ ID NO:X, may include the entire 
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coding region of a human gene or in other cases such clone may include a substantial portion of 
the coding region of a human gene. Furthermore, although the sequence listing may in some 
instances list only a portion of the DNA sequence in a clone included in the ATCC Deposits, it is 
well within the ability of one skilled in the art to sequence the DNA included in a clone contained 
5 in the ATCC Deposits by use of a sequence (or portion thereof) described in, for example Tables 
1 A and/or Table IB or Table 2, by procedures hereinafter further described, and others apparent to 
those skilled in the art. 

Also provided in Table 1 A and Table 7 is the name of the vector which contains the cDNA 
clone. Each vector is routinely used in the art. The following additional information is provided 

10 for convenience. 

Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap XR (U.S. 
Patent Nos. 5,128, 256 and 5,286,636), Zap Express (U.S. Patent Nos. 5,128,256 and 5,286,636), 
pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res. 76:7583-7600 (1988); Alting-Mees, M. 
A. and Short, J. M., Nucleic Acids Res. 1 7:9494 (1989)) and pBK (Alting-Mees, M. A. et al., 

15 Strategies 5:58-61 (1992)) are commercially available from Stratagene Cloning Systems, Inc., 
1101 1 N. Torrey Pines Road, La Jolla, CA, 92037. pBS contains an ampicillin resistance gene and 
pBK contains a neomycin resistance gene. Phagemid pBS may be excised from the Lambda Zap 
and Uni-Zap XR vectors, and phagemid pBK may be excised from the Zap Express vector. Both 
phagerrrids may be transformed into E. coli strain XL-1 Blue, also available from Stratagene. 

20 Vectors pSportl, pCMVSport 1.0, pCMVSport 2.0 and pCMVSport 3.0, were obtained 

from Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors contain 
an ampicillin resistance gene and may be transformed into K coli strain DH10B, also available 
from Life Technologies. See, for instance, Gruber, C. E., et al., Focus 15:59- (1993). Vector 
lafrnid BA (Bento Soares, Columbia University, New York, NY) contains an ampicillin resistance 

25 gene and can be transformed into E. coli strain XL-1 Blue. Vector pCR 2.1, which is available 
from Invitrogen, 1600 Faraday Avenue, Carlsbad, CA 92008, contains an ampicillin resistance 
gene and may be transformed into E. coli strain DH10B, available from Life Technologies. See, 
for instance, Clark, J. M., Nuc. Acids Res. /6:9677-9686 (1988) and Mead, D. et al, 
Bio/Technology 9: (1991). 

30 The present invention also relates to the genes corresponding to SEQ ID NO:X, SEQ ID 

NO: Y, and/or the deposited clone (ATCC Deposit No:Z). The corresponding gene can be isolated 
in accordance with known methods using the sequence information disclosed herein. Such 
methods include preparing probes or primers from the disclosed sequence and identifying or 
amplifying the corresponding gene from appropriate sources of genomic material. 

35 Also provided in the present invention are allelic variants, . orthologs, and/or species 

homologs. Procedures known in the art can be used to obtain full-length genes, allelic variants, 
splice variants, full-length coding portions, orthologs, and/or species homologs of genes 
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corresponding to SEQ ID NO:X or the complement thereof, polypeptides encoded by genes 
corresponding to SEQ ID NO:X or the complement thereof, and/or the cDNA contained in ATCC 
Deposit No:Z, using information from the sequences disclosed herein or the clones deposited with 
the ATCC. For example, allelic variants and/or species homologs may be isolated and identified by 
5 making suitable probes or primers from the sequences provided herein and screening a suitable 
nucleic acid source for allelic variants and/or the desired homologue. 

The polypeptides of the invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a combination of 

10 these methods. Means for preparing such polypeptides are well understood in the art. 

The polypeptides may be in the form of the secreted protein, including the mature form, or 
may be a part of a larger protein, such as a fusion protein (see below). It is often advantageous to 
include an additional amino acid sequence which contains secretory or leader sequences, pro- 
sequences, sequences which aid in purification, such as multiple histidine residues, or an additional 

1 5 sequence for stability during recombinant production. 

The polypeptides of the present invention are preferably provided in an isolated form, and 
preferably are substantially purified. A recombinantly produced version of a polypeptide, 
including the secreted polypeptide, can be substantially purified using techniques described herein 
or od*™* ttaown in the art, S0C h as, for ex^ta, by Ore one^p mathod dascribed ir. Saath 

20 and Johnson, Gene 67:31-40 (1988). Polypeptides of the invention also can be purified from 
natural, synthetic or recombinant sources using techniques described herein or otherwise known in 
the art, such as, for example, antibodies of the invention raised against the polypeptides of the 
present invention in methods which are well known in the art 

The present invention provides a polynucleotide comprising, or alternatively consisting of, 

25 the nucleic acid sequence of SEQ ID NO:X, and/or the cDNA sequence contained in ATCC 
Deposit No:Z. The present invention also provides a polypeptide comprising, or alternatively, 
consisting of, the polypeptide sequence of SEQ ID NO:Y, a polypeptide encoded by SEQ ID 
NO:X or a complement thereof, a polypeptide encoded by the cDNA contained in ATCC Deposit 
No:Z, and/or the polypeptide sequence encoded by a nucleotide sequence in SEQ ID NO:B as 

30 defined in column 6 of Table 1C. Polynucleotides encoding a polypeptide comprising, or 
alternatively consisting of the polypeptide sequence of SEQ ID NO:Y, a polypeptide encoded by 
SEQ ID NO:X, a polypeptide encoded by the cDNA contained in ATCC Deposit No:Z, and/or a 
polypeptide sequence encoded by a nucleotide sequence in SEQ ID NO:B as defined in column 6 
of Table 1C are also encompassed by the invention. The present invention further encompasses a 

35 polynucleotide comprising, or alternatively consisting of, the complement of the nucleic acid 
sequence of SEQ ID NO:X, a nucleic acid sequence encoding a polypeptide encoded by the 
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complement of the nucleic acid sequence of SEQ ID NO:X, and/or the cDNA contained in ATCC 
Deposit No:Z. 

Moreover, representative examples of polynucleotides of the invention comprise, or 
alternatively consist of, one, two, three, four, five, six, seven, eight, nine, ten, or more of the 
sequences delineated in Table 1C column 6, or any combination thereof. Additional, 
representative examples of polynucleotides of the invention comprise, or alternatively consist of, 
one, two, three, four, five, six, seven, eight, nine, ten, or more of the complementary strand(s) of 
the sequences delineated in Table 1C column 6, or any combination thereof. In further 
embodiments, the above-described polynucleotides of the invention comprise, or alternatively 
consist of, sequences delineated in Table 1C, column 6, and have a nucleic acid sequence which is 
different from that of the BAC fragment having the sequence disclosed in SEQ ID NO:B (see 
Table 1C, column 5). In additional embodiments, the above-described polynucleotides of the 
invention comprise, or alternatively consist of, sequences delineated in Table 1C, column 6, and 
have a nucleic acid sequence which is different from that published for the BAC clone identified as 
BAC ID NO: A (see Table 1C, column 4). In additional embodiments, the above-described 
polynucleotides of the invention comprise, or alternatively consist of, sequences delineated in 
Table 1C, column 6, and have a nucleic acid sequence which is different from that contained in the 
BAC clone identified as BAC ID NO:A (see Table 1C, column 4). Polypeptides encoded by these 
polynucleotides, other polynucleotides that encode these polypeptides, and antibodies that bind 
these polypeptides are also encompassed by the invention. Additionally, fragments and variants of 
the above-described polynucleotides and polypeptides are also encompassed by the invention. 

Further, representative examples of polynucleotides of the invention comprise, or 
alternatively consist of, one, two, three, four, five, six, seven, eight, nine, ten, or more of the 
sequences delineated in column 6 of Table 1C which correspond to the same Clone ID (see Table 
1C, column 1), or any combination thereof. Additional, representative examples of 
polynucleotides of the invention comprise, or alternatively consist of, one, two, three, four, five, 
six, seven, eight, nine, ten, or more of the complementary strand(s) of the sequences delineated in 
column 6 of Table 1C which correspond to the same Clone ED (see Table 1C, column 1), or any 
combination thereof. In further embodiments, the above-described polynucleotides of the 
invention comprise, or alternatively consist of, sequences delineated in column 6 of Table 1C 
which correspond to the same Clone ID (see Table 1C, column 1) and have a nucleic acid 
sequence which is different from that of the BAC fragment having the sequence disclosed in SEQ 
ID NO:B (see Table 1C, column 5). In additional embodiments, the above-described 
polynucleotides of the invention comprise, or alternatively consist of, sequences delineated in 
column 6 of Table 1C which correspond to the same Clone ID (see Table 1C, column 1) and have 
a nucleic acid sequence which is different from that published for the BAC clone identified as 
BAC ID NO:A (see Table 1C, column 4). In additional embodiments, the above-described 
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polynucleotides of the invention comprise, ox alternatively consist of, sequences delineated in 
column 6 of Table 1C which correspond to the same Clone ID (see Table 1C, column 1) and have 
a nucleic acid sequence which is different from that contained in the B AC clone identified as B AC 
ID NO:A (see Table 1C, column 4). Polypeptides encoded by these polynucleotides, other 
polynucleotides that encode these polypeptides, and antibodies that bind these polypeptides are 
also encompassed by the invention. Additionally, fragments and variants of the above-described 
polynucleotides and polypeptides are also encompassed by the invention. 

Further, representative examples of polynucleotides of the invention comprise, or 
alternatively consist of, one, two, three, four, five, six, seven, eight, nine, ten, or more of the 
sequences delineated in column 6 of Table 1C which correspond to the same contig sequence 
identifier SEQ ID NO:X (see Table 1C, column 2), or any combination thereof. Additional, 
representative examples of polynucleotides of the invention comprise, or alternatively consist of, 
one, two, three, four, five, six, seven, eight, nine, ten, or more of the complementary strand(s) of 
the sequences delineated in column 6 of Table 1C which correspond to the same contig sequence 
identifier SEQ ID NO:X (see Table 1C, column 2), or any combination thereof. In further 
embodiments, the above-described polynucleotides of the invention comprise, or alternatively 
consist of, sequences delineated in column 6 of Table 1C which correspond to the same contig 
sequence identifier SEQ ID NO:X (see Table 1C, column 2) and have a nucleic acid sequence 
which is different from that of the BAC fragment having the sequence disclosed in SEQ ID NO:B 
(see Table 1C, column 5). In additional embodiments, the above-described polynucleotides of the 
invention comprise, or alternatively consist of, sequences delineated in column 6 of Table 1C 
which correspond to the same contig sequence identifier SEQ ID NO:X (see Table 1C, column 2) 
and have a nucleic acid sequence which is different from that published for the BAC clone 
identified as BAC ID NO:A (see Table 1C, column 4). In additional embodiments, the above- 
described polynucleotides of the invention comprise, or alternatively consist of, sequences 
delineated in column 6 of Table 1C which correspond to the same contig sequence identifier SEQ 
ID NO:X (see Table 1C, column 2) and have a nucleic acid sequence which is different from that 
contained in the BAC clone identified as BAC ID NO:A (See Table 1C, column 4). Polypeptides 
encoded by these polynucleotides, other polynucleotides that encode these polypeptides, and 
antibodies that bind these polypeptides are also encompassed by the invention. Additionally, 
fragments and variants of the above-described polynucleotides and polypeptides are also 
encompassed by the invention. 

Moreover, representative examples of polynucleotides of the invention comprise, or 
alternatively consist of, one, two, three, four, five, six, seven, eight, nine, ten, or more of the 
sequences delineated in the same row of Table 1C column 6, or any combination thereof. 
Additional, representative examples of polynucleotides of the invention comprise, or alternatively 
consist of, one, two, three, four, five, six, seven, eight, nine, ten, or more of the complementary 
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strand(s) of the sequences delineated in the same row of Table 1C column 6, or any combination 
thereof. In preferred embodiments, the polynucleotides of the invention comprise, ox alternatively 
consist of, one, two, three, four, five, six, seven, eight, nine, ten, or more of the complementary 
strand(s) of the sequences delineated in the same row of Table 1C column 6, wherein sequentially 
5 delineated sequences in the table (i.e. corresponding to those exons located closest to each other) 
are directly contiguous in a 5* to 3* orientation. In further embodiments, above-described 
polynucleotides of the invention comprise, or alternatively consist of, sequences delineated in the 
same row of Table 1C, column 6, and have a nucleic acid sequence which is different from that of 
the BAC fragment having the sequence disclosed in SEQ ID NO:B (see Table 1C, column 5). In 

10 additional embodiments, the above-described polynucleotides of the invention comprise, or • 
alternatively consist of, sequences delineated in the same row of Table 1C, column 6, and have a 
nucleic acid sequence which is different from that published for the BAC clone identified as BAC 
ID NO:A (see Table 1C, column 4). In additional embodiments, the above-described 
polynucleotides of the invention comprise, or alternatively consist of, sequences delineated in the 

15 same row of Table 1C, column 6, and have a nucleic acid sequence which is different from that 
contained in the BAC clone identified as BAC ID NO:A (see Table 1C, column 4). Polypeptides 
encoded by these polynucleotides, other polynucleotides that encode these polypeptides, and 
antibodies that bind these polypeptides are also encompassed by the invention. 

In additional specific embodiments, polynucleotides of the invention comprise, or 

20 alternatively consist of, one, two, three, four, five, six, seven, eight, nine, ten, or more of the 
sequences delineated in column 6 of Table 1C, and the polynucleotide sequence of SEQ ID NO:X 
(e.g., as defined in Table 1C, column 2) or fragments or variants thereof. Polypeptides encoded by 
these polynucleotides, other polynucleotides that encode these polypeptides, and antibodies that 
bind these polypeptides are also encompassed by the invention. 

25 In additional specific embodiments, polynucleotides of the invention comprise, or 

alternatively consist of, one, two, three, four, five, six, seven, eight, nine, ten, or more of the 
sequences delineated in column 6 of Table 1C which correspond to the same Clone ID (see Table 
1C, column 1), and the polynucleotide sequence of SEQ ID NO.X (e.g., as defined in Table 1A, 
Table IB, or Table 1Q or fragments or variants thereof. In preferred embodiments, the delineated 

30 sequence(s) and polynucleotide sequence of SEQ ID NO:X correspond to the same Clone ID. 
Polypeptides encoded by these polynucleotides, other polynucleotides that encode these 
polypeptides, and antibodies that bind these polypeptides are also encompassed by the invention. 

In further specific embodiments, polynucleotides of the invention comprise, or 
alternatively consist of, one, two, three, four, five, six, seven, eight, nine, ten, or more of the 

35 sequences delineated in the same row of column 6 of Table 1C, and the polynucleotide sequence 
of SEQ ID NO:X (e.g., as defined in Table 1A, Table IB, or Table 1Q or fragments or variants 
thereof. In preferred embodiments, the delineated sequence(s) and polynucleotide sequence of 
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SEQ ID NO:X correspond to the same row of column 6 of Table 1C. Polypeptides encoded by 
these polynucleotides, other polynucleotides that encode these polypeptides, and antibodies that 
bind these polypeptides are also encompassed by the invention. 

In additional specific embodiments, polynucleotides of the invention comprise, or 
5 alternatively consist of a polynucleotide sequence in which the 3' 10 polynucleotides of one of the 
sequences delineated in column 6 of Table 1C and the 5' 10 polynucleotides of the sequence of 
SEQ ID NO:X are directly contiguous. Nucleic acids which hybridize to the complement of these 
20 contiguous polynucleotides under stringent hybridization conditions or alternatively, under 
lower stringency conditions, are also encompassed by the invention. Polypeptides encoded by 

10 these polynucleotides and/or nucleic acids, other polynucleotides and/or nucleic acids that encode 
these polypeptides, and antibodies that bind these polypeptides are also encompassed by the 
invention. Additionally, fragments and variants of the above-described polynucleotides, nucleic 
acids, and polypeptides are also encompassed by the invention. 

In additional specific embodiments, polynucleotides of the invention comprise, or 

15 alternatively consist of, a polynucleotide sequence in which the 3' 10 polynucleotides of one of the 
sequences delineated in column 6 of Table 1C and the 5' 10 polynucleotides of a fragment or 
variant of the sequence of SEQ ID NO:X are directly contiguous Nucleic acids which hybridize to 
the complement of these 20 contiguous polynucleotides under stringent hybridization conditions or 
alternatively, under lower stringency conditions, are also encompassed by the invention. 

20 Polypeptides encoded by these polynucleotides and/or nucleic acids, other polynucleotides and/or 
nucleic acids encoding these polypeptides, and antibodies that bind these polypeptides are also 
encompassed by the invention. Additionally, fragments and variants of the above-described 
polynucleotides, nucleic acids, and polypeptides are also encompassed by the invention. 

In specific embodiments, polynucleotides of the invention comprise, or alternatively 

25 consist of, a polynucleotide sequence in which the 3' 10 polynucleotides of the sequence of SEQ 
ID NO:X and the 5* 10 polynucleotides of the sequence of one of the sequences delineated in 
column 6 of Table 1C are directly contiguous. Nucleic acids which hybridize to the complement 
of these 20 contiguous polynucleotides under stringent hybridization conditions or alternatively, 
under lower stringency conditions, are also encompassed by the invention. Polypeptides encoded 

30 by these polynucleotides and/or nucleic acids, other polynucleotides and/or nucleic acids encoding 
these polypeptides, and antibodies that bind these polypeptides are also encompassed by the 
invention. Additionally, fragments and variants of the above-described polynucleotides, nucleic 
acids, and polypeptides are also encompassed by the invention. 

In specific embodiments, polynucleotides of the invention comprise, or alternatively 

35 consist of, a polynucleotide sequence in which the 3' 10 polynucleotides of a fragment or variant 
of the sequence of SEQ ED NO:X and the 5* 10 polynucleotides of the sequence of one of the 
sequences delineated in column 6 of Table 1C are directly contiguous. Nucleic acids which 
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hybridize to the complement of these 20 contiguous polynucleotides under stringent hybridization 
conditions or alternatively, under lower stringency conditions, are also encompassed by the 
invention. Polypeptides encoded by these polynucleotides and/or nucleic acids, other 
polynucleotides and/or nucleic acids encoding these polypeptides, and antibodies that bind these 
5 polypeptides are also encompassed by the invention. Additionally, fragments and variants of the 
above-described polynucleotides, nucleic acids, and polypeptides, are also encompassed by the 
invention. 

In further specific embodiments, polynucleotides of the invention comprise, or 
alternatively consist of, a polynucleotide sequence in which the 3' 10 polynucleotides of one of the 

10 sequences delineated in column 6 of Table 1C and the 5* 10 polynucleotides of another sequence 
in column 6 are directly contiguous. Nucleic acids which hybridize to the complement of these 20 
contiguous polynucleotides under stringent hybridization conditions or alternatively, under lower 
stringency conditions, are also encompassed by the invention. Polypeptides encoded by these 
polynucleotides and/or nucleic acids, other polynucleotides and/or nucleic acids encoding these 

15 polypeptides, and antibodies that bind these polypeptides are also encompassed by the invention. 
Additionally, fragments and variants of the above-described polynucleotides, nucleic acids, and 
polypeptides are also encompassed by the invention. 

In specific embodiments, polynucleotides of the invention comprise, or alternatively 
consist of, a polynucleotide sequence in which the 3' 10 polynucleotides of one of the sequences 

20 delineated in column 6 of Table 1C and the 5' 10 polynucleotides of another sequence in column 6 
corresponding to the same Clone ID (see Table 1C, column 1) are directly contiguous. Nucleic 
acids which hybridize to the complement of these 20 lower stringency conditions, are also 
encompassed by the invention. Polypeptides encoded by these polynucleotides and/or nucleic 
acids, other polynucleotides and/or nucleic acids encoding these polypeptides, and antibodies that 

25 bind these polypeptides are also encompassed by the invention. Additionally, fragments and 
variants of the above-described polynucleotides, nucleic acids, and polypeptides are also 
encompassed by the invention. 

In specific embodiments, polynucleotides of the invention comprise, or alternatively 
consist of, a polynucleotide sequence in which the 3* 10 polynucleotides of one sequence in 

30 column 6 corresponding to the same contig sequence identifer SEQ ID NO:X (see Table 1C, 
column 2) are directly contiguous. Nucleic acids which hybridize to the complement of these 20 
contiguous polynucleotides under stringent hybridization conditions or alternatively, under lower 
stringency conditions, are also encompassed by the invention. Polypeptides encoded by these 
polynucleotides and/or nucleic acids, other polynucleotides and/or nucleic acids encoding these 

35 polypeptides, and antibodies that bind these polypeptides are also encompassed by the invention. 
Additionally, fragments and variants of the above-described polynucleotides, nucleic acids, and 
polypeptides are also encompassed by the invention. 
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In specific embodiments, polynucleotides of the invention comprise, or alternatively 
consist of a polynucleotide sequence in which the 3' 10 polynucleotides of one of the sequences 
delineated in column 6 of Table 1C and the 5' 10 polynucleotides of another sequence in column 6 
corresponding to the same row are directly contiguous. In preferred embodiments, the 3' 10 
5 polynucleotides of one of the sequences delineated in column 6 of Table 1C is directly contiguous 
with the 5' 10 polynucleotides of the next sequential exon delineated in Table 1C, column 6. 
Nucleic acids which hybridize to the complement of these 20 contiguous polynucleotides under 
stringent hybridization conditions or alternatively, under lower stringency conditions, are also 
encompassed by the invention. Polypeptides encoded by these polynucleotides and/or nucleic 
10 acids, other polynucleotides and/or nucleic acids encoding these polypeptides, and antibodies that 
bind these polypeptides are also encompassed by the invention. Additionally, fragments and 
variants of the above-described polynucleotides, nucleic acids, and polypeptides are also 
encompassed by the invention. 
Table 3 

15 Many polynucleotide sequences, such as EST sequences, are publicly available and 

accessible through sequence databases and may have been publicly available prior to conception of 
the present invention. Preferably, such related polynucleotides are specifically excluded from the 
scope of the present invention. Accordingly, for each contig sequence (SEQ ID NO:X) listed in the 
fifth column of Table 1A and/or the fourth column of Table IB, preferably excluded are one or 

20 - more polynucleotides comprising a nucleotide sequence described by the general formula of a-b, 
where a is any integer between 1 and the final nucleotide minus 15 of SEQ ID NO:X, b is an 
integer of 15 to the final nucleotide of SEQ ID NO:X, where both a and b correspond to the 

■ 

positions of nucleotide residues shown in SEQ 3D NO:X, and where b is greater than or equal to a 
+ 14. More specifically, preferably excluded are one or more polynucleotides comprising a 

25 nucleotide sequence described by the general formula of a-b, where a and b are integers as defined 
in columns 4 and 5, respectively, of Table 3. In specific embodiments, the polynucleotides of the 
invention do not consist of at least one, two, three, four, five, ten, or more of the specific 
polynucleotide sequences referenced by the Genbank Accession No. as disclosed in column 6 of 
Table 3 (including for example, published sequence in connection with a particular BAC clone). In 

30 further embodiments, preferably excluded from the invention are the specific polynucleotide 
sequence(s) contained in the clones corresponding to at least one, two, three, four, five, ten, or 
more of the available material having the accession numbers identified in the sixth column of this 
Table (including for example, the actual sequence contained in an identified BAC clone). In no 
way is this listing meant to encompass all of the sequences which may be excluded by the general 

35 formula, it is just a representative example. All references available through these accessions are 
hereby incorporated by reference in their entirety. 
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Description of Table 4 

Table 4 provides a key to the tissue/cell source identifier code disclosed in Table 1B.2, 
column 5. Column 1 of Table 4 provides the tissue/cell source identifier code disclosed in Table 
1B.2, Column 5. Columns 2-5 provide a description of the tissue or cell source. Note that 
"Description" and "Tissue" sources (i.e. columns 2 and 3) having the prefix "a_" indicates organs, 
tissues, or cells derived from "adult" sources. Codes corresponding to diseased tissues are 
indicated in column 6 with the word "disease." The use of the word "disease" in column 6 is non- 
limiting. The tissue or cell source may be specific (e.g. a neoplasm), or may be disease-associated 
(e.g., a tissue sample from a normal portion of a diseased organ). Furthermore, tissues and/or cells 
lacking the "disease" designation may still be derived from sources directly or indirectly involved 
in a disease state or disorder, and therefore may have a further utility in that disease state or * 
disorder. In numerous cases where the tissue/cell source is a library, column 7 identifies the vector 
use d to generate the library. 
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Disease 
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Cell Line 




























































Organ 
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Tissue 




CD 


a_Liver 


' a_mammary gland 


CO 

O 

J 


a_small intestine 


a_Stomach 


Blood B cells 


Blood B cells activated 


Blood B cells resting 


Blood T cells activated 


Blood T cells resting 


brain 


breast 


breast cancer 


Cell Line CAOV3 


cell line PA-1 


cell line transformed 


colon 


colon (9808co65R) 


colon (9809col5) 


colon cancer 


colon cancer (9808co64R) 


colon cancer 9809col4 


Donor II B Cells 24hrs 


Donor II B Cells 72hrs 


Donor II B -Cells 24 hrs. 


Donor II B-Cells 72hrs 


Donor II Resting B Cells 


Description 


s 


> 

1 


CS 

Tab 

3 


CD 

■4—1 

_cd 

to 
O 


a_small intestine 


o 

cd 

B 
o 


Blood B cells 


Blood B cells activated 


Blood B cells resting 


Blood T cells activated 


Blood T cells resting 


brain 


breast 


breast cancer 


Cell Line CAOV3 


cell line PA-1 


cell line transformed 


colon 


colon (9808co65R) 


colon (9809co 15) 


colon cancer 


colon cancer (9808co64R) 


colon cancer 9809col4 


Donor II B Cells 24hrs 


Donor II B Cells 72hrs 


Donor IIB -Cells 24 hrs. 


Donor HB-Cells 72hrs 


Donor II Resting B Cells 


Code 


AR022 


AR023 


AR024 


AR025 


AR026 


AR027 


AR028 


AR029 


AR030 , 


AR031 


AR032 


AR033 


AR034 


AR035 


AR036 


AR037 


AR038 


AR039 


AR040 


AR041 


AR042 


AR043 


AR044 


AR050 


AR051 


AR052 


AR053 


AR054 
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Heart 


Human Lung (clonetech) 


Human Mammarv fclontech) 


> 


Human Thymus (clonetech) 


Jurkat (unstimulated) 


Kidney 


Liver 


Liver (Clontech) 


Lymphocytes chronic 
lymphocytic leukaemia 


Lymphocytes diffuse large B 
cell lymphoma 


Lymphocytes follicular 
lymphoma 


normal breast 


Normal Ovarian (4004901) 


Normal Ovary 9508G045 


Normal Ovary 9701G208 


Normal Ovary 9806G005 


Ovarian Cancer 


Ovarian Cancer (9702G001) 


Ovarian Cancer (9707G029) 


Ovarian Cancer (9804G01 1) 


Ovarian Cancer (9806G019) 


Ovarian Cancer (9807G017) 


Heart 


Human Lung (clonetech) 


Human Mammarv 


(clontech) 


Human Thymus 
(clonetech) 


Jurkat (unstimulated) 


Kidney 


Liver 


Liver (Clontech) ~] 


Lymphocytes chronic 
lymphocytic leukaemia 


Lymphocytes diffuse 
large B cell lymphoma 


Lymphocytes follicular 
lymphoma 


normal breast 


Normal Ovarian 
(4004901) 


Normal Ovary 9508G045 


Normal Ovary 9701G208 


Normal Ovary 9806G005 


Ovarian Cancer 


Ovarian Cancer 
(9702G001) 


Ovarian Cancer 
.(9707G029) 


Ovarian Cancer 
(9804G011) 


Ovarian Cancer 
(9806GO19) 


Ovarian Cancer 
(9807G017) 


AR055 


AR056 


AR057 


AR058 


AR059 


AR060 


AR061 


AR062 


AR063 


AR064 


AR065 


AR066 


AR067 


AR068 


AR069 


AR070 


AR071 


<N 

o 

% 


AR073 


AR074 


AR075 


AR076 
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Ovarian Cancer (9809G001) 


ovarian cancer 15799 


Ovarian Cancer 177 17 AID 


Ovarian Cancer 4004664B1 


Ovarian Cancer 40053 15A1 


ovarian cancer 94127303 


Ovarian Cancer 96069304 


Ovarian Cancer 9707G029 


Ovarian Cancer 9807G045 


ovarian cancer 9809G001 


Ovarian Cancer 9905C032RC 


Ovarian cancer 9907 COO 3rd 


Prostate 


Prostate (clonetech) 


prostate cancer 


prostate cancer #15176 


prostate cancer #15509 


prostate cancer #15673 


Small Intestine (Clontech) 


Spleen 


Thymus T cells activated 


Thymus T cells resting 


Tonsil 


Tonsil geminal center 
centroblast 


Ovarian Cancer 
(9809G001) 


ovarian cancer 15799 


Ovarian Cancer 
177 17 AID 


Ovarian Cancer 
4004664B1 


Ovarian Cancer 
4005315A1 


ovarian cancer 94127303 


Ovaiian Cancer 96069304 


Ovarian Cancer 
9707G029 


Ovarian Cancer 
9807G045 


ovarian cancer 9809G001 


Ovarian Cancer 
9905C032RC 


Ovarian cancer 9907 COO 
3rd 


Prostate 


Prostate (clonetech) 


prostate cancer 


prostate cancer #15176 


prostate cancer #15509 


prostate cancer #15673 


Small Intestine (Clontech) 


Spleen 


Thymus T cells activated 


Thymus T cells resting 


Tonsil 


Tonsil geminal center 
centroblast 


AR077 


AR078 


AR079 


AR080 


AR081 


AR082 


AR083 


AR084 


AR085 


AR086 


AR087 


AR088 


AR089 


AR090 1 


AR091 


AR092 


AR093 


AR094 


AR095 I 


AR096 


AR097 


AR098 


AR099 | 


AR100 

1 
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Tonsil germinal center B cell 


Tonsil lymph node 


Tonsil memory B cell 


Whole Brain 


Xenograft ES-2 


Xenograft SW626 


* * 
r— 1 

8 


1 

• •> 

O 

o 


3 

* p 

r— 1 

o 
o 


002 : Monocytes untreated 
(Ihr) 


002 : Monocytes untreated 
(5hrs) 


002: Control. IC 


002: IL2.1C 


003 : Placebo-treated Rat 
Lacrimal Gland 


003 : Placebo-treated Rat 
Submandibular Gland 


004 : Monocytes untreated 
(5hrs) 


004 : Monocytes untreated Ihr 


005: Placebo (48hrs) 


006: pC4 (24hrs) 


006: pC4 (48hrs) 


007: PHA(lhr) 


007: PHA(6HRS) 


007: PMA(6hrs) 


008: 1449_#2 


1449 Sample 


Tonsil germinal center B 
cell 


Tonsil lymph node 


Tonsil memory B cell 


Whole Brain 


Xenograft ES-2 


Xenograft SW626 


001: IL-2 | 


001: IL-2. 1 


3 

1 

o 
o 


002 : Monocytes untreated 
(Ihr) 


002 : Monocytes untreated 
| (5hrs) 


002: Control. IC 


002:IL2.1C 


003 : Placebo-treated Rat 
Lacrimal Gland 


003 : Placebo-treated Rat 
Submandibular Gland 


004 : Monocytes untreated 
(5hrs) 


004 : Monocytes untreated 
Ihr 


005: Placebo (48hrs) 


006: pC4 (24hrs) 


/■ — \ 

CO 

OO 

v / 

5 

VO 

o 
o 


007: PHA(lhr) 


007: PHA(6HRS) 


007: PMA(6hrs) 


s 

Ok 

r— 1 

OO 

o 
o 


1449 Sample 


ARlOl 


AR102 


AR103 


AR104 


AR105 


AR106 


AR119 1 


AR120 


AR121 


AR124 


AR125 


AR126 


AR127 


AR130 


AR131 


AR135 


AR136 


AR139 


AR140 


AR141 


AR152 


AR153 


AR154 


AR155 


AR167 
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3T3P10 LOuM insulin 


3T3P1010nM Insulin 


3T3P10 lOuM insulin 


3T3P10 No Insulin 


3T3P4 


Adipose (41892) 


Adipose Diabetic (41611) 


Adipose Diabetic (41661) 


Adipose Diabetic (41689) 


Adipose Diabetic (41706) 


Adipose Diabetic (42352) 


Adipose Diabetic (42366) 


Adipose Diabetic (42452) 


Adipose Diabetic (42491) 


Adipose Normal (41843) 


Adipose Normal (41893) 


Adipose Normal (42452) 


Adrenal Gland 


Adrenal Gland + Whole Brain 


B7(lhr)+ (inverted) 


Breast (18275A2B) 


Breast (4004199) 


Breast (4004399) 
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Description of Table 5 

Table 5 provides a key to the OMIM reference identification numbers disclosed in Table 
5 1B.1, column 9. OMIM reference identification numbers (Column 1) were derived from Online 
Mendelian Inheritance in Man (Online Mendelian Inheritance in Man, OMIM. McKusick-Nathans 
Institute for Genetic Medicine, Johns Hopkins University (Baltimore, MD) and National Center 
for Biotechnology Information, National Library of Medicine, (Bethesda, MD) 2000. World Wide 
Web URL: http://www.ncbi.nlm.nm.go\Vomim/). Column 2 provides diseases associated with the 
10 cytologic band disclosed in Table 1B.1, column 8, as determined using the Morbid Map database. 



Table 5 



OMIM Reference 


Description 


101000 


Meningioma, NF2-related, sporadic Schwannoma, sporadic 


101000 


Neurofibromatosis, type 2 


101000 


Neurolemmomatosis 


101000 


Malignant mesothelioma, sporadic 


102200 ' 


Somatotrophinoma 


102772 


[AMP deaminase deficiency, erythrocytic] 


103600 


[Dysalbuminemic hyperthyroxinemia] 


103600 


IPysalbuminemichyperzincemia], 194470 


103600 


Analbuminemia 


103850 


Aldolase A deficiency 


104150 


[AFP deficiency, congenital] 


104150 


[Hereditary persistence of alpha-fetoprotein] 


104500 


Amelogenesis imperfecta-2, hypoplastic local type 


104770 


Amyloidosis, secondary, susceptibility to 


106100 


Angioedema, hereditary 


106210 


Peters anomaly 


106210 


Cataract, congenital, with late-onset corneal dystrophy 


106210 


Foveal hypoplasia, isolated, 136520 


106210 


Aniridia 


107271 


CD59 deficiency 


107300 


Antithrombin m deficiency 


107670 


Apolipoprotein A-II deficiency 


110700 


Vivax malaria, susceptibility to 


112261 


Fibrodysplasia ossificans progressiva 


114550 


Hepatocellular carcinoma 


114835 


Monocyte carboxyesterase deficiency 


115500 


Acatalasemia 


116800 


Cataract, Marner type 


116806 


Colorectal cancer 


116860 


Cavernous angiomatous malformations 


118485 


Polycystic ovary syndrome with hyperandrogenemia 


120070 


Alport syndrome, autosomal recessive, 203780 


120131 


Alport syndrome, autosomal recessive, 203780 


120131 


Hematuria, familial benign 


120140 


Osteoarthrosis, precocious 
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120140 


SED congenita 


120140 


SMED Strudwick type 


120140 


Stickler syndrome, type I 


120140 


Wagner syndrome, type II 


120140 


Achondrogenesis-hypochondrogenesis, type II 


120140 


Kniest dysplasia . 


120220 


Bethlem myopathy, 158810 


120240 


Bethlem myopathy, 158810 


120260 


Epiphyseal dysplasia, multiple, type 2, 600204 


120550 


Clq deficiency, type A 


120570 


Clq deficiency, type B 


120575 


Clq deficiency, type C 


121800 


Corneal dystrophy, crystalline, Schnyder 


123000 


Craniometaphyseal dysplasia 


123580 


Cataract, congenital, autosomal dominant 


123620 


Cataract, cerulean, type 2, 601547 


126060 


Anemia, megaloblastic, due to DHFR deficiency 


126090 


Hyperphenylalaninemia due to pterin-4a-carbinolamine dehydratase 
deficiency, 264070 


126337 


Myxoid liposarcoma 


126600 


Doyne honeycomb retinal dystrophy 


126600 


Drusen, radial, autosomal dominant 


129010 


Neuropathy, congenital hypomyelinating, 1 


129900 


EEC syndrome-l 


130500 


Elliptocytosis-1 


131100 


Multiple endocrine neoplasia I 


131100 


Prolactinoma, hyperparathyroidism, carcinoid syndrome 


131100 


Carcinoid tumor of lung 


131210 


Atherosclerosis, susceptibility to 


133200 


Erythrokeratodermia variabilis 


133701 


Exostoses, multiple, type 2 


133780 


Vitreoretinopathy, exudative, familial 


135940 


Ichthyosis vulgaris, 146700 


136132 


[Fish-odor syndrome], 602079 


136435 


Ovarian dysgenesis, hypergonadotropic, with normal karyotype, 233300 


136530 


Male infertility, familial 


138030 


[Hyperproglucagonemia] 


138140 


Glucose transport defect, blood-brain barrier 


138760 


[Glyoxalase II deficiency] 


138981 


Pulmonary alveolar proteinosis, 265120 


140100 


[Anhaptoglobinemia] 


140100 


[Hypohaptogloginemia] 


142600 


Hemolytic anemia due to hexokinase deficiency 


143200 


Wagner syndrome 


143200 


Erosive vitreoretinopathy 


145001 


Hyperparathyroidism-jaw tumor syndrome 


146760 


[IgG receptor I, phagocytic, familial deficiency of| 


146790 


Lupus nephritis, susceptibility to 


147050 


Atopy 


148900 


Klippel-Feil syndrome with laryngeal malformation 


151385 


Leukemia, acute myeloid 


151390 


Leukemia, acute T-cell 
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151670 


Hepatic lipase deficiency 


152445 


Vohwinkel syndrome, 124500 


152445 


Erythrokeratoderma, progressive symmetric, 602036 


153700 


Macular dystrophy, vitelliform type 


154545 


Chronic infections, due to opsonin defect 


155555 


[Red hair/fair skin] 


155555 


UV-induced skin damage, vulnerability to 


159001 


Muscular dystrophy, hmb-girdle, type IB 


160980 


Carney myxoma-endocrine complex 


161015 


Mitochondrial complex I deficiency, 252010 


164009 


Leukemia, acute promyelocytic, NUMA/RARA type 


164500 


Spinocerebellar ataxia-7 


164920 


Piebaldism 


164920 


Mast cell leukemia 


164920 


Mastocytosis with associated hematologic disorder 


168461 


Multiple myeloma, 254250 


168461 


Parathyroid adenomatosis 1 


168461 


Centrocytic lymphoma 


168468 


Metaphyseal chondrodysplasia, Murk Jansen type, 156400 


168500 


Parietal foramina 


170650 


Periodontitis, juvenile 


171650 


Lysosomal acid phosphatase deficiency 


171760 


Hypophosphatasia, adult, 146300 


171760 


Hypophosphatasia, infantile, 241500 


171860 


Hemolytic anemia due to phosphofructokinase deficiency 


173610 


Platelet alpha/delta storage pool deficiency 


174000 


Medullary cystic kidney disease, AD 


174810 


Osteolysis, familial expansile 


176640 


Creutzfeldt- Jakob disease, 123400 


176640 


Gerstmann-Straussler disease, 137440 


176640 


Insomnia, fatal familial 


176880 


Protein S deficiency 


176930 


Dysprothrombinemia 


176930 


Hypoprothrombmernia 


178300 


Ptosis, hereditary congenital, 1 


179615 


Reticulosis, familial histiocytic, 267700 


179615 


Severe combined immunodeficiency, B cell-negative, 601457 


179616 


Severe combined immunodeficiency, B cell-negative, 601457 


179755 


Renal cell carcinoma, papillary, 1 


180105 


Retinitis pigmentosa- 10 


180200 


Osteosarcoma, 259500 


180200 


Pinealoma with bilateral retinoblastoma 


180200 


Retinoblastoma 


180200 


Bladder cancer, 109800 


180385 


Leukemia, acute T-cell 


180721 


Retinitis pigmentosa, digenic 


180840 


Susceptibility to EDDM 


181510 


Schizophrenia 


182280 


Small-cell cancer of lung 


182860 


Pyropoikilocytosis 


182860 


Spherocytosis, recessive 


182860 


Elliptocytosis-2 I 
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186580 


Arthrocutaneouveal granulomatosis 


188826 


Sorsby fundus dystrophy, 136900 


189800 


Preeclampsia/eclampsia 


190685 


Down syndrome 1 


191181 


Cervical carcinoma 


191315 


Insensitivity to pain, congenital, with anhidrosis, 256800 


192090 


Ovarian carcinoma 


192090 


Breast cancer, lobular 


192090 


Endometrial carcinoma 


192090 


Gastric cancer, familial, 137215 


193235 


Vitreoretinopathy, neovascular inflammatory 


193300 


Renal cell carcinoma 


193300 


von Hippel-Lindau syndrome 


194070 


Wilms tumor, type 1 


194070 


Denys-Drash syndrome 


194070 


Frasier syndrome, 136680 


208400 


Aspartylglucosaminuria 


209901 


Bardet-Biedl syndrome 1 


212138 


Carnitme-acylcamitine translocase deficiency 


216550 


Cohen syndrome 


222800 


Hemolytic anemia due to bisphosphoglycerate mutase deficiency 


222900 


Sucrose intolerance 


227646 


Fanconi anemia, type D 


227650 


Fanconi anemia, type A 


230800 


Gaucher disease 


230800 


Gaucher disease with cardiovascular calcification 


231675 


Glutaricaciduria, type EC 


231680 


Glutaricaciduria, type 1IA 


232500 


Glycogen storage disease IV 


232600 


McArdle disease 


233700 


Chronic granulomatous disease due to deficiency of NCF-1 


236100 


Holoprosencephaly- 1 


236200 


Homocystinuria, B6-responsive and nonresponsive types 


236700 


McKusick-Kaufman syndrome 


240300 


Autoimmune polyglandular disease, type I 


245349 


Lacticacidemia due to PDX1 deficiency 


245900 


Norum disease 


245900 


Fish-eye disease 


249100 


Familial Mediterranean fever 


250850 


Hypermethioninemia, persistent, autosomal dominant, due to methionine 
adenosyltransferase I/HI deficiency 


253000 


Mucopolysaccharidosis IVA 


253200 


Maroteaux-Lamy syndrome, several forms 


255800 


Schwartz Jampel syndrome 


259700 


Osteopetrosis, recessive 


259770 


Osteoporosis-psendoglioma syndrome 


259900 


Hyperoxaluria, primary, type 1 


266200 


Anemia, hemolytic, due to PK deficiency 


266600 


Inflammatory bowel disease- 1 


267750 


Knobloch syndrome 


268800 


Sandhoff disease, infantile, juvenile, and adult forms 


268800 


Spinal muscular atrophy, HEXB -related 
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272800 


Tay-Sachs disease 


272800 


[Hex A pseudodeficiency] 


272800 


GM2-gangliosidosis, juvenile, adult 


274180 


Thromboxane synthase deficiency 


276600 


Tyrosinemia, type II 


276700 


Tyrosinemia, type I 


300011 


Menkes disease, 309400 




Occipital horn syndrome, 304150 


300011 


Cutis laxa, neonatal 


300046 


Mental retardation, X-linked 23, nonspecific 


300047 


Mental retardation, X-linked 20 


300067 


Subcortical laminar heterotopia, X-linked dominant 


300067 


Lissencephaly, X-linked 


300071 


Night blindness, congenital stationary, type 2 


300075 


Coffin-Lowry syndrome, 303600 


300077 


Mental retardation, X-linked 29 \ 


300110 


Night blindness, congenital stationary, X-linked incomplete, 300071 


300121 


Subcortical laminal heteropia, X-linked, 300067 


300121 


Lissencephaly, X-linked, 300067 


300127 


Mental retardation, X-linked, 60 


300600 


Ocular albinism, Forsius-Eriksson type 


301000 


Thrombocytopenia, X-linked, 313900 


301000 


Wiskott-Aldrich syndrome 


■^019/10 

JU1 ZA>1_/ 


Amelogenesis imperfecta 


301201 


Amelogenesis imperfecta-3, hypoplastic type 




Arthrogryposis, X-linked (spinal muscular atrophy, infantile, X-linked) 


301835 


Arts syndrome 




Nance-Horan syndrome 


iUZoUl 


Charcot-Marie-Tooth neuropathy, X-linked-2, recessive 




Heterocellular hereditary persistence of fetal hemoglobin, Swiss type 




FG syndrome 


3UoUUU 


Glycogenosis. X-linked hepatic* tvoe I 


jUoUUU 


Glycogenosis, X-linked hepatic, type II 


1fY7Qf\f\ 
D\)/oW 


TTvnonho<inhatpmta hf*rf*ditflrv 


D\JOO\J\J 


Keratosis follicularis spinulosa decalvans 




IVXC/lILO-l 1 t^LtU \JdLLUil, A U 11 HO LI, o Y 1JU.L J, VY I LI 1 i3 puaLl^/ b 




RpnnAnmnty Q\m r?TO ttvp— 1 
XxCJJpC/llill Hg o y HULL X 


309510 


A/Tpntftl Tf^JrirflatifYn y-linlrpH ^vnrfrnrrnr-l with Hv<itnnif* mnvprnpntQ 

IVXVUUll l&UUUaUUll) /V UllIVV^H, O Y U11JL1L- A, Willi UYjIvLUv Lilv V OmV-lI - 

ataxia and seizures 


309605 


Mental retardation X-linked. svndromic-4 with congenital contractures 
and low fingertip arches 


309610 


Mental retardation. X-linked, svndromic-2. with dysmorphism and cerebral 
atrophy 


309850 


B runner syndrome 


311050 


Optic atrophy, X-linked 


311200 


Orai-facial-digital syndrome 1 


311850 


Phosphoribosyl pyrophosphate synthetase-related gout 


312040 


N syndrome, 310465 


312060 


Properdin deficiency, X-linked 


312170 


Pyruvate dehydrogenase deficiency 


312700 


Retinoschisis 


313400 


Spondyloepiphyseal dysplasia tarda 
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313700 


Perineal hvoosnadias 


313700 


Prostate cancer 


313700 


Spinal and bulbar muscular atrophy of Kennedy, 313200 


313700 


Breast cancer, male, with Reifenstein syndrome 


313700 


Androgen insensitivitv several forms 


314580 


Wieacker-Wolff syndrome 


600045 


Xeroderma pianentosum, group E, subtype 2 


600065 


Leukocyte* adhesion deficiency, 116920 


600079 


Colon cancer 


600151 


Bardet-Biedl syndrome 3 


600163 


Lons OT syndrome -3 


600223 


Spinocerebellar ataxia-4 


600319 


Diabetes mellitus, insul in-dependent. 4 


600354 


Spinal muscular atxoohv-L 253300 


600354 


Soinal muscular atroDhv~2. 253550 


600354 

XX XX V ^X ^X i 


Spinal muscular atronhv-3. 253400 


600359 

V XX XX *x xx ^ 


Bartter svndrome tvoe 2 


600374 

XX XX V # ■ 


Bardet-Biedl svndrome 4 


600528 


CPT deficiency, hepatic, type 1, 255120 


600623 


Prostate cancer, 176807 


600631 


Enuresis, nocturnal, 1 


600678 


Cancer susceptibility 


600760 


Pseudohvpoaldosteronism type I 264350 

X W Xf'X*^* X-' T XX VMJ|X« XXXX XX^A. VllAUAJUt XT ' X^ i « MX* ■ *X ~X XX 


600760 


Liddle syndrome, 177200 


600761 

xx x^ xx # x^ 


Pseudohvpoaldosteronism tvoe L 264350 


600761 


Liddle syndrome, 177200 


600795 


Dementia familial, nonspecific 


600808 

XX VX XX W W VJ 


Enuresis nocturnal 2 

t «ll XXWJLkX y XX XX XX L»X*A A.M* 1 y Am* 


600811 


Xeroderma oiementosum eroun E DDB-neeative subtvne 278740 


600850 

XX XX XX XX w" XX 


Schizophrenia disorder-4 


600882 

VX XX XX XX XX Mr 


Charcot-Marie-Tooth neuronathv-2B 

WlMUVX/t 1T1U1 I v X Vx Vx Vj JL JLx%> 111 \x LfUUIJl T jtxJLx' 


600887 

XX XXXX XX Vx r 


Endometrial carcinoma 

XjIIUVLUvU 1(11 VlUviUUlilU 


600897 

VxxVxv^x r 


Cataract 7onnlar milvprnlent-l 116700 

xxUUIilivlj IjvIIUIIU, Lx KMX T xxl. UivilL X » X IV/uVV/ 


600900 

v/WX XX XX 


Muscular dvstrorjhv limb-girdle tvne 2E 


600958 

XX vx vx .x xX XX 


Cardiomvooathv familial hvoertroohic 4 115197 

xx UJl V** WX1 1 J Vx L/UU A Jf y iUXIliUUl _J f-'**'*. LI V/ LxXXXxx ^ T j JL X. xx X <x / 


600975 


Glaucoma 3. priinarv infantile. B 


601072 


Deafness autosomal recessive 8 


601105 

XX XX ^» x^ ^» 


Pvcnodvsostosis 265800 


601145 


Enilepsv progressive myoclonic I 254800 

JW^JXvi/l^ ¥ * LXX ^X^ fm—m. Vk/wA ▼ XX XXX V XXXX X XX JKX X^X X. x MX * V* VX XX 


601284 


Hereditarv hemorrhagic teianeiectasia-2. 600376 

A AVA lrXP* V'i 1 J JUVAUVA AAlUwIV tVWl IAI»AWWUA|4 ^X ^ \J XX VX ^X # XX 


601362 

XXXX A- *X V«V 


IxiGeor&e svndrome/velocardiofacial svndrome comolex-2 

4^ Jl XX WA fcjV kX T XXXJfcX V/XiAVI ▼ V/lVArt** W1V1UV1U1 fcX T A A **X. vx JLXXXX VVU-lk/lVJl Ami 


601386 

XX XX A *X XX XX 


Deafness autosomal recessive 12 


601412 

XX XX ■ * 


Deafness, autosomal dominant 7 


601493 

XX XX A I ^ 


CardiomvoDathv. dilated 1C 

V/UX **ax/X*a f Vx lx T y v* JLX X* v VX X* A V 


601567 


Combined factor V and VIH deficiency, 227300 


601652 


Glaucoma 1A, primary open angle, juvenile-onset, 137750 


601669 


Hirschsprung disease, one form 


601769 


Osteoporosis, involutional 


601769 


Rickets, vitamin D-resistant, 277440 


601780 


Ceroid-lipofuscinosis, neuronal-6, variant late infantile 


601863 


Bare lymphocyte syndrome, complementation group C 


601884 


[High bone mass] 
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601920 


Alagille syndrome, 1 ] 8450 


602080 


Paget disease of bone-2 




602092 


Deafness, autosomal recessive 18 


602116 


Glioma 


602491 


Hyperlipidemia, familial combined, 1 


602568 


Homocystmuria-megaloblastic anemia, .cbl E type, 236270 


602574 


Deafness, autosomal dominant 12, 601842 


602574 


Deafness, autosomal dominant 8, 601543 


602783 


Spastic paraplegia-7 



Mature Polypeptides 

The present invention also encompasses mature forms of a polypeptide having the amino 
acid sequence of SEQ ID NO:Y and/or the amino acid sequence encoded by the cDNA in a 
deposited clone. Polynucleotides encoding the mature forms (such as, for example, the 
polynucleotide sequence in SEQ ID NO:X and/or the polynucleotide sequence contained in the 
cDNA of a deposited clone) are also encompassed by the invention. Moreover, fragments or 
variants of these polypeptides (such as, fragments as described herein, polypeptides at least 80%, 
85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to these polypeptides, or polypeptides 
encoded by a polynucleotide that hybridizes under stringent conditions to the complementary 
strand of the polynucleotide encoding these polypeptides) are also encompassed by the invention. 
In preferred embodiments, these fragments or variants retain one or more functional acitivities of 
the full-length or mature form of the polypeptide (e.g., biological activity (such as, for example, 
activity useful in detecting, preventing, diagnosing, prognosticating, treating, and/or ameliorating 
gastrointestinal disorders), antigenicity (ability to bind, or compete with a polypeptide of the 
invention for binding, to an anti-polypeptide of the invention antibody), immunogenicity (ability 
to generate antibody which binds to a specific polypeptide of the invention), ability to form 
multimers with polypeptides of the invention, and ability to bind to a receptor or ligand for a 
polypeptide of the invention). Antibodies that bind the polypeptides of the invention, and 
polynucleotides encoding these polypeptides are also encompassed by the invention. 

According to the signal hypothesis, proteins secreted by mammalian cells have a signal or 
secretary leader sequence which is cleaved from the mature protein once export of the growing 
protein chain across the rough endoplasmic reticulum has been initiated. Most mammalian cells 
and even insect cells cleave secreted proteins with the same specificity. However, in some cases, 
cleavage of a secreted protein is not entirely uniform, which results in two or more mature species 
of the protein. Further, it has long been known that cleavage specificity of a secreted protein is 
ultimately determined by the primary structure of the complete protein, that is, it is inherent in the 
amino acid sequence of the polypeptide. 

Methods for predicting whether a protein has a signal sequence, as well as the cleavage 
point for that sequence, are available. For instance, the method of McGeoch, Virus Res. 3:271- 

513 



WO 02/076488 PCTAJS02/08276 

286 (1985), uses the information from a short N-tenninal charged region and a subsequent 
uncharged region of the complete (uncleaved) protein. The method of von Heinje, Nucleic Acids 
Res. 14:4683-4690 (1986) uses the information from the residues surrounding the cleavage site, 
typically residues -13 to +2, where +1 indicates the amino terminus of the secreted protein. The 
5 accuracy of predicting the cleavage points of known mammalian secretory proteins for each of 
these methods is in the range of 75-80%. (von Heinje, supra.) However, the two methods do not 
always produce the same predicted cleavage point(s) for a given protein. 

In the present case, the deduced amino acid sequence of the secreted polypeptide was 
analyzed by a computer program called SignalP (Henrik Nielsen et al., Protein Engineering 10:1-6 

10 (1997)), which predicts the cellular location of a protein based on the amino acid sequence. As 
part of this computational prediction of localization, the methods of McGeoch and von Heinje are 
incorporated. The analysis of the amino acid sequences of the secreted proteins described herein 
by this program provided the results shown in Table 1 A. 

In specific embodiments, polypeptides of the invention comprise, or alternatively consist 

15 of, the predicted mature form of the polypeptide as delineated in columns 14 and 15 of Table 1A. 
Moreover, fragments or variants of these polypeptides (such as, fragments as described herein, 
polypeptides at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to these 
polypeptides, or polypeptides encoded by a polynucleotide that hybridizes under stringent 
conditions to the complementary strand of the polynucleotide encoding these polypeptides) are 

20 also encompassed by the invention. In preferred embodiments, these fragments or variants retain 
one or more functional acitivities of the full-length or mature form of the polypeptide (e.g., 
biological activity (such as, for example, activity useful in detecting, preventing, diagnosing, 
prognosticating, treating, and/or ameliorating gastrointestinal disorders), antigenicity (ability to 
bind, or compete with a polypeptide of the invention for binding, to an anti-polypeptide of the 

25 invention antibody), immunogenicity (ability to generate antibody which binds to a specific 
polypeptide of the invention), ability to form multimers with polypeptides of the invention, and 
ability to bind to a receptor or ligand for a polypeptide of the invention). Antibodies that bind the 
polypeptides of the invention, and polynucleotides encoding these polypeptides are also 
encompassed by the invention. 

30 Polynucleotides encoding proteins comprising, or consisting of, the predicted mature form 

of polypeptides of the invention (e.g., polynucleotides having the sequence of SEQ ID NO: X 
(Table 1 A, column 4), the sequence delineated in columns 7 and 8 of Table 1A, and a sequence 
encoding the mature polypeptide delineated in columns 14 and 15 of Table 1 A (e.g., the sequence 
of SEQ ID NO:X encoding the mature polypeptide delineated in columns 14 and 15 of Table 1)) 

35 are also encompassed by the invention, as are fragments or variants of these polynucleotides (such 
as, fragments as described herein, polynucleotides at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 

m 
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99%, or 100% identical to these polyncueotides, and nucleic acids which hybridizes under 
stringent conditions to the complementary strand of the polynucleotide). 

As one of ordinary skill would appreciate, however, cleavage sites sometimes vary from 
organism to organism and cannot be predicted with absolute certainty. Accordingly, the present 
invention provides secreted polypeptides having a sequence shown in SEQ ID NO:Y which have 
an N^erminus beginning within 15 residues of the predicted cleavage point (i.e., having 1, 2, 3, 4, 
5, 6, 7, 8 , 9, 10, 11, 12, 13, 14, or 15 more or less contiguous residues of SEQ ID NO:Y at the N- 
terminus when compared to the predicted mature form of the polypeptide (e.g., the mature 
polypeptide delineated in columns 14 and 15 of Table 1). Similarly, it is also recognized that in 
some cases, cleavage of the signal sequence from a secreted protein is not entirely uniform, 
resulting in more than one secreted species. These polypeptides, and the polynucleotides encoding 
such polypeptides, are contemplated by the present invention. 

Moreover, the signal sequence identified by the above analysis may not necessarily 
predict the naturally occurring signal sequence. For example, the naturally occurring signal 
sequence may be further upstream from the predicted signal sequence. However, it is likely that 
the predicted signal sequence will be capable of directing the secreted protein to the ER. 
Nonetheless, the present invention provides the mature protein produced by expression of the 
polynucleotide sequence of SEQ ID NO:X and/or the polynucleotide sequence contained in the 
cDNA of a deposited clone, in a mammalian cell (e.g., COS cells, as desribed below). These 
polypeptides, and the polynucleotides encoding such polypeptides, are contemplated by the 
present invention. 

Polynucleotide and Polypeptide Variants 

The present invention is also directed to variants of the polynucleotide sequence disclosed 
in SEQ ID NO:X or the complementary strand thereto, nucleotide sequences encoding the 
polypeptide of SEQ ID NO:Y, the nucleotide sequence of SEQ ID NO:X that encodes the 
polypeptide sequence as defined in columns 13 and 14 of Table 1A, nucleotide sequences 
encoding the polypeptide sequence as defined in columns 13 and 14 of Table 1A, the nucleotide 
sequence of SEQ ID NO:X encoding the polypeptide sequence as defined in Table IB, the 
nucleotide sequence as defined in columns 8 and 9 of Table 2, nucleotide sequences encoding the 
polypeptide encoded by the nucleotide sequence as defined in columns 8 and 9 of Table 2, the 
nucleotide sequence as defined in column 6 of Table 1C, nucleotide sequences encoding the 
polypeptide encoded by the nucleotide sequence as defined in column 6 of Table 1C, the cDNA 
sequence contained in ATCC Deposit No:Z, nucleotide sequences encoding the polypeptide 
encoded by the cDNA sequence contained in ATCC Deposit No:Z, and/or nucleotide sequences 
encoding a mature (secreted) polypeptide encoded by the cDNA sequence contained in ATCC 
Deposit No:Z. 
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The present invention also encompasses variants of the polypeptide sequence disclosed in 
SEQ ID NO:Y, the polypeptide as defined in columns 13 and 14 of Table 1A, the polypeptide 
sequence as defined in columns 6 and 7 of Table 1B.1, a polypeptide sequence encoded by the 
polynucleotide sequence in SEQ ID NO:X, a polypeptide sequence encoded by the nucleotide 
5 sequence as defined in columns 8 and 9 of Table 2, a polypeptide sequence encoded by the 
nucleotide sequence as defined in column 6 of Table 1C, a polypeptide sequence encoded by the 
complement of the polynucleotide sequence in SEQ ID NO:X, the polypeptide sequence encoded 
by the cDNA sequence contained in ATCC Deposit No:Z and/or a mature (secreted) polypeptide 
encoded by the cDNA sequence contained in ATCC Deposit No:Z. 
10 "Variant" refers to a polynucleotide or polypeptide differing from the polynucleotide or 

polypeptide of the present invention, but retaining essential properties thereof. Generally, variants 
are overall closely similar, and, in many regions, identical to the polynucleotide or polypeptide of 
the present invention. 

Thus, one aspect of the invention provides an isolated nucleic acid molecule comprising, 

15 or alternatively consisting of, a polynucleotide having a nucleotide sequence selected from the 
group consisting of: (a) a nucleotide sequence described in SEQ ID NO:X or contained in the 
cDNA sequence of ATCC Deposit No:Z; (b) a nucleotide sequence in SEQ ID NO:X or the cDNA 
in ATCC Deposit No:Z which encodes the complete amino acid sequence of SEQ ID NO:Y or the 
complete amino acid sequence encoded by the cDNA in ATCC Deposit No:Z; (c) a nucleotide 

20 sequence in SEQ ID NO:X or the cDNA in ATCC Deposit No:Z which encodes a mature 
polypeptide (i.e., a secreted polypeptide (e.g., as delineated in columns 14 and 15 of Table 1A)); 
(d) a nucleotide sequence in SEQ ID NO:X or the cDNA sequence of ATCC Deposit No:Z, which 
encodes a biologically active fragment of a polypeptide; (e) a nucleotide sequence in SEQ ID 
NO:X or the cDNA sequence of ATCC Deposit No:Z, which encodes an antigenic fragment of a 

25 polypeptide; (f) a nucleotide sequence encoding a polypeptide comprising the complete amino 
acid sequence of SEQ ID NO:Y or the complete amino acid sequence encoded by the cDNA in 
ATCC Deposit No:Z; (g) a nucleotide sequence encoding a mature polypeptide of the amino acid 
sequence of SEQ ID NO:Y (i.e., a secreted polypeptide (e.g., as delineated in columns 14 and 15 
of Table 1 A)) or a mature polypeptide of the amino acid sequence encoded by the cDNA in ATCC 

30 Deposit No:Z ; (h) a nucleotide sequence encoding a biologically active fragment of a polypeptide 
having the complete amino acid sequence of SEQ ID NO: Y or the complete amino acid sequence 
encoded by the cDNA in ATCC Deposit No:Z; (i) a nucleotide sequence encoding an antigenic 
fragment of a polypeptide having the complete amino acid sequence of SEQ ID NO:Y or the 
complete amino acid sequence encoded by the cDNA in ATCC Deposit No:Z; and (j) a nucleotide 

35 sequence complementary to any of the nucleotide sequences in (a), (b), (c), (d), (e), (f), (g), (h), or 
(i) above. 
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The present invention is also directed to nucleic acid molecules which comprise, or 
alternatively consist of, a nucleotide sequence which is at least 80%, 85%, 90%, 95%, 96%, 97%, 
98%, 99% or 100%, identical to, for example, any of the nucleotide sequences in (a), (b), (c), (d), 
(«0> (f)> (g). (h), (i), or (j) above, the nucleotide coding sequence in SEQ ID NO:X or the 
5 complementary strand thereto, the nucleotide coding sequence of the cDNA contained in ATCC 
Deposit No:Z or the complementary strand thereto, a nucleotide sequence encoding the 
polypeptide of SEQ ID NO:Y, a nucleotide sequence encoding a polypeptide sequence encoded by 
the nucleotide sequence in SEQ ID NO:X, a polypeptide sequence encoded by the complement of 
the polynucleotide sequence in SEQ ID NO:X, a nucleotide sequence encoding the polypeptide 

10 encoded by the cDNA contained in ATCC Deposit No:Z, the nucleotide coding sequence in SEQ 
ID NO:X as defined in columns 8 and 9 of Table 2 or the complementary strand thereto, a 
nucleotide sequence encoding the polypeptide encoded by the nucleotide sequence in SEQ ID 
NO:X as defined in columns 8 and 9 of Table 2 or the complementary strand thereto, the 
nucleotide coding sequence in SEQ ID NO:B as defined in column 6 of Table 1C or the 

15 complementary strand thereto, a nucleotide sequence encoding the polypeptide encoded by the 
nucleotide sequence in SEQ ID NO:B as defined in column 6 of Table 1C or the complementary 
strand thereto, the nucleotide sequence in SEQ ID NO:X encoding the polypeptide sequence as 
defined in columns 6 and 7 of Table 1B.1 or the complementary strand thereto, nucleotide 
sequences encoding the polypeptide as defined in column 6 and 7 of Table IB. I or the 

20 complementary strand thereto, and/or polynucleotide fragments of any of these nucleic acid 
molecules (e.g., those fragments described herein). Polynucleotides which hybridize to the 
complement of these nucleic acid molecules under stringent hybridization conditions or 
alternatively, under lower stringency conditions, are also encompassed by the invention, as are 
polypeptides encoded by these polynucleotides and nucleic acids. 

25 In a preferred embodiment, the invention encompasses nucleic acid molecules which 

comprise, or alternatively, consist of a polynucleotide which hybridizes under stringent 
hybridization conditions, or alternatively, under lower stringency conditions, to a polynucleotide in 
(a), (b), (c), (d), (e), (f), (g), (h), or (i), above, as are polypeptides encoded by these 
polynucleotides. In another preferred embodiment, polynucleotides which hybridize to the 

30 complement of these nucleic acid molecules under stringent hybridization conditions, or 
alternatively, under lower stringency conditions, are also encompassed by the invention, as are 
polypeptides encoded by these polynucleotides. 

In another embodiment, the invention provides a purified protein comprising, or 
alternatively consisting of, a polypeptide having an amino acid sequence selected from the group 

35 consisting of: (a) the complete amino acid sequence of SEQ ID NO:Y or the complete amino acid 
sequence encoded by the cDNA in ATCC Deposit No:Z; (b) the amino acid sequence of a mature 
(secreted) form of a polypeptide having the amino acid sequence of SEQ ID NO:Y (e.g., as 
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delineated in columns 14 and 15 of Table 1A) or a mature form of the amino acid sequence 
encoded by the cDNA in ATCC Deposit No:Z mature; (c) the amino acid sequence of a 
biologically active fragment of a polypeptide having the complete amino acid sequence of SEQ ID 
NO:Y or the complete amino acid sequence encoded by the cDNA in ATCC Deposit No:Z; and 
5 (d) the amino acid sequence of an antigenic fragment of a polypeptide having the complete amino 
acid sequence of SEQ ID NO:Y or the complete arnino acid sequence encoded by the cDNA in 
ATCC Deposit No:Z. 

The present invention is also directed to proteins which comprise, or alternatively consist 
of, an amino acid sequence which is at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 

10 100%, identical to, for example, any of the amino acid sequences in (a), (b), (c), or (d), above, the 
arnino acid sequence shown in SEQ ID NO:Y, the amino acid sequence encoded by the cDNA 
contained in ATCC Deposit No:Z, the amino acid sequence of the polypeptide encoded by the 
nucleotide sequence in SEQ ID NO:X as defined in columns 8 and 9 of Table 2, the amino acid 
sequence of the polypeptide encoded by the nucleotide sequence in SEQ ID NO:B as defined in 

15 column 6 of Table 1C, the amino acid sequence as defined in columns 6 and 7 of Table 1B.1, an 
amino acid sequence encoded by the nucleotide sequence in SEQ ID NO:X, and an amino acid 
sequence encoded by the complement of the polynucleotide sequence in SEQ ID NO:X. 
Fragments of these polypeptides are also provided (e.g., those fragments described herein). Further 
proteins encoded by polynucleotides which hybridize to the complement of the nucleic acid 

20 molecules encoding these arnino acid sequences under stringent hybridization conditions or 
alternatively, under lower stringency conditions, are also encompassed by the invention, as are the 
polynucleotides encoding these proteins. 

By a nucleic acid having a nucleotide sequence at least, for example, 95% "identical" to a 
reference nucleotide sequence of the present invention, it is intended that the nucleotide sequence 

25 of the nucleic acid is identical to the reference sequence except that the nucleotide sequence may 
include up to five point mutations per each 100 nucleotides of the reference nucleotide sequence 
encoding the polypeptide. In other words, to obtain a nucleic acid having a nucleotide sequence at 
least 95% identical to a reference nucleotide sequence, up to 5% of the nucleotides in the reference 
sequence may be deleted or substituted with another nucleotide, or a number of nucleotides up to 

30 5% of the total nucleotides in the reference sequence may be inserted into the reference sequence. 
The query sequence may be an entire sequence referred to in Table IB or 2 as the ORF (open 
reading frame), or any fragment specified as described herein. 

As a practical matter, whether any particular nucleic acid molecule or polypeptide is at 
least 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the 

35 present invention can be determined conventionally using known computer programs. A preferred 
method for detennining the best overall match between a query sequence (a sequence of the 
present invention) and a subject sequence, also referred to as a global sequence ahgnrnent, can be 
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determined using the FASTDB computer program based on the algorithm of Brutlag et al. (Corap. 
App. Biosci. 6:237-245 (1990)). In a sequence alignment the query and subject sequences are both 
DNA sequences. An RNA sequence can be compared by converting LPs to T's. The result of said 
global sequence alignment is expressed as percent identity. Preferred parameters used in a 

5 FASTDB alignment of DNA sequences to calculate percent identity are: Matrix=Unitary, k- 
tuple=4, Mismatch Penalty=l, Joining Penalty=30, Randomization Group Length=0, Cutoff 
Score=l, Gap Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the length of the subject 
nucleotide sequence, whichever is shorter. 

If the subject sequence is shorter than the query sequence because of 5' or 3' deletions, not 

10 because of internal deletions, a manual correction must be made to the results. This is because the 
FASTDB program does not account for 5' and 3' truncations of the subject sequence when 
calculating percent identity. For subject sequences truncated at the 5' or 3* ends, relative to the 
query sequence, the percent identity is corrected by calculating the number of bases of the query 
sequence that are 5' and 3* of the subject sequence, which are not matched/aligned, as a percent of 

15 the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by 
results of the FASTDB sequence alignment. This percentage is then subtracted from the percent 
identity, calculated by the above FASTDB program using the specified parameters, to arrive at a 
final percent identity score. This corrected score is what is used for the purposes of the present 
invention. Only bases outside the 5' and 3' bases of the subject sequence, as displayed by the 

20 FASTDB alignment, which are not matched/aligned with the query sequence, are calculated for 
the purposes of manually adjusting the percent identity score. 

For example, a 90 base subject sequence is aligned to a 100 base query sequence to 
determine percent identity. The deletions occur at the 5' end of the subject sequence and 
therefore, the FASTDB alignment does not show a matched/alignment of the first 10 bases at 5' 

25 end. The 10 unpaired bases represent 10% of the sequence (number of bases at the 5* and 3' ends 
not matched/total number of bases in the query sequence) so 10% is subtracted from the percent 
identity score calculated by the FASTDB program. If the remaining 90 bases were perfectly 
matched the final percent identity would be 90%. In another example, a 90 base subject sequence 
is compared with a 100 base query sequence. This time the deletions are internal deletions so that 

30 there are no bases on the 5' or 3' of the subject sequence which are not matched/aligned with the 
query. In this case the percent identity calculated by FASTDB is not manually corrected. Once 
again, only bases 5' and 3' of the subject sequence which are not matched/aligned with the query 
sequence are manually corrected for. No other manual corrections are to be made for the purposes 
of the present invention. 

35 By a polypeptide having an amino acid sequence at least, for example, 95% "identical" to 

a query amino acid sequence of the present invention, it is intended that the amino acid sequence 
of the subject polypeptide is identical to the query sequence except that the subject polypeptide 
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sequence may include up to five amino acid alterations per each 100 amino acids of the query 
amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at 
least 95% identical to a query amino acid sequence, up to 5% of the amino acid residues in the 
subject sequence may be inserted, deleted, (indels) or substituted with another amino acid. These 
5 alterations of the reference sequence may occur at the amino or carboxy terminal positions of the 
reference amino acid sequence or anywhere between those terminal positions, interspersed either 
individually among residues in the reference sequence or in one or more contiguous groups within 
the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 80%, 85%, 90%, 95%, 

10 96%, 97%, 98% or 99% identical to, for instance, the amino acid sequence of a polypeptide 
referred to in Table 1A (e.g., the amino acid sequence delineated in columns 14 and 15) or a 
fragment thereof, Table 1B.1 (e.g., the amino acid sequence identified in column 6) or a fragment 
thereof, Table 2 (e.g., the amino acid sequence of the polypeptide encoded by the polynucleotide 
sequence defined in columns 8 and 9 of Table 2) or a fragment thereof, the amino acid sequence of 

15 the polypeptide encoded by the polynucleotide sequence in SEQ ID NO:B as defined in column 6 
of Table 1C or a fragment thereof, the amino acid sequence of the polypeptide encoded by the 
nucleotide sequence in SEQ ID NO:X or a fragment thereof, or the amino acid sequence of the 
polypeptide encoded by cDNA contained in ATCC Deposit No:Z, or a fragment thereof, the amino 
acid sequence of a mature (secreted) polypeptide encoded by cDNA contained in ATCC Deposit 

20 No:Z> or a fragment thereof, can be determined conventionally using known computer programs. 
A preferred method for deternaining the best overall match between a query sequence (a sequence 
of the present invention) and a subject sequence, also referred to as a global sequence alignment, 
can be determined using the FASTDB computer program based on the algorithm of Brutlag et al. 
(Comp. App. Biosci.6:237-245 (1990)). In a sequence alignment the query and subject sequences 

25 are either both nucleotide sequences or both amino acid sequences. The result of said global 
sequence alignment is expressed as percent identity. Preferred parameters used in a FASTDB 
amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch Penalty=l, Joining Penalty =20, 
Randomization Group Length=0, Cutoff Score=l, Window Size=sequence length, Gap Penalty=5, 
Gap Size Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, 

30 whichever is shorter. 

If the subject sequence is shorter than the query sequence due to N- or C-tenninal 
deletions, not because of internal deletions, a manual correction must be made to the results. This 
is because the FASTDB program does not account for N- and C-terminal truncations of the subject 
sequence when calculating global percent identity. For subject sequences truncated at the N- and 

35 C^ermini, relative to the query sequence, the percent identity is corrected by calculating the 
number of residues of the query sequence that are N- and C-terminal of the subject sequence, 
which are not matched/aligned with a corresponding subject residue, as a percent of the total bases 
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of the query sequence. Whether a residue is matched/aligned is determined by results of the 
FASTDB sequence alignment This percentage is then subtracted from the percent identity, 
calculated by the above FASTDB program using the specified parameters, to arrive at a final 
percent identity score. This final percent identity score is what is used for the purposes of the 
5 present invention. Only residues to the N- and C-termini of the subject sequence, which are not 
matched/aligned with the query sequence, are considered for the purposes of manually adjusting 
the percent identity score. That is, only query residue positions outside the farthest N- and C- 
terminal residues of the subject sequence. 

■ 

For example, a 90 amino acid residue subject sequence is aligned with a 100 residue query 

10 sequence to determine percent identity. The deletion occurs at the N-terminus of the subject 
sequence and therefore, the FASTDB alignment does not show a matching/alignment of the first 
10 residues at the N-terminus. The 10 unpaired residues represent 10% of the sequence (number 
of residues at the N- and C- termini not matched/total number of residues in the query sequence) 
so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the 

15 remaining 90 residues were perfectly matched the final percent identity would be 90%. In another 
example, a 90 residue subject sequence is compared with a 100 residue query sequence. This time 
the deletions are internal deletions so there are no residues at the N- or C-termini of the subject 
sequence which are not matched/aligned with the query. La this case the percent identity 
calculated by FASTDB is not manually corrected. Once again, only residue positions outside the 

20 N- and C-terminal ends of the subject sequence, as displayed in the FASTDB alignment, which 
are not matched/aligned with the query sequnce are manually corrected for. No other manual 
corrections are to made for the purposes of the present invention. 

The polynucleotide variants of the invention may contain alterations in the coding regions, 
non-coding regions, or both. Especially preferred are polynucleotide variants containing 

25 alterations which produce silent substitutions, additions, or deletions, but do not alter the 
properties or activities of the encoded polypeptide. Nucleotide variants produced by silent 
substitutions due to the degeneracy of the genetic code are preferred. Moreover, polypeptide 
variants in which less than 50, less than 40, less than 30, less than 20, less than 10, or 5-50, 5-25, 
5-10, 1-5, or 1-2 amino acids are substituted, deleted, or added in any combination are also 

30 preferred. Polynucleotide variants can be produced for a variety of reasons, e.g., to optimize 
codon expression for a particular host (change codons in the human mRNA to those preferred by a 
bacterial host such as R coli). 

Naturally occurring variants are called "allelic variants," and refer to one of several 
alternate forms of a gene occupying a given locus on a chromosome of an organism (Genes II, 

35 Lewin, B., ed., John Wiley & Sons, New York (1985)). These allelic variants can vary at either 
the polynucleotide and/or polypeptide level and are included in the present invention. 
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Alternatively, nonrnarurally occurring variants may be produced by mutagenesis techniques or by 
direct synthesis. 

Using known methods of protein engineering and recombinant DNA technology, variants 
may be generated to improve or alter the characteristics of the polypeptides of the present 
5 invention. For instance, one or more amino acids can be deleted from the N-terminus or C- 
terminus of the polypeptide of the present invention without substantial loss of biological function. 
As an example, Ron et al. (J. Biol. Chern. 268: 2984-2988 (1993)) reported variant KGF proteins 
having heparin binding activity even after deleting 3, 8, or 27 amino-tenninal amino acid residues. 
Similarly, Interferon gamma exhibited up to ten times higher activity after deleting 8-10 amino 
10 acid residues from the carboxy terminus of this protein. (Dobeli et al., J. Biotechnology 7:199-216 
(1988).) 

Moreover, ample evidence demonstrates that variants often retain a biological activity 
similar to that of the naturally occurring protein. For example, Gayle and coworkers (J. Biol. 
Chem. 268:22105-22111 (1993)) conducted extensive mutational analysis of human cytokine IL- 

15 la. They used random mutagenesis to generate over 3,500 individual EL-la mutants that averaged 
2.5 amino acid changes per variant over the entire length of the molecule. Multiple mutations 
were examined at every possible amino acid position. The investigators found that "[m]ost of the 
molecule could be altered with little effect on either [binding or biological activity]." In fact, only 
23 unique amino acid sequences, out of more than 3,500 nucleotide sequences examined, produced 

20 a protein that significantly differed in activity from wild-type. 

Furthermore, even if deleting one or more amino acids from the N-terminus or C-terminus 
of a polypeptide results in modification or loss of one or more biological functions, other 
biological activities may still be retained. For example, the ability of a deletion variant to induce 
and/or to bind antibodies which recognize the secreted form will likely be retained when less than 

25 the majority of the residues of the secreted form are removed from the N-terminus or C-terminus. 
Whether a particular polypeptide lacking N- or C-terminal residues of a protein retains such 
immunogenic activities can readily be determined by routine methods described herein and 
otherwise known in the art. 

Thus, the invention further includes polypeptide variants which show a biological or 

30 functional activity of the polypeptides of the invention (such as, for example, activity useful in 
detecting, preventing, diagnosing, prognosticating, treating, and/or ameliorating cardiovascular 
disorders). Such variants include deletions, insertions, inversions, repeats, and substitutions 
selected according to general rules known in the art so as have litde effect on activity. 

The present application is directed to nucleic acid molecules at least 80%, 85%, 90%, 

35 95%, 96%, 97%, 98%, 99% or 100% identical to the nucleic acid sequences disclosed herein, (e.g., 
encoding a polypeptide having the amino acid sequence of an N and/or C terminal deletion), 
irrespective of whether they encode a polypeptide having functional activity. This is because even 
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where a particular nucleic acid molecule does not encode a polypeptide having functional activity, 
one of skill in the art would still know how to use the nucleic acid molecule, for instance, as a 
hybridization probe or a polymerase chain reaction (PGR) primer. Uses of the nucleic acid 
molecules of the present invention that do not encode a polypeptide having functional activity 
5 include, inter alia, (1) isolating a gene or allelic or splice variants thereof in a cDNA library; (2) in 
situ hybridization (e.g., "FISH") to metaphase chromosomal spreads to provide precise 
chromosomal location of the gene, as described in Verma et al., Human Chromosomes: A Manual 
of Basic Techniques, Pergamon Press, New York (1988); (3) Northern Blot analysis for detecting 
mRNA expression in specific tissues (e.g., normal or diseased tissues); and (4) in situ 

10 hybridization (e.g., histochemistry) for detecting mRNA expression in specific tissues (e.g., 
normal or diseased tissues). 

Preferred, however, are nucleic acid molecules having sequences at least 80%, 85%, 90%, 
95%, 96%, 97%, 98%, 99% or 100% identical to the nucleic acid sequences disclosed herein, 
which do, in fact, encode a polypeptide having functional activity. By a polypeptide having 

15 "functional activity" is meant, a polypeptide capable of displaying one or more known functional 
activities associated with a full-length (complete) protein and/or a mature (secreted) protein of the 
invention. Such functional activities include, but are not limited to, biological activity (such as, 
for example, activity useful in detecting, preventing, diagnosing, prognosticating, treating, and/or 
ameliorating gastrointestinal diseases and disorders), antigenicity (ability to bind, or compete with 

20 a polypeptide of the invention for binding, to an anti-polypeptide of the invention antibody), 
immunogenicity (ability to generate antibody which binds to a specific polypeptide of the 
invention), ability to form multimers with polypeptides of the invention, and ability to bind to a 
receptor or ligand for a polypeptide of the invention. 

The functional activity of the polypeptides, and fragments, variants and derivatives of the 

25 invention, can be assayed by various methods. 

For example, in one embodiment where one is assaying for the ability to bind or compete 
with a full-length polypeptide of the present invention for binding to an anti-polypetide antibody, 
various immunoassays known in the art can be used, including but not limited to, competitive and 
non-competitive assay systems using techniques such as radioimmunoassays, ELISA (enzyme 

30 linked immunosorbent assay), "sandwich" immunoassays, immunoradiometric assays, gel 
diffusion precipitation reactions* immunodiffusion assays, in situ immunoassays (using colloidal 
gold, enzyme or radioisotope labels, for example), western blots, precipitation reactions, 
agglutination assays (e.g., gel agglutination assays, hemagglutination assays), complement fixation 
assays, immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, etc. In 

35 one embodiment, antibody binding is detected by detecting a label on the primary antibody. In 
another embodiment, the primary antibody is detected by detecting binding of a secondary 
antibody or reagent to the primary antibody. In a further embodiment, the secondary antibody is 
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labeled. Many means are known in the art for detecting binding in an immunoassay and are within 
the scope of the present invention. 

In another embodiment, where a ligand is identified, or the ability of a polypeptide 
fragment, variant or derivative of the invention to multimerize is being evaluated, binding can be 
assayed, e.g., by means well-known in the art, such as, for example, reducing and non-reducing gel 
chromatography, protein affinity chromatography, and affinity blotting. See generally, Phizicky et 
aL, Microbiol. Rev. 59:94-123 (1995). In another embodiment, the ability of physiological 
correlates of a polypeptide of the present invention to bind to a substrate(s) of the polypeptide of 
the invention can be routinely assayed using techniques known in the art 

In addition, assays described herein (see Examples) and otherwise known in the art may 
routinely be applied to measure the ability of polypeptides of the present invention and fragments, 
variants and derivatives thereof to elicit polypeptide related biological activity (either in vitro or in 
vivo). Other methods will be known to the skilled artisan and are within the scope of the 
invention. 

Of course, due to the degeneracy of the genetic code, one of ordinary skill in the art will 
immediately recognize that a large number of the nucleic acid molecules having a sequence at 
least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to, for example, the nucleic 
acid sequence of the cDNA contained in ATCC Deposit No:Z, the nucleic acid sequence referred 
to in Table IB (SEQ ID NO:X), the nucleic acid sequence disclosed in Table 1A (e.g., the nucleic 
acid sequence delineated in columns 7 and 8), the nucleic acid sequence disclosed in Table 2 (e.g., 
the nucleic acid sequence delineated in columns 8 and 9) or fragments thereof, will encode 
polypeptides "having functional activity." In fact, since degenerate variants of any of these 
nucleotide sequences all encode the same polypeptide, in many instances, this will be clear to the 
skilled artisan even without perforating the above described comparison assay. It will be further 
recognized in the art that, for such nucleic acid molecules that are not degenerate variants, a 
reasonable number will also encode a polypeptide having functional activity. This is because the 
skilled artisan is fully aware of amino acid substitutions that are either less likely or not likely to 
significantly effect protein function (e.g., replacing one aliphatic amino acid with a second 
aliphatic amino acid), as further described below. 

For example, guidance concerning how to make phenotypically silent amino acid 
substitutions is provided in Bowie et aL, "Deciphering the Message in Protein Sequences: 
Tolerance to Amino Acid Substitutions," Science 247:1306-1310 (1990), wherein the authors 
indicate that there are two main strategies for studying the tolerance of an amino acid sequence to 
change. 

The first strategy exploits the tolerance of amino acid substitutions by natural selection 
during the process of evolution. By comparing amino acid sequences in different species, 
conserved amino acids can be identified. These conserved amino acids are likely important for 
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protein function. In contrast, the amino acid positions where substitutions have been tolerated by 
natural selection indicates that these positions are not critical for protein function. Thus, positions 
tolerating amino acid substitution could be modified while still maintaining biological activity of 
the protein. 

5 The second strategy uses genetic engineering to introduce amino acid changes at specific 

positions of a cloned gene to identify regions critical for protein function. For example, site 
directed mutagenesis or alanine-scanning mutagenesis (introduction of single alanine mutations at 
every residue in the molecule) can be used. See Cunningham and Wells, Science 244:1081-1085 
(1989). The resulting mutant molecules can then be tested for biological activity. 

10 As the authors state, these two strategies have revealed that proteins are surprisingly 

tolerant of amino acid substitutions. The authors further indicate which amino acid changes are 
likely to be permissive at certain amino acid positions in the protein. For example, most buried 
(within the tertiary structure of the protein) amino acid residues require nonpolar side chains, 
whereas few features of surface side chains are generally conserved. Moreover, tolerated 

15 conservative amino acid substitutions involve replacement of the aliphatic or hydrophobic amino 
acids Ala, Val, Leu and He; replacement of the hydroxyl residues Ser and Thr; replacement of the 
acidic residues Asp and Glu; replacement of the amide residues Asn and Gin, replacement of the 
basic residues Lys, Arg, and His; replacement of the aromatic residues Phe, Tyr, and Tip, and 
replacement of the small-sized amino acids Ala, Ser, Thr, Met, and Gly. 

20 Besides conservative amino acid substitution, variants of the present invention include (i) 

substitutions with one or more of the non-conserved amino acid residues, where the substituted 
amino acid residues may or may not be one encoded by the genetic code, or (ii) substitutions with 
one or more of the amino acid residues having a substituent group, or (iii) fusion of the mature 
polypeptide with another compound, such as a compound to increase the stability and/or solubility 

25 of the polypeptide (for example, polyethylene glycol), (iv) fusion of the polypeptide with 
additional amino acids, such as, for example, an IgG Fc fusion region peptide, serum albumin 
(preferably human serum albumin) or a fragment thereof, or leader or secretory sequence, or a 
sequence facilitating purification, or (v) fusion of the polypeptide with another compound, such as 
albumin (including but not limited to recombinant albumin (see, e.g., U.S. Patent No. 5,876,969, 

30 issued March 2, 1999, EP Patent 0 413 622, and U.S. Patent No. 5,766,883, issued June 16, 1998, 
herein incorporated by reference in their entirety)). Such variant polypeptides are deemed to be 
within the scope of those skilled in the art from the teachings herein. 

For example, polypeptide variants containing amino acid substitutions of charged amino 
acids with other charged or neutral amino acids may produce proteins with improved 

35 characteristics, such as less aggregation. Aggregation of pharmaceutical formulations both 
reduces activity and increases clearance due to the aggregate's immunogenic activity. See 
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Pinckard et al., Clin. Exp. Immunol. 2:331-340 (1967); Robbins et aL, Diabetes 36: 838-845 
(1987); Cleland et al., Crit. Rev. Therapeutic Drug Carrier Systems 10:307-377 (1993). 

A further embodiment of the invention relates to polypeptides which comprise the amino 
acid sequence of a polypeptide having an amino acid sequence which contains at least one amino 
5 acid substitution, but not more than 50 amino acid substitutions, even more preferably, not more 
than 40 amino acid substitutions, still more preferably, not more than 30 amino acid substitutions, 
and still even more preferably, not more than 20 amino acid substitutions from a polypeptide 
sequence disclosed herein. Of course it is highly preferable for a polypeptide to have an amino 
acid sequence which, for example, comprises the amino acid sequence of a polypeptide of SEQ ID 

10 NO:Y, the amino acid sequence of the mature (e.g., secreted) polypeptide of SEQ ID NO:Y, an 
arnino acid sequence encoded by SEQ ID NO:X, an amino acid sequence encoded by the portion 
of SEQ ID NO:X as defined in columnns 8 and 9 of Table 2, an amino acid sequence encoded by 
the complement of SEQ ID NO:X, an arnino acid sequence encoded by cDNA contained in ATCC 
Deposit No:Z, and/or the amino acid sequence of a mature (secreted) polypeptide encoded by 

15 cDNA contained in ATCC Deposit No:Z, or a fragment thereof, which contains, in order of ever- 
increasing preference, at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 amino acid 
substitutions. 

In specific embodiments, the polypeptides of the invention comprise, or alternatively, 
consist of, fragments or variants of a reference amino acid sequence selected from: (a) the amino 
20 acid sequence of SEQ ID NO:Y or fragments thereof (e.g., the mature formand/or other fragments 
described herein); (b) the amino acid sequence encoded by SEQ ID NO:X or fragments thereof; 

(c) the amino acid sequence encoded by the complement of SEQ ID NO:X or fragments thereof; 

(d) the amino acid sequence encoded by the portion of SEQ ID NO:X as defined m columns 8 and 
9 of Table 2 or fragments thereof; and (e) the amino acid sequence encoded by cDNA contained in 

25 ATCC Deposit No:Z or fragments thereof; wherein the fragments or variants have 1-5, 5-10, 5-25, 
5-50, 10-50 or 50-150, arnino acid residue additions, substitutions, and/or deletions when 
compared to the reference amino acid sequence. In preferred embodiments, the amino acid 
substitutions are conservative. Polynucleotides encoding these polypeptides are also encompassed 
by the invention. 

30 

Polynucleotide and Polypeptide Fragments 

The present invention is also directed to polynucleotide fragments of the polynucleotides 
(nucleic acids) of the invention. In the present invention, a "polynucleotide fragment" refers to a 
polynucleotide having a nucleic acid sequence which, for example: is a portion of the cDNA 
35 contained in ATCC Deposit No:Z or the complementary strand thereto; is a portion of the 
polynucleotide sequence encoding the polypeptide encoded by the cDNA contained in ATCC 
Deposit No:Z or the complementary strand thereto; is a portion of the polynucleotide sequence 
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encoding the mature (secreted) polypeptide encoded by the cDNA contained in ATCC Deposit 
No:Z or the complementary strand thereto; is a portion of a polynucleotide sequence encoding the 
mature arnino acid sequence as defined in columns 14 and 15 of Table 1 A or the complementary 
strand thereto; is a portion of a polynucleotide sequence encoding the amino acid sequence 
5 encoded by the region of SEQ ID NO:X as defined in columns 8 and 9 of Table 2 or the 
complementary strand thereto; is a portion of the polynucleotide sequence of SEQ ID NO:X as 
defined in columns 8 and 9 of Table 2 or the complementary strand thereto; is a portion of the 
polynucleotide sequence in SEQ ID NO:X or the complementary strand thereto; is a 
polynucleotide sequence encoding a portion of the polypeptide of SEQ ID NO:Y; is a 

10 polynucleotide sequence encoding a portion of a polypeptide encoded by SEQ ID NO:X; is a 
polynucleotide sequence encoding a portion of a polypeptide encoded by the complement of the 
polynucleotide sequence in SEQ ID NO:X; is a portion of a polynucleotide sequence encoding the 
amino acid sequence encoded by the region of SEQ ID NO:B as defined in column 6 of Table 1C 
or the complementary strand thereto; or is a portion of the polynucleotide sequence of SEQ ID 

15 NO:B as defined in column 6 of Table 1C or the complementary strand thereto. 

The polynucleotide fragments of the invention are preferably at least about 15 nt, and 
more preferably at least about 20 nt, still more preferably at least about 30 nt, and even more 
preferably, at least about 40 nt, at least about 50 nt, at least about 75 nt, or at least about 150 nt in 
length. A fragment "at least 20 nt in length," for example, is intended to include 20 or more 

20 contiguous bases from the cDNA sequence contained in ATCC Deposit No:Z, or the nucleotide 
sequence shown in SEQ ID NO:X or the complementary stand thereto. In this context "about" 
includes the particularly recited value or a value larger or smaller by several (5, 4, 3, 2, or 1) 
nucleotides, at either tenninus or at both terrmni. These nucleotide fragments have uses that 
include, but are not limited to, as diagnostic probes and primers as discussed herein. Of course, 

25 larger fragments (e.g., at least 160, 170, 180, 190, 200, 250, 500, 600, 1000, or 2000 nucleotides in 
length ) are also encompassed by the invention. 

Moreover, representative examples of polynucleotide fragments of the invention comprise, 
or alternatively consist of, a sequence from about nucleotide number 1-50, 51-100, 101-150, 151- 
200, 201-250, 251-300, 301-350, 351-400, 401-450, 451-500, 501-550, 551-600, 601-650, 651- 

30 700, 701-750, 751-800, 801-850, 851-900, 901-950, 951-1000, 1001-1050, 1051-1100, 1101- 
1150, 1151-1200, 1201-1250, 1251-1300, 1301-1350, 1351-1400, 1401-1450, 1451-1500, 1501- 
1550, 1551-1600, 1601-1650, 165M700, 1701-1750, 1751-1800, 1801-1850, 1851-1900, 1901- 
1950, 1951-2000, 2001-2050, 2051-2100, 2101-2150, 2151-2200, 2201-2250, 2251-2300, 2301- 
2350, 2351-2400, 2401-2450, 2451-2500, 2501-2550, 2551-2600, 2601-2650, 2651-2700, 2701- 

35 2750, 2751-2800, 2801-2850, 2851-2900, 2901-2950, 2951-3000, 3001-3050, 3051-3100, 3101- 
3150, 3151-3200, 3201-3250, 3251-3300, 3301-3350, 3351-3400, 3401-3450, 3451-3500, 3501- 
3550, 3551-3600, 3601-3650, 3651-3700, 3701-3750, 3751-3800, 3801-3850, 3851-3900, 3901- 
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3950, 3951-4000, 4001-4050, 40514100, 4101-4150, 4151-4200, 42014250, 4251-4300, 4301- 
4350, 4351-4400, 4401-4450, 4451-4500, 450M550, 4551-4600, 4601-4650, 46514700, 4701- 
4750, 47514800, 480M850, 48514900, 49014950, 4951-5000, 5001-5050, 5051-5100, 5101- 
5150, 5151-5200, 5201-5250, 5251-5300, 5301-5350, 5351-5400, 5401-5450, 5451-5500, 5501- 
5 5550, 5551-5600, 5601-5650, 5651-5700, 5701-5750, 5751-5800, 5801-5850, 5851-5900, 5901- 
5950, 5951-6000, 6001-6050, 6051-6100, 6101-6150, 6151-6200, 6201-6250, 6251-6300, 6301- 
6350, 6351-6400, 6401-6450, 6451-6500, 6501-6550, 6551-6600, 6601-6650, 6651-6700, 6701- 
6750, 6751-6800, 6801-6850, 6851-6900, 6901-6950, 6951-7000, 7001-7050, 7051-7100, 7101- 
7150, 7151-7200, 7201-7250, 7251-7300 or 7301 to the end of SEQ ID NO:X, or the 

10 complementary strand thereto. In this context "about" includes the particularly recited range or a 
range larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Preferably, these fragments encode a polypeptide which has a functional activity (e.g., 
biological activity; such as, for example, activity useful in detecting, preventing, diagnosing, 
prognosticating, treating, and/or ameliorating gastrointestinal diseases and disorders). More 

15 preferably, these polynucleotides can be used as probes or primers as discussed herein. 
Polynucleotides which hybridize to one or more of these polynucleotides under stringent 
hybridization conditions or alternatively, under lower stringency conditions are also encompassed 
by the invention, as are polypeptides encoded by these polynucleotides. 

Further representative examples of polynucleotide fragments of the invention comprise, or 

20 alternatively consist of, a sequence from about nucleotide number 1-50, 51-100, 101-150, 151- 
200, 201-250, 251-300, 301-350, 351400, 401450, 451-500, 501-550, 551-600, 601-650, 651- 
700, 701-750, 751-800, 801-850, 851-900, 901-950, 951-1000, 1001-1050, 1051-1100, 1101- 
1150, 1151-1200, 1201-1250, 1251-1300, 1301-1350, 1351-1400, 140M450, 1451-1500, 1501- 
1550, 1551-1600, 1601-1650, 1651-1700, 1701-1750, 1751-1800, 1801-1850, 1851-1900, 1901- 

25 1950, 1951-2000, 2001-2050, 2051-2100, 2101-2150, 2151-2200, 2201-2250, 2251-2300, 2301- 
2350, 2351-2400, 2401-2450, 2451-2500, 2501-2550, 2551-2600, 2601-2650, 2651-2700, 2701- 
2750, 2751-2800, 2801-2850, 2851-2900, 2901-2950, 2951-3000, 3001-3050, 3051-3100, 3101- 
3150, 3151-3200, 3201-3250, 3251-3300, 3301-3350, 3351-3400, 3401-3450, 3451-3500, 3501- 
3550, 3551-3600, 3601-3650, 3651-3700, 3701-3750, 3751-3800, 3801-3850, 3851-3900, 3901- 

30 3950, 3951-4000, 4001^050, 405M100, 41014150, 4151-4200, 4201-4250, 4251-4300, 4301- 
4350, 4351^1400, 44014450, 44514500, 45014550, 455W600, 46014650, 4651-4700, 4701- 
4750, 4751-4800, 4801-4850, 48514900, 49014950, 4951-5000, 5001-5050, 5051-5100, 5101- 
5150, 5151-5200, 5201-5250, 5251-5300, 5301-5350, 5351-5400, 5401-5450, 5451-5500, 5501- 
5550, 5551-5600, 5601-5650, 5651-5700, 5701-5750, 5751-5800, 5801-5850, 5851-5900, 5901- 

35 5950, 5951-6000, 6001-6050, 6051-6100, 6101-6150, 6151-6200, 6201-6250, 6251-6300, 6301- 
6350, 6351-6400, 6401-6450, 6451-6500, 6501-6550, 6551-6600, 6601-6650, 6651-6700, 6701- 
6750, 6751-6800, 6801-6850, 6851-6900, 6901-6950, 6951-7000, 7001-7050, 7051-7100, 7101- 
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7150, 7151-7200, 7201-7250, 7251-7300 or 7301 to the end of the cDNA sequence contained in 
ATCC Deposit No:Z, or the complementary strand thereto. In this context "about" includes the 
particularly recited range or a range larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at 
either terminus or at both termini. Preferably, these fragments encode a polypeptide which has a 
5 functional activity (e.g., biological activity). More preferably, these polynucleotides can be used 
as probes or primers as discussed herein. Polynucleotides which hybridize to one or more of these 
polynucleotides under stringent hybridization conditions or alternatively, under lower stringency 
conditions are also encompassed by the invention, as are polypeptides encoded by these 
polynucleotides. 

10 Moreover, representative examples of polynucleotide fragments of the invention comprise, 

or alternatively consist of, a nucleic acid sequence comprising one, two, three, four, five, six, 
seven, eight, nine, ten, or more of the above described polynucleotide fragments of the invention 
in combination with a polynucleotide sequence delineated in Table 1C column 6. Additional, 
representative examples of polynucleotide fragments of the invention comprise, or alternatively 

15 consist of, a nucleic acid sequence comprising one, two, three, four, five, six, seven, eight, nine, 
ten, or more of the above described polynucleotide fragments of the invention in combination with 
a polynucleotide sequence that is the complementary strand of a sequence delineated in column 6 
of Table 1C. In further embodiments, the above-described polynucleotide fragments of the 
invention comprise, or alternatively consist of, sequences delineated in Table 1C, column 6, and 

20 have a nucleic acid sequence which is different from that of the BAC fragment having the 
sequence disclosed in SEQ ID NO:B (see Table 1C, column 5). In additional embodiments, the 
above-described polynucleotide fragments of the invention comprise, or alternatively consist of, 
sequences delineated in Table 1C, column 6, and have a nucleic acid sequence which is different 
from that published for the BAC clone identified as BAC ID NO:A (see Table 1C, column 4). In 

25 additional embodiments, the above-described polynucleotides of the invention comprise, or 
alternatively consist of, sequences delineated Table 1C, column 6, and have a nucleic acid 
sequence which is different from that contained in the BAC clone identified as BAC ID NO: A (see 
Table 1C, column 4). Polypeptides encoded by these polynucleotides, other polynucleotides that 
encode these polypeptides, and antibodies that bind these polypeptides are also encompassed by 

30 the invention. Additionally, fragments and variants of the above-described polynucleotides and 
polypeptides are also encompassed by the invention. 

In additional specific embodiments, polynucleotides of the invention comprise, or 
alternatively consist of, one, two, three, four, five, six, seven, eight, nine, ten, or more fragments of 
the sequences delineated in column 6 of Table 1C, and the polynucleotide sequence of SEQ ID 

35 NO:X (e.g., as defined in Table 1C, column 2) or fragments or variants thereof. Polypeptides 
encoded by these polynucleotides, other polynucleotides that encode these polypeptides, and 
antibodies that bind these polypeptides are also encompassed by the invention. 
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In additional specific embodiments, polynucleotides of the invention comprise, or 
alternatively consist of, one, two, three, four, five, six, seven, eight, nine, ten, or more fragments of 
the sequences delineated in column 6 of Table 1C which correspond to the same ATCC Deposit 
No:Z (see Table 1C, column 1), and the polynucleotide sequence of SEQ ID NO:X (e.g., as 
5 defined in Table 1A, IB, or IQ or fragments or variants thereof. Polypeptides encoded by these 
polynucleotides, other polynucleotides that encode these polypeptides, and antibodies that bind 
these polypeptides are also encompassed by the invention. 

In further specific embodiments, polynucleotides of the invention comprise, or 
alternatively consist of, one, two, three, four, five, six, seven, eight, nine, ten, or more fragments of 

10 the sequences delineated in the same row of column 6 of Table 1C, and the polynucleotide 
sequence of SEQ ID NO:X (e.g., as defined in Table 1A, IB, or 1C) or fragments or variants 
thereof. Polypeptides encoded by these polynucleotides, other polynucleotides that encode these 
polypeptides, and antibodies that bind these polypeptides are also encompassed by the invention. 

In additional specific embodiments, polynucleotides of the invention comprise, or 

15 alternatively consist of a polynucleotide sequence in which the 3' 10 polynucleotides of one of the 
sequences delineated in column 6 of Table 1C and the 5' 10 polynucleotides of the sequence of 
SEQ ID NO:X are directly contiguous. Nucleic acids which hybridize to the complement of these 
20 contiguous polynucleotides under stringent hybridization conditions or alternatively, under 
lower stringency conditions, are also encompassed by the invention. Polypeptides encoded by 

20 these polynucleotides and/or nucleic acids, other polynucleotides and/or nucleic acids that encode 
these polypeptides, and antibodies that bind these polypeptides are also encompassed by the 
invention. Additionally, fragments and variants of the above-described polynucleotides, nucleic 
acids, and polypeptides are also encompassed by the invention. 

In additional specific embodiments, polynucleotides of the invention comprise, or 

25 alternatively consist of a polynucleotide sequence in which the 3' 10 polynucleotides of one of the 
sequences delineated in column 6 of Table 1C and the 5' 10 polynucleotides of a fragment or 
variant of the sequence of SEQ ID NO:X (e.g., as described herein) are directly contiguous 
Nucleic acids which hybridize to the complement of these 20 contiguous polynucleotides under 
stringent hybridization conditions or alternatively, under lower stringency conditions, are also 

30 encompassed by the invention. Polypeptides encoded by these polynucleotides and/or nucleic 
acids, other polynucleotides and/or nucleic acids encoding these polypeptides, and antibodies that 
bind these polypeptides are also encompassed by the invention. Additionally, fragments and 
variants of the above-described polynucleotides, nucleic acids, and polypeptides are also 
encompassed by the invention. 

35 In further specific embodiments, polynucleotides of the invention comprise, or 

alternatively consist of a polynucleotide sequence in which the 3' 10 polynucleotides of a fragment 
or variant of the sequence of SEQ ID NO:X and the 5' 10 polynucleotides of the sequence of one 
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of the sequences delineated in column 6 of Table 1C are directly contiguous. Nucleic acids which 
hybridize to the complement of these 20 contiguous polynucleotides under stringent hybridization 
conditions or alternatively, under lower stringency conditions, are also encompassed by the 
invention. Polypeptides encoded by these polynucleotides and/or nucleic acids, other 
5 polynucleotides and/or nucleic acids encoding these polypeptides, and antibodies that bind these 
polypeptides are also encompassed by the invention. Additionally, fragments and variants of the 
above-described polynucleotides, nucleic acids, and polypeptides are also encompassed by the 
invention. 

In specific embodiments, polynucleotides of the invention comprise, or alternatively 
10 consist of a polynucleotide sequence in which the 3' 10 polynucleotides of one of the sequences 
delineated in column 6 of Table 1C and the 5* 10 polynucleotides of another sequence in column 6 
are directly contiguous. In preferred embodiments, the 3' 10 polynucleotides of one of the 
sequences delineated in column 6 of Table 1C is directly contiguous with the 5' 10 
polynucleotides of the next sequential exon delineated in Table 1C, column 6. Nucleic acids 
15 which hybridize to the complement of these 20 contiguous polynucleotides under stringent 
hybridization conditions or alternatively, under lower stringency conditions, are also encompassed 
by the invention. Polypeptides encoded by these polynucleotides and/or nucleic acids, other 
polynucleotides and/or nucleic acids encoding these polypeptides, and antibodies that bind these 
polypeptides are also encompassed by the invention. Additionally, fragments and variants of the 
20 above-described polynucleotides, nucleic acids, and polypeptides are also encompassed by the 
invention. 

In the present invention, a "polypeptide fragment" refers to an amino acid sequence which 
is a portion of the amino acid sequence contained in SEQ ID NO:Y, is a portion of the mature 
form of SEQ ID NO: Y as defined in columns 14 and 15 of Table 1 A, a portion of an amino acid 

25 sequence encoded by the portion of SEQ ID NO:X as defined in columnns 8 and 9 of Table 2, is a 
portion of an amino acid sequence encoded by the polynucleotide sequence of SEQ ID NO:X, is a 
portion of an amino acid sequence encoded by the complement of the polynucleotide sequence in 
SEQ ID NO:X, is a portion of the amino acid sequence of a mature (secreted) polypeptide encoded 
by the cDNA contained in ATCC Deposit NorZ, and/or is a portion of an amino acid sequence 

30 encoded by the cDNA contained in ATCC Deposit No:Z. Protein (polypeptide) fragments may be 
"free-standing," or comprised within a larger polypeptide of which the fragment forms a part or 
region, most preferably as a single continuous region. Representative examples of polypeptide 
fragments of the invention, include, for example, fragments comprising, or alternatively consisting 
of, from about amino acid number 1-20, 21-40, 41-60, 61-80, 81-100, 101-120, 121-140, 141-160, 

35 161-180, 181-200, 201-220, 221-240, 241-260, 261-280, 281-300, 301-320, 321-340, 341-360, 
361-380, 38MOO, 401-420, 421^40, 441-460, 461-480, 481-500, 501-520, 521-540, 541-560, 
561-580, 581-600, 601-620, 621-640, 641-660, 661-680, 681-700, 701-720, 721-740, 741-760, 
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761-780, 781-800, 801-820, 821-840, 841-860, 861-880, 881-900, 901-920, 921-940, 941-960, 
961-980, 981-1000, 1001-1020, 1021-1040, 1041-1060, 1061-1080, 1081-1100, 1101-1120, 1121- 
1140, 1141-1160, 1161-1180, 1181-1200, 1201-1220, 1221-1240, 1241-1260, 1261-1280, 1281- 
1300, 1301-1320, 1321-1340, 1341-1360, 1361-1380, 1381-1400, 1401-1420, 1421-1440, or 1441 
5 to the end of the coding region of cDNA and SEQ ID NO: Y. In a preferred embodiment, 
polypeptide fragments of the invention include, for example, fragments comprising, or 
alternatively consisting of, from about amino acid number 1-20, 21-40, 41-60, 61-80, 81-100, 101- 
120, 121-140, 141-160, 161-180, 181-200, 201-220, 221-240, 241-260, 261-280, 281-300, 301- 
320, 321-340, 341-360, 361-380, 381-400, 401-420, 421-440, 441-460, 461^80, 481-500, 501- 

10 520, 521-540, 541-560, 561-580, 581-600, 601-620, 621-640, 641-660, 661-680, 681-700, 701- 
720, 721-740, 741-760, 761-780, 781-800, 801-820, 821-840, 841-860, 861-880, 881-900, 901- 
920, 921-940, 941-960, 961-980, 981-1000, 1001-1020, 1021-1040, 1041-1060, 1061-1080, 1081- 
1100, 1101-1120, 1121-1140, 1141-1160, 1161-1180, 1181-1200, 1201-1220, 1221-1240, 1241- 
1260, 1261-1280, 1281-1300, 1301-1320, 1321-1340, 1341-1360, 1361-1380, 1381-1400, 1401- 

15 1420, 1421-1440, or 1441 to the end of the coding region of SEQ ID NO:Y. Moreover, 
polypeptide fragments of the invention may be at least about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 
60, 65, 70, 75, 80, 85, 90, 100, 110, 120, 130, 140, or 150 amino acids in length. In this context 
"about" includes the particularly recited ranges or values, or ranges or values larger or smaller by 
several (5, 4, 3, 2, or 1) amino acids, at either extreme or at both extremes. Polynucleotides 

20 encoding these polypeptide fragments are also encompassed by the invention. 

Even if deletion of one or more amino acids from the N-terminus of a protein results in 
modification of loss of one or more biological functions of the protein, other functional activities 
(e.g., biological activities; such as, for example, activity useful in detecting, preventing, 
diagnosing, prognosticating, treating, and/or ameliorating gastrointestinal diseases and disorders; 

25 ability to miritimerize; ability to bind a ligand; antigenic ability useful for production of 
polypeptide specific antibodies) may still be retained. For example, the ability of shortened 
muteins to induce and/or bind to antibodies which recognize the complete or mature forms of the 
polypeptides generally will be retained when less than the majority of the residues of the complete 
or mature polypeptide are removed from the N-terminus. Whether a particular polypeptide lacking 

30 N-terminal residues of a complete polypeptide retains such immunologic activities can readily be 
determined by routine methods described herein and otherwise known in the art. It is not unlikely 
that a mutein with a large number of deleted N-terminal amino acid residues may retain some 
biological or immunogenic activities. In fact, peptides composed of as few as six amino acid 
residues may often evoke an immune response. 

35 Accordingly, polypeptide fragments include the secreted protein as well as the mature 

form. Further preferred polypeptide fragments include the secreted protein or the mature form 
having a continuous series of deleted residues from the amino or the carboxy terminus, or both. 
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For example, any number of amino acids, ranging from 1-60, can be deleted from the amino 
terminus of either the secreted polypeptide or the mature form. Similarly, any number of amino 
acids, ranging from 1-30, can be deleted from the carboxy terminus of the secreted protein or 
mature form. Furthermore, any combination of the above amino and carboxy terminus deletions 
5 are preferred. Similarly, polynucleotides encoding these polypeptide fragments are also preferred. 

The present invention further provides polypeptides having one or more residues deleted 
from the amino terminus of the amino acid sequence of a polypeptide disclosed herein (e.g., a 
polypeptide of SEQ ID NO:Y, a polypeptide as defined in columns 14 and 15 of Table 1A, a 
polypeptide encoded by the polynucleotide sequence contained in SEQ ID NO:X or the 

10 complement thereof, a polypeptide encoded by the portion of SEQ ID NO:X as defined in columns 
8 and 9 of Table 2, a polypeptide encoded by the portion of SEQ ID NO:B as defined in column 6 
of Table 1C, a polypeptide encoded by the cDNA contained in ATCC Deposit No:Z, and/or a 
mature polypeptide encoded by the cDNA contained in ATCC Deposit No:Z). In particular, N- 
terminal deletions may be described by the general formula m-q, where q is a whole integer 

15 representing the total number of amino acid residues in a polypeptide of the invention (e.g., the 
polypeptide disclosed in SEQ ID NO: Y, the mature (secreted) portion of SEQ ID NO.Y as defined 
in columns 14 and 15 of Table 1A, or the polypeptide encoded by the portion of SEQ ID NO:X as 
defined in columns 8 and 9 of Table 2), and m is defined as any integer ranging from 2 to q-6. 
Polynucleotides encoding these polypeptides are also encompassed by the invention. 

20 The present invention further provides polypeptides having one or more residues from the 

carboxy terminus of the amino acid sequence of a polypeptide disclosed herein (e.g., a polypeptide 
of SEQ ID NO:Y, the mature (secreted) portion of SEQ ID NO:Y as defined in columns 14 and 15 
of Table 1A, a polypeptide encoded by the polynucleotide sequence contained in SEQ ID NO:X, a 
polypeptide encoded by the portion of SEQ ID NO:X as defined in columns 8 and 9 of Table 2, a 

25 polypeptide encoded by the portion of SEQ ID NO:B as defined in column 6 of Table 1C, a 
polypeptide encoded by the cDNA contained in ATCC Deposit No:Z, and/or a mature polypeptide 
encoded by the cDNA contained in ATCC Deposit No:Z). In particular, C-terminal deletions may 
be described by the general formula 1-n, where n is any whole integer ranging from 6 to q-1, and 
where n corresponds to the position of amino acid residue in a polypeptide of the invention. 

30 Polynucleotides encoding these polypeptides are also encompassed by the invention. 

In addition, any of the above described N- or C-terminal deletions can be combined to 
produce a N- and C4erminal deleted polypeptide. The invention also provides polypeptides 
having one or more amino acids deleted from both the amino and the carboxy! termini, which may 
be described generally as having residues m-n of a polypeptide encoded by SEQ ID NO:X (e.g., 

35 including, but not limited to, the preferred polypeptide disclosed as SEQ ID NO:Y, the mature 
(secreted) portion of SEQ ID NO:Y as defined in columns 14 and 15 of Table 1A, and the 
polypeptide encoded by the portion of SEQ ID NO:X as defined in columns 8 and 9 of Table 2), 
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the cDNA contained in ATCC Deposit No:Z, and/or the complement thereof, where n and m are 
integers as described above. Polynucleotides encoding these polypeptides are also encompassed by 
the invention. 

Also as mentioned above, even if deletion of one or more amino acids from the 
5 C-terminus of a protein results in modification of loss of one or more biological functions of the 
protein, other functional activities (e.g., biological activities such as, for example, activity useful in 
detecting, preventing, diagnosing, prognosticating, treating, and/or ameliorating gastrointestinal 
diseases and disorders; ability to multimerize; ability to bind a ligand; antigenic ability useful for 
production of polypeptide specific antibodies) may still be retained. For example the ability of the 

10 shortened mutein to induce and/or bind to antibodies which recognize the complete or mature 
forms of the polypeptide generally will be retained when less than the majority of the residues of 
the complete or mature polypeptide are removed from the C-terminus. Whether a particular 
polypeptide lacking C-terminal residues of a complete polypeptide retains such immunologic 
activities can readily be determined by routine methods described herein and otherwise known in 

15 the art. It is not unlikely that a mutein with a large number of deleted C-tenninal amino acid 
residues may retain some biological or immunogenic activities. In fact, peptides composed of as 
few as six amino acid residues may often evoke an immune response. 

The present application is also directed to proteins containing polypeptides at least 80%, 
85%, 90%, 95%, 96%, 97%, 98% or 99% identical to a polypeptide sequence set forth herein. In 

20 preferred embodiments, the application is directed to proteins containing polypeptides at least 
80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identical to polypeptides having the amino acid 
sequence of the specific N- and C-terminal deletions. Polynucleotides encoding these 
polypeptides are also encompassed by the invention. 

Any polypeptide sequence encoded by, for example, the polynucleotide sequences set 

25 forth as SEQ ID NO:X or the complement thereof, (presented, for example, in Tables 1A and 2), 
the cDNA contained in ATCC Deposit No:Z, or the polynucleotide sequence as defined in column 
6 of Table 1C, may be analyzed to determine certain preferred regions of the polypeptide. For 
example, the amino acid sequence of a polypeptide encoded by a polynucleotide sequence of SEQ 
ID NO:X (e.g., the polypeptide of SEQ ID NO:Y and the polypeptide encoded by the portion of 

30 SEQ ID NO:X as defined in columnns 8 and 9 of Table 2) or the cDNA contained in ATCC 
Deposit No:Z may be analyzed using the default parameters of the DNASTAR computer 
algorithm (DNASTAR, Inc., 1228 S. Park St., Madison, WI 53715 USA; 
http://www.dnastar.com/). 

Polypeptide regions that may be routinely obtained using the DNASTAR computer 

35 algorithm include, but are not limited to, Garnier-Robson alpha-regions, beta-regions, 
turn-regions, and coil-regions; Chou-Fasman alpha-regions, beta-regions, and turn-regions; 
Kyte-Doolittle hydrophilic regions and hydrophobic regions; Eisenberg alpha- and 
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beta-amphipathic regions; Karplus-Schulz flexible regions; Emini surface-forming regions; and 
Jameson-Wolf regions of high antigenic index. Among highly preferred polynucleotides of the 
invention in this regard are those that encode polypeptides comprising regions that combine 
several structural features, such as several (e.g., 1, 2, 3 or 4) of the features set out above. 

* 5 Additionally, Kyte-Doolittle hydrophilic regions and hydrophobic regions, Emini 

surface-forming regions, and Jameson-Wolf regions of high antigenic index (i.e., containing four 
or more contiguous amino acids having an antigenic index of greater than or equal to 1.5, as 
identified using the default parameters of the Jameson-Wolf program) can routinely be used to 
determine polypeptide regions that exhibit a high degree of potential for antigenicity. Regions of 

10 high antigenicity are determined from data by DNASTAR analysis by choosing values which 
represent regions of the polypeptide which are likely to be exposed on the surface of the 
polypeptide in an environment in which antigen recognition may occur in the process of initiation 
of an immune response. 

Preferred polypeptide fragments of the invention are fragments comprising, or 

15 alternatively, consisting of, an amino acid sequence that displays a functional activity (e.g. 
biological activity such as, for example, activity useful in detecting, preventing, diagnosing, 
prognosticating, treating, and/or ameliorating gastrointestinal diseases and disorders; ability to 
multimerize; ability to bind a ligand; antigenic ability useful for production of polypeptide specific 
antibodies) of the polypeptide sequence of which the amino acid sequence is a fragment By a 

20 polypeptide displaying a "functional activity" is meant a polypeptide capable of one or more 
known functional activities associated with a full-length protein, such as, for example, biological 
activity, antigenicity, immunogenicity, and/or multimerization, as described herein. 

Other preferred polypeptide fragments are biologically active fragments. Biologically 
active fragments are those exhibiting activity similar, but not necessarily identical, to an activity of 

25 the polypeptide of the present invention. The biological activity of the fragments may include an 

# 

improved desired activity, or a decreased undesirable activity. 

In preferred embodiments, polypeptides of the invention comprise, or alternatively consist 
of, one, two, three, four, five or more of the antigenic fragments of the polypeptide of SEQ ID 
NO: Y, or portions thereof. Polynucleotides encoding these polypeptides are also encompassed by 
30 the invention. 

Epitopes and Antibodies 

The present invention encompasses polypeptides comprising, or alternatively consisting 
of, an epitope of: the polypeptide sequence shown in SEQ ID NO:Y; a polypeptide sequence 
35 encoded by SEQ ID NO:X or the complementary strand thereto; the polypeptide sequence 
encoded by the portion of SEQ ID NO:X as defined in columns 8 and 9 of Table 2; the 
polypeptide sequence encoded by the portion of SEQ ID NO:B as defined in column 6 of Table 1C 
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or the complement thereto; the polypeptide sequence encoded by the cDNA contained in ATCC 
Deposit No:Z; or the polypeptide sequence encoded by a polynucleotide that hybridizes to the 
sequence of SEQ ID NO:X, the complement of the sequence of SEQ ID NO:X, the complement of 
a portion of SEQ ID NO:X as defined in columns 8 and 9 of Table 2, or the cDNA sequence 
5 contained in ATCC Deposit No:Z under stringent hybridization conditions or alternatively, under 
lower stringency hybridization as defined supra. The present invention further encompasses 
polynucleotide sequences encoding an epitope of a polypeptide sequence of the invention (such as, 
for example, the sequence disclosed in SEQ ID NO:X, or a fragment thereof), polynucleotide 
sequences of the complementary strand of a polynucleotide sequence encoding an epitope of the 

10 invention, and polynucleotide sequences which hybridize to the complementary strand under 
stringent hybridization conditions or alternatively, under lower stringency hybridization 
conditions defined supra. 

The term "epitopes," as used herein, refers to portions of a polypeptide having antigenic 
or immunogenic activity in an animal, preferably a rnammal, and most preferably in a human. In 

15 a preferred embodiment, the present invention encompasses a polypeptide comprising an epitope, 
as well as the polynucleotide encoding this polypeptide. An "immunogenic epitope," as used 
herein, is defined as a portion of a protein that elicits an antibody response in an animal, as 
determined by any method known in the art, for example, by the methods for generating 
antibodies described infra. (See, for example, Geysen et aL, Proc. Natl. Acad. Sci. USA 81:3998- 

20 4002 (1983)). The term "antigenic epitope," as used herein, is defined as a portion of a protein to 

■ 

which an antibody can irnmuriospecifically bind its antigen as determined by any method well 
known in the art, for example, by the immunoassays described herein. Immunospecific binding 
excludes non-specific binding but does not necessarily exclude cross- reactivity with other 
antigens. Antigenic epitopes need not necessarily be immunogenic. 

25 Fragments which function as epitopes may be produced by any conventional means. (See, 

e.g., Houghten, R. A., Proc. Natl. Acad Sci. USA 82:5131-5135 (1985) further described in U.S. 
Patent No. 4,631,211.) 

In the present invention, antigenic epitopes preferably contain a sequence of at least 4, at 
least 5, at least 6, at least 7, more preferably at least 8, at least 9, at least 10, at least 11, at least 12, 

30 at least 13, at least 14, at least 15, at least 20, at least 25, at least 30, at least 40, at least 50, and, 
most preferably, between about 15 to about 30 amino acids. Preferred polypeptides comprising 
immunogenic or antigenic epitopes are at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 85, 90, 95, or 100 amino acid residues in length. Additional non-exclusive preferred antigenic 
epitopes include the antigenic epitopes disclosed herein, as well as portions thereof. Antigenic 

35 epitopes are useful, for example, to raise antibodies, including monoclonal antibodies, that 
specifically bind the epitope. Preferred antigenic epitopes include the antigenic epitopes disclosed 
herein, as well as any combination of two, three, four, five or more of these antigenic epitopes. 
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Antigenic epitopes can be used as the target molecules in immunoassays. (See, for instance, 
Wilson et al, Cell 37:767-778 (1984); Sutcliffe et aL, Science 219:660-666 (1983)). 

Non-limiting examples of epitopes of polypeptides that can be used to generate antibodies 
of the invention include a polypeptide comprising, or alternatively consisting of, at least one, two, 
5 three, four, five, six or more of the porrion(s) of SEQ ID NO: Y specified in column 6 of Table 
1B.1. These polypeptide fragments have been determined to bear antigenic epitopes of the 
proteins of the invention by the analysis of the Jameson-Wolf antigenic index which is included in 
the DNAStar suite of computer programs. By "comprise" it is intended that a polypeptide contains 
at least one, two, three, four, five, six or more of the portion(s) of SEQ ID NO: Y shown in column 

10 6 of Table 1B.1, but it may contain additional flanking residues on either the amino or carboxyl 
termini of the recited portion. Such additional flanking sequences are preferably sequences 
naturally found adjacent to the portion; i.e., contiguous sequence shown in SEQ ID NO:Y. The 
flanking sequence may, however, be sequences from a heterolgous polypeptide, such as from 
another protein described herein or from a heterologous polypeptide not described herein. In 

15 particular embodiments, epitope portions of a polypeptide of the invention comprise one, two, 
three, or more of the portions of SEQ ID NO. Y shown in column 6 of Table 1B.1, 

Similarly, immunogenic epitopes can be used, for example, to induce antibodies 
according to methods well known in the art. See, for instance, Sutcliffe et al., supra; Wilson et 
al., supra; Chow et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle et al., J. Gen. Virol. 

20 66:2347-2354 (1985). Preferred immunogenic epitopes include the immunogenic epitopes 
disclosed herein, as well as any combination of two, three, four, five or more of these 
immunogenic epitopes. The polypeptides comprising one or more immunogenic epitopes may be 
presented for eliciting an antibody response together with a carrier protein, such as an albumin, to 
an animal system (such as rabbit or mouse), or, if the polypeptide is of sufficient length (at least 

25 about 25 amino acids), the polypeptide may be presented without a carrier. However, 
immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to be sufficient 
to raise antibodies capable of binding to, at the very least, linear epitopes in a denatured 
polypeptide (e.g., in Western blotting). 

Epitope-bearing polypeptides of the present invention may be used to induce antibodies 

30 according to methods well known in the art including, but not limited to, in vivo immunization, in 
vitro immunization, and phage display methods. See, e,g., Sutcliffe et al., supra; Wilson et al., 
supra, and Bittle et al., J. Gen. Virol., 66:2347-2354 (1985). If in vivo immunization is used, 
animals may be immunized with free peptide; however, anti-peptide antibody titer may be boosted 
by coupling the peptide to a macromolecular carrier, such as keyhole limpet hemacyanin (KLH) 

35 or tetanus toxoid. For instance, peptides containing cysteine residues may be coupled to a carrier 
using a linker such as nialeimidobenzoyl- N-hydroxysuccinimide ester (MBS), while other 
peptides may be coupled to carriers using a more general linking agent such as glutaraldehyde. 
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Animals such as rabbits, rats and mice are immunized with either free or carrier- coupled peptides, 
for instance, by intraperitoneal and/or intradermal injection of emulsions containing about 100 fig 
of peptide or carrier protein and Freund's adjuvant or any other adjuvant known for stimulating an 
immune response. Several booster injections may be needed, for instance, at intervals of about 
5 two weeks, to provide a useful titer of anti-peptide antibody which can be detected, for example, 
by ELISA assay using free peptide adsorbed to a solid surface. The titer of anti-peptide antibodies 
in serum from an immunized animal may be increased by selection of anti-peptide antibodies, for 
instance, by adsorption to the peptide on a solid support and elution of the selected antibodies 
according to methods well known in the art. 

10 As one of skill in the art will appreciate, and as discussed above, the polypeptides of the 

present invention (e.g., those comprising an immunogenic or antigenic epitope) can be fused to 
heterologous polypeptide sequences. For example, polypeptides of the present invention 
(including fragments or variants thereof), may be fused with the constant domain of 
immunoglobulins (IgA, IgE, IgG, IgM), or portions thereof (CHI, CH2, CH3, or any combination 

15 thereof and portions thereof, resulting in chimeric polypeptides. By way of another non-limiting 
example, polypeptides and/or antibodies of the present invention (including fragments or variants 
thereof) may be fused with albumin (including but not limited to recombinant human serum 
albumin or fragments or variants thereof (see, e.g., U.S. Patent No. 5,876,969, issued March 2, 
1999, EP Patent 0 413 622, and U.S. Patent No. 5,766,883, issued June 16, 1998, herein 

20 incorporated by reference in their entirety)). In a preferred embodiment, polypeptides and/or 
antibodies of the present invention (including fragments or variants thereof) are fused with the 
mature form of human serum albumin (i.e., amino acids 1 - 585 of human serum albumin as 
shown in Figures 1 and 2 of EP Patent 0 322 094) which is herein incorporated by reference in its 
entirety. In another preferred embodiment, polypeptides and/or antibodies of the present invention 

25 (including fragments or variants thereof) are fused with polypeptide fragments comprising, or 
alternatively consisting of, amino acid residues 1-z of human serum albumin, where z is an integer 
from 369 to 419, as described in U.S. Patent 5,766,883 herein incorporated by reference in its 
entirety. Polypeptides and/or antibodies of the present invention (including fragments or variants 
thereof) may be fused to either the N- or C-terminal end of the heterologous protein (e.g., 

30 immunoglobulin Fc polypeptide or human serum albumin polypeptide). Polynucleotides encoding 
fusion proteins of the invention are also encompassed by the invention. 

Such fusion proteins as those described above may facilitate purification and may increase 
half-life in vivo. This has been shown for chimeric proteins consisting of the first two domains of 
the human CD4-polypeptide and various domains of the constant regions of the heavy or light 

35 chains of mammalian immunoglobulins. See, e.g., EP 394,827; Traunecker et al., Nature, 33 1:84- 
86 (1988). Enhanced delivery of an antigen across the epithelial barrier to the immune system has 
been demonstrated for antigens (e.g., insulin) conjugated to an FcRn binding partner such as IgG 
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or Fc fragments (see, e.g., PCT Publications WO 96/22024 and WO 99/04813). IgG fusion 
proteins that have a disulfide-linked dimeric structure due to the IgG portion desulfide bonds have 
also been found to be more efficient in binding and neutralizing other molecules than monomeric 
polypeptides or fragments thereof alone. See, e.g., Fountoulalcis et aL, J. Biochenx, 270:3958- 
5 3964 (1995). Nucleic acids encoding the above epitopes can also be recombined with a gene of 
interest as an epitope tag (e.g., the hemagglutinin (HA) tag or flag tag) to aid in detection and 
purification of the expressed polypeptide. For example, a system described by Janknecht et al. 
allows for the ready purification of non-denatured fusion proteins expressed in human cell lines 
(Janknecht et aL, 1991, Proc. Natl. Acad. Sci. USA 88:8972- 897). In this system, the gene of 
10 interest is subcloned into a vaccinia recombination plasmid such that the open reading frame of 
the gene is translationally fused to an ammo-terminal tag consisting of six histidine residues. The 
tag serves as a matrix binding domain for the fusion protein. Extracts from cells infected with the 
recombinant vaccinia virus are loaded onto Ni2+ nitriloacetic acid-agarose column and histidine- 
tagged proteins can be selectively eluted with imidazole-containing buffers. 

15 

Fusion Proteins 

Any polypeptide of the present invention can be used to generate fusion proteins. For 
example, the polypeptide of the present invention, when fused to a second protein, can be used as 
an antigenic tag. Antibodies raised against the polypeptide of the present invention can be used to 

20 indirectly detect the second protein by binding to the polypeptide. Moreover, because secreted 
proteins target cellular locations based on trafficking signals, polypeptides of the present invention 
which are shown to be secreted can be used as targeting molecules once fused to other proteins. 

Examples of domains that can be fused to polypeptides of the present invention include 
not only heterologous signal sequences, but also other heterologous functional regions. The fusion 

25 does not necessarily need to be direct, but may occur through linker sequences. 

In certain preferred embodiments, proteins of the invention are fusion proteins comprising 
an amino acid sequence that is an N and/or C- terminal deletion of a polypeptide of the invention. 
In preferred embodiments, the invention is directed to a fusion protein comprising an amino acid 
sequence that is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a polypeptide sequence of 

30 the invention. Polynucleotides encoding these proteins are also encompassed by the invention. 

Moreover, fusion proteins may also be engineered to improve characteristics of the 
polypeptide of the present invention. For instance, a region of additional arnrno acids, particularly 
charged amino acids, may be added to the N-terminus of the polypeptide to improve stability and 
persistence during purification from the host cell or subsequent handling and storage. Also, 

35 peptide moieties may be added to the polypeptide to facilitate purification. Such regions may be 
removed prior to final preparation of the polypeptide. The addition of peptide moieties to facilitate 
handling of polypeptides are familiar and routine techniques in the art 
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As one of skill in the art will appreciate that, as discussed above, polypeptides of the 
present invention, and epitope-bearing fragments thereof, can be combined with heterologous 
polypeptide sequences. For example, the polypeptides of the present invention may be fused with 
heterologous polypeptide sequences, for example, the polypeptides of the present invention may 
5 be fused with the constant domain of immunoglobulins (IgA, IgE, IgG, IgM) or portions thereof 
(CHI, CH2, CHS, and any combination thereof, including both entire domains and portions 
thereof), or albumin (including, but not limited to, native or recombinant human albumin or 
fragments or variants thereof (see, e.g., U.S. Patent No. 5,876,969, issued March 2, 1999, EP 
Patent 0 413 622, and U.S. Patent No. 5,766,883, issued June 16, 1998, herein incorporated by 

10 reference in their entirety)), resulting in chimeric polypeptides. For example, EP-A-O 464 533 
(Canadian counterpart 2045869) discloses fusion proteins comprising various portions of constant 
region of immunoglobulin molecules together with another human protein or part thereof. In 
many cases, the Fc part in a fusion protein is beneficial in therapy and diagnosis, and thus can 
result in, for example, improved pharmacokinetic properties (EP-A 0232 262). Alternatively, 

15 deleting the Fc part after the fusion protein has been expressed, detected, and purified, would be 
desired. For example, the Fc portion may hinder therapy and diagnosis if the fusion protein is 
used as an antigen for immunizations. In drug discovery, for example, human proteins, such as 
hIL-5, have been fused with Fc portions for the purpose of high-throughput screening assays to 
identify antagonists of hIL-5. See, D. Bennett et al., J. Molecular Recognition 8:52-58 (1995); K. 

20 Johanson et al., J. Biol. Chenx 270:9459-9471 (1995). 

Moreover, the polypeptides of the present invention can be fused to marker sequences, 
such as a polypeptide which facilitates purification of the fused polypeptide. In preferred 
embodiments, the marker amino acid sequence is a hexa-histidine peptide, such as the tag provided 
in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311), among others, 

25 many of which are commercially available. As described in Gentz et al., Proc. Natl. Acad. Sci. 
USA 86:821-824 (1989), for instance, hexa-histidine provides for convenient purification of the 
fusion protein. Another peptide tag useful for purification, the "HA" tag, corresponds to an 
epitope derived from the influenza hemagglutinin protein (Wilson et al., Cell 37:767 (1984)). 

Additional fusion proteins of the invention may be generated through the techniques of 

30 gene-shuffling, motif-shuffling, exon-shuffling, and/or codon-shuffling (collectively referred to as 
"DNA shuffling"). DNA shuffling may be employed to modulate the activities of polypeptides of 
the invention, such methods can be used to generate polypeptides with altered activity, as well as 
agonists and antagonists of the polypeptides. See, generally, U.S. Patent Nos. 5,605,793; 
5,811,238; 5,830,721; 5,834,252; and 5,837,458, and Patten et al., Curr. Opinion Biotechnol. 

35 8:724-33 (1997); Harayama, Trends Biotechnol. 16(2):76-82 (1998); Hansson, et al., J. MoL Biol. 
287:265-76 (1999); and Lorenzo and Blasco, Biotechniques 24(2):308- 13 (1998) (each of these 
patents and publications are hereby incorporated by reference in its entirety). In one embodiment, 
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alteration of polynucleotides corresponding to SEQ ED NO:X and the polypeptides encoded by 
these polynucleotides may be achieved by DNA shuffling. DNA shuffling involves the assembly 
of two or more DNA segments by homologous or site-specific recombination to generate variation 
in the polynucleotide sequence. In another embodiment, polynucleotides of the invention, or the 
5 encoded polypeptides, may be altered by being subjected to random mutagenesis by error-prone 
PCR, random nucleotide insertion or other methods prior to recombination. In another 
embodiment, one or more components, motifs, sections, parts, domains, fragments, etc., of a 
polynucleotide encoding a polypeptide of the invention may be recombined with one or more 
components, motifs, sections, parts, domains, fragments, etc. of one or more heterologous 
10 molecules. 

Thus, any of these above fusions can be engineered using the polynucleotides or the 
polypeptides of the present invention. 

Recombinant and Synthetic Production of Polypeptides of the Invention 
15 The present invention also relates to vectors containing the polynucleotide of the present 

invention, host cells, and the production of polypeptides by synthetic and recombinant techniques. 
The vector may be, for example, a phage, plasmid, viral, or retroviral vector. Retroviral vectors 
may be replication competent or replication defective. In the latter case, viral propagation 
generally will occur only in complementing host cells. 
20 The polynucleotides of the invention may be joined to a vector containing a selectable 

marker for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such 
as a calcium phosphate precipitate, or in a complex with a charged lipid If the vector is a virus, it 
may be packaged in vitro using an appropriate packaging cell line and then transduced into host 
cells. 

25 The polynucleotide insert should be operatively linked to an appropriate promoter, such as 

the phage lambda PL promoter, the E. coli lac, trp, phoA and tac promoters, the SV40 early and 
late promoters and promoters of retroviral LTRs, to name a few. Other suitable promoters will be 
known to the skilled artisan. The expression constructs will further contain sites for transcription 
initiation, termination, and, in the transcribed region, a ribosome binding site for translation. The 

30 coding portion of the transcripts expressed by the constructs will preferably include a translation 
initiating codon at the beginning and a termination codon (UAA, UGA or UAG) appropriately 
positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one selectable marker. 
Such markers include dihydrofolate reductase, G418, glutarnine synthase, or neomycin resistance 

35 for eukaryotic cell culture, and tetracycline, kanamycin or ampicillin resistance genes for culturing 
in E. coli and other bacteria. Representative examples of appropriate hosts include, but are not 
limited to, bacterial cells, such as E. coli, Streptomyces and Salmonella typhimurium cells; fungal 
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cells, such as yeast cells (e.g., Saccharomyces cerevisiae or Pichia pastoris (ATCC Accession No. 
201178)); insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, 
COS, 293, and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-<iescribed host cells are known in the art. 
5 Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, available 

from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, pNH16a, pNH18A, 
pNH46A, available from Stratagene Cloning Systems, Inc.; and ptrc99a, pKK223-3, pKK233-3, 
pDR540, pRTTS available from Pharmacia Biotech, Inc. Among preferred eukaryotic vectors are 
pWLNEO, pSV2CAT, pOG44, pXTl and pSG available from Stratagene; and pSVK3, pBPV, 
10 pMSG and pSVL available from Pharmacia. Preferred expression vectors for use in yeast systems 
include, but are not limited to pYES2, pYDl, pTEFl/Zeo, pYES2/GS, pPICZ, pGAPZ, 
pGAPZalph, pPIC9, pPIC3.5, pH3L-D2, pHIL-Sl, pPIC3.5K, pPIC9K, and PA0815 (all available 
from Invitrogen, Carlbad, CA). Other suitable vectors will be readily apparent to the skilled 
artisan. 1 

15 Vectors which use glutamine synthase (GS) or DHFR as the selectable markers can be 

amplified in the presence of the drugs methionine sulphoximine or methotrexate, respectively. An 
advantage of glutamine synthase based vectors are the availabilty of cell lines (e.g., the murine 
myeloma cell line, NSO) which are glutamine synthase negative. Glutamine synthase expression 
systems can also function in glutamine synthase expressing cells (e.g., Chinese Hamster Ovary 

20 (CHO) cells) by providing additional inhibitor to prevent the functioning of the endogenous gene. 
A glutamine synthase expression system and components thereof are detailed in PCT publications: 
WO87/04462; WO86/05807; WO89/01036; WO89/10404; and WO91/06657, which are hereby 
incorporated in their entireties by reference herein. Additionally, glutamine synthase expression 
vectors can be obtained from Lonza Biologies, Inc. (Portsmouth, NH). Expression and production 

25 of monoclonal antibodies using a GS expression system in murine myeloma cells is described in 
Bebbington et al, Bio/technology 10:169(1992) and in Biblia and Robinson Biotechnol. Prog. 
11:1 (1995) which are herein incorporated by reference. 

The present invention also relates to host cells containing the above-described vector 
constructs described herein, and additionally encompasses host cells containing nucleotide 

30 sequences of the invention that are operably associated with one or more heterologous control 

» 

regions (e.g., promoter and/or enhancer) using techniques known of in the art. The host cell can 
be a higher eukaryotic cell, such as a. mammalian cell (e.g., a human derived cell), or a lower 
eukaryotic cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial 
ceD. A host strain may be chosen which modulates the expression of the inserted gene sequences, 
35 or modifies and processes the gene product in the specific fashion desired. Expression from 
certain promoters can be elevated in the presence of certain inducers; thus expression of the 
genetically engineered polypeptide may be controlled. Furthermore, different host cells have 
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characteristics and specific mechanisms for the translational and post-translational processing and 
modification (e.g., phosphorylation, cleavage) of proteins. Appropriate cell lines can be chosen to 
ensure the desired modifications and processing of the foreign protein expressed. 

Introduction of the nucleic acids and nucleic acid constructs of the invention into the host 
5 cell can be effected by calcium phosphate transfection, DEAE-dextran mediated transfection, 
cationic lipid-mediated transfection, electroporation, transduction, infection, or other methods. 
Such methods are described in many standard laboratory manuals, such as Davis et al., Basic 
Methods In Molecular Biology (1986). It is specifically contemplated that the polypeptides of the 
present invention may in fact be expressed by a host cell lacking a recombinant vector. 

10 In addition to encompassing host cells containing the vector constructs discussed herein, 

the invention also encompasses primary, secondary, and immortalized host cells of vertebrate 
origin, particularly rmmrnalian origin, that have been engineered to delete or replace endogenous 
genetic material (e.g., the coding sequence), and/or to include genetic material (e.g., heterologous 
polynucleotide sequences) that is operably associated with polynucleotides of the invention, and 

15 which activates, alters, and/or amplifies endogenous polynucleotides. For example, techniques 
known in the art may be used to operably associate heterologous control regions (e.g., promoter 
and/or enhancer) and endogenous polynucleotide sequences via homologous recombination (see, 
e.g., US Patent Number 5,641,670, issued June 24, 1997; International Publication Number WO 
96/29411; International Publication Number WO 94/12650; Koller et al. f Proc. Natl Acad. ScL 

20 USA 56:8932-8935 (1989); and Zijlstra et at, Nature 342:435^38 (1989), the disclosures of each 
of which are incorporated by reference in their entireties). 

Polypeptides of the invention can be recovered and purified from recombinant cell 
cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 

25 hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite 
chromatography and lectin chromatography. Most preferably, high performance liquid 
chromatography ("HPLC") is employed for purification. 

Polypeptides of the present invention can also be recovered from: products purified from 
natural sources, including bodily fluids, tissues and cells, whether directly isolated or cultured; 

30 products of chemical synthetic procedures; and products produced by recombinant techniques 
from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, insect, 
and mammalian cells. Depending upon the host employed in a recombinant production procedure, 
the polypeptides of the present invention may be glycosylated or may be non-glycosylated. In 
addition, polypeptides of the invention may also include an initial modified methionine residue, in 

35 some cases as a result of host-mediated processes. Thus, it is well known in the art that the N- 
terminal methionine encoded by the translation initiation codon generally is removed with high 
efficiency from any protein after translation in all eukaryotic cells. While the N-terminal 
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methionine on most proteins also is efficiently removed in most prokaryotes, for some proteins, 
this prokaryotic removal process is inefficient, depending on the nature of the amino acid to which 
the N-terminal methionine is covalently linked. 

In one embodiment, the yeast Pichia pastoris is used to express polypeptides of the 
invention in a eukaryotic system. Pichia pastoris is a methylotrophic yeast which can metabolize 
methanol as its sole carbon source. A main step in the methanol metabolization pathway is the 
oxidation of methanol to formaldehyde using 0 2 . This reaction is catalyzed by the enzyme alcohol 
oxidase. In order to metabolize methanol as its sole carbon source, Pichia pastoris must generate 
high levels of alcohol oxidase due, in part, to the relatively low affinity of alcohol oxidase for 0 2 . 
Consequently, in a growth medium depending on methanol as a main carbon source, the promoter 
region of one of the two alcohol oxidase genes (AOX1) is highly active. In the presence of 
methanol, alcohol oxidase produced from the AOX1 gene comprises up to approximately 30% of 
the total soluble protein in Pichia pastoris. See Ellis, S.B., et al„ MoL Cell Biol. 5:1111-21 
(1985); Koutz, PJ, et al p Yeast 5:167-77 (1989); Tschopp, J.R, et al., Nucl Acids Res. 15:3859-76 
(1987). Thus, a heterologous coding sequence, such as, for example, a polynucleotide of the 
present invention, under the transcriptional regulation of all or part of the AOX1 regulatory 
sequence is expressed at exceptionally high levels in Pichia yeast grown in the presence of 
methanol. 

In one example, the plasmid vector pPIC9K is used to express DNA encoding a 
polypeptide of the invention, as set forth herein, in a Picliea yeast system essentially as described 
in "Pichia Protocols: Methods in Molecular Biology," D.R. Higgins and J. Gregg, eds. The 
Humana Press, Totowa, NJ, 1998. This expression vector allows expression and secretion of a 
polypeptide of the invention by virtue of the strong AOX1 promoter linked to the Pichia pastoris 
alkaline phosphatase (PHO) secretory signal peptide (i.e., leader) located upstream of a multiple 
cloning site. 

Many other yeast vectors could be used in place of pPIC9K, such as, pYES2, pYDl, 
pTEFl/Zeo, pYES2/GS, pPICZ, pGAPZ, pGAPZalpha, pPIC9, pPIC3.5, pHIL-D2, pHIL-Sl, 
pPIC3.5K, and PA0815, as one skilled in the art would readily appreciate, as long as the proposed 
expression construct provides appropriately located signals for transcription, translation, secretion 
(if desired), and the like, including an in-frame AUG as required. 

In another embodiment, high-level expression of a heterologous coding sequence, such as, 
for example, a polynucleotide of the present invention, may be achieved by cloning the 
heterologous polynucleotide of the invention into an expression vector such as, for example, 
pGAPZ or pGAPZalpha, and growing the yeast culture in the absence of methanol. 
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In addition to encompassing host cells containing the vector constructs discussed herein, 
the invention also encompasses primary, secondary, and immortalized host cells of vertebrate 
origin, particularly mammalian origin, that have been engineered to delete or replace endogenous 
genetic material (e.g., coding sequence), and/or to include genetic material (e.g., heterologous 
5 polynucleotide sequences) that is operably associated with polynucleotides of the invention, and 
which activates, alters, and/or amplifies endogenous polynucleotides. For example, techniques 
known in the art may be used to operably associate heterologous control regions (e.g,, promoter 
and/or enhancer) and endogenous polynucleotide sequences via homologous recombination (see, 
e.g., U.S. Patent No. 5,641,670, issued June 24, 1997; International Publication No. WO 
10 96/29411, published September 26, 1996; International Publication No. WO 94/12650, published 
August 4, 1994; Koller et al., Proc. Natl. Acad. Sci. USA 86:8932-8935 (1989); and Zijlstra et al., 
Nature 342:435-438 (1989), the disclosures of each of which are incorporated by reference in their 
entireties). 

In addition, polypeptides of the invention can be chemically synthesized using techniques 
15 known in the art (e.g., see Creighton, 1983, Proteins: Structures and Molecular Principles, W.H. 
Freeman & Co., N.Y., and Hunkapiller et al., Nature, 310:105-111 (1984)). For example, a 
polypeptide corresponding to a fragment of a polypeptide can be synthesized by use of a peptide 
synthesizer. Furthermore, if desired, nonclassical amino acids or chemical amino acid analogs can 
be introduced as a substitution or addition into the polypeptide sequence. Non-classical amino 
20 acids include, but are not limited to, to the D-isomers of the common amino acids, 2,4- 
diaminobutyric acid, a-arnino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, 
g-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, 
ornithine, norleucine, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, 
t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, b-alanine, fluoro-amino acids, 
25 designer amino acids such as b-methyl amino acids, Ca-methyl amino acids, Na-methyl amino 
acids, and amino acid analogs in general. Furthermore, the amino acid can be D (dextrorotary) or 
L (levorotary). 

The invention encompasses polypeptides of the present invention which are differentially 
modified during or after translation, e.g., by glycosylation, acetylation, phosphorylation, 

30 amidation, derivatization by known protecting/blocking groups, proteolytic cleavage, linkage to an 
antibody molecule or other cellular ligand, etc. Any of numerous chemical modifications may be 
carried out by known techniques, including but not limited, to specific chemical cleavage by 
cyanogen bromide, trypsin, chymotrypsin, papain, V8 protease, NaBHj; acetylation, formylation, 
oxidation, reduction; metabolic synthesis in the presence of tunicamycin; etc. 

35 Additional post-translational modifications encompassed by the invention include, for 

example, e.g., N-linked or O-linked carbohydrate chains, processing of N-terrninal or C-terminal 
ends), attachment of chemical moieties to the amino acid backbone, chemical modifications of 
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N-linked or O-linked carbohydrate chains, and addition or deletion of an N-terminal methionine 
residue as a result of procaryotic host cell expression. The polypeptides may also be modified 
with a detectable label, such as an enzymatic, fluorescent, isotopic or affinity label to allow for 
detection and isolation of the protein. 
5 Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, beta- 

galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes 

10 luminol; examples of bioluminescent materials include luciferase, luciferin, and aequorin; and 
examples of suitable radioactive material include iodine ( 121 I, 123 I, 125 I, 131 I), carbon ( 14 C), sulfur 
( 35 S), tritium ( 3 H), indium ( ni In, n2 In, 113m In, U5m In), technetium ("Tc^c), thallium (^Ti), 
gallium (^Ga, 67 Ga), palladium ( 103 Pd), molybdenum ("Mo), xenon ( 133 Xe), fluorine ( 18 F), ^Sm, 
l77 Lu, 159 Gd, U9 Pm, 140 La, 175 Yb, 166 Ho, *Y, 47 Sc, 186 Re, l88 Re, 142 Pr, 105 Rh, and 97 Ru. 

15 In specific embodiments, a polypeptide of the present invention or fragment or variant 

thereof is attached to macrocyclic chelators that associate with radiometal ions, including but not 
limited to, 177 Lu, 90 Y, 166 Ho, and 153 Sm, to polypeptides. In a preferred embodiment, the 
radiometal ion associated with the macrocyclic chelators is ni In. In another preferred 
embodiment, the radiometal ion associated with the macrocyclic chelator is 90 Y. hi specific 

20 embodiments, the macrocyclic chelator is l,4,7,104etraazacyclododecane-NJN'JSr , ,N ,,, -tetraacetic 
acid (DOTA). In other specific embodiments, DOTA is attached to an antibody of the invention or 
fragment thereof via a linker molecule. Examples of linker molecules useful for conjugating 
DOTA to a polypeptide are commonly known in the art - see, for example, DeNardo et al., Clin 
Cancer Res. 4(10):2483-90 (1998); Peterson et al., Bioconjug. Chem. 10(4):553-7 (1999); and 

25 Zimmerman et al, Nucl. Med. Biol. 26(8):943-50 (1999); which are hereby incorporated by 
reference in their entirety. 

As mentioned, the proteins of the invention may be modified by either natural processes, 
such as posttranslarional processing, or by chemical modification techniques which are well 
known in the art. It will be appreciated that the same type of modification may be present in the 

30 same or varying degrees at several sites in a given polypeptide. Polypeptides of the invention may 
be branched, for example, as a result of ubiquitination, and they may be cyclic, with or without 
branching. Cyclic, branched, and branched cyclic polypeptides may result from posttranslation 
natural processes or may be made by synthetic methods. Modifications include acetylation, 
acylation, ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a 

35 heme moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of 
a lipid or lipid derivative, covalent attachment of phosphotidylinositol, cross-linking, cyclization, 
disulfide bond formation, demethylation, formation of covalent cross-links, formation of cysteine, 
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formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor 
formation, hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation, 
proteolytic processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, 
transfer-RNA mediated addition of amino acids to proteins such as arginylation, and 
5 ubiquitination. (See, for instance, PROTEINS - STRUCTURE AND MOLECULAR 
PROPERTIES, 2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York (1993); 
POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., 
Academic Press, New York, pgs. 1-12 (1983); Seifter et al., Meth. Enzymol. 182:626-646 (1990); 
Rattan et aL, Ann. N.Y. Acad. Sci. 663:48-62 (1992)). 

10 Also provided by the invention are chemically modified derivatives of the polypeptides of 

the invention which may provide additional advantages such as increased solubility, stability and 
circulating time of the polypeptide, or decreased immunogenicity (see U.S. Patent No. 4,179,337). 
The chemical moieties for derivitization may be selected from water soluble polymers such as 
polyethylene glycol, ethylene glycol/propylene glycol copolymers, carboxymethylcellulose, 

15 dextran, polyvinyl alcohol and the like. The polypeptides may be modified at random positions 
within the molecule, or at predetermined positions within the molecule and may include one, two, 
three or more attached chemical moieties. 

The polymer may be of any molecular weight, and may be branched or unbranched. For 
polyethylene glycol, the preferred molecular weight is between about i kDa and about 100 kDa 

20 (the term "about" indicating that in preparations of polyethylene glycol, some molecules will 
weigh more, some less, than the stated molecular weight) for ease in handling and manufacturing. 
Other sizes may be used, depending on the desired therapeutic profile (e.g., the duration of 
sustained release desired, the effects, if any on biological activity, the ease in handling, the degree 
or lack of antigenicity and other known effects of the polyethylene glycol to a therapeutic protein 

25 or analog). For example, the polyethylene glycol may have an average molecular weight of about 
200, 500, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000, 5500, 6000, 6500, 7000, 7500, 
8000, 8500, 9000, 9500, 10,000, 10,500, 11,000, 11,500, 12,000, 12,500, 13,000, 13,500, 14,000, 
14,500, 15,000, 15,500, 16,000, 16,500, 17,000, 17,500, 18,000, 18,500, 19,000, 19,500, 20,000, 
25,000, 30,000, 35,000, 40,000, 45,000, 50,000, 55,000, 60,000, 65,000, 70,000, 75,000, 80,000, 

30 85,000, 90,000, 95,000, or 100,000 kDa. 

As noted above, the polyethylene glycol may have a branched structure. Branched 
polyethylene glycols are described, for example, in U.S. Patent No. 5,643,575; Morpurgo et aL, 
Appl. Biochem. BiotechnoL 56:59-72 (1996); Vorobjev et al., Nucleosides Nucleotides 18:2145- 
2750 (1999); and Caliceti et al., Bioconjug. Chem. 70:638-646 (1999), the disclosures of each of 

35 which are incorporated herein by reference. 

The polyethylene glycol molecules (or other chemical moieties) should be attached to the 
protein with consideration of effects on functional or antigenic domains of the protein. There are a 

547 



WO 02/076488 



PCTAJS02/08276 



number of attachment methods available to those skilled in the art, such as, for example, the 
method disclosed in EP 0 401 384 (coupling PEG to G-CSF), herein incorporated by reference; see 
also Malik et al., Exp. Hematol. 20:1028-1035 (1992), reporting pegylation of GM-CSF using 
tresyl chloride. For example, polyethylene glycol may be covalentiy bound through amino acid 
residues via a reactive group, such as a free amino or carboxyl group. Reactive groups are those to 
which an activated polyethylene glycol molecule may be bound. The amino acid residues having a 
free amino group may include lysine residues and the N-terminal amino acid residues; those 
having a free carboxyl group may include aspartic acid residues glutamic acid residues and the 
C4erminal amino acid residue. Sulfhydryl groups may also be used as a reactive group for 
attaching the polyethylene glycol molecules. Preferred for therapeutic purposes is attachment at 
an amino group, such as attachment at the N-terminus or lysine group. 

As suggested above, polyethylene glycol may be attached to proteins via linkage to any of 
a number of amino acid residues. For example, polyethylene glycol can be linked to proteins via 
covalent bonds to lysine, histidine, aspartic acid, glutamic acid, or cysteine residues. One or more 
reaction chemistries may be employed to attach polyethylene glycol to specific amino acid 
residues (e.g., lysine, histidine, aspartic acid, glutamic acid, or cysteine) of the protein or to more 
than one type of amino acid residue (e.g., lysine, histidine, aspartic acid, glutamic acid, cysteine 
and combinations thereof) of the protein. 

One may specifically desire proteins chemically modified at the' N-terminus. Using 
polyethylene glycol as an illustration of the present composition, one may select from a variety of 
polyethylene glycol molecules (by molecular weight, branching, etc.), the proportion of 
polyethylene glycol molecules to protein (polypeptide) molecules in the reaction mix, the type of 
pegylation reaction to be performed, and the method of obtaining the selected N-terminally 
pegylated protein. The method of obtaining the N-tenninally pegylated preparation (i.e., 
separating this moiety from other monopegylated moieties if necessary) may be by purification of 
the N-terminally pegylated material from a population of pegylated protein molecules. Selective 
proteins chemically modified at the N-tenriinus modification may be accomplished by reductive 
alkylation which exploits differential reactivity of different types of primary amino groups (lysine 
versus the N-terminal) available for derivatization in a particular protein. Under the appropriate 
reaction conditions, substantially selective derivatization of the protein at the N-terminus with a 
carbonyl group containing polymer is achieved. 

As indicated above, pegylation of the proteins of the invention may be accomplished by 
any number of means. For example, polyethylene glycol may be attached to the protein either 
directly or by an intervening linker. Linkerless systems for attaching polyethylene glycol to 
proteins are described in Delgado et al., Crit. Rev. Thera. Drug Carrier Sys. 9:249-304 (1992); 
Francis et al., Intern. J. of Hematol. 68:1-18 (1998); U.S. Patent No. 4,002^31; U.S. Patent No. 
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5,349,052; WO 95/06058; and WO 98/32466, the disclosures of each of which are incorporated 
herein by reference. 

One system for attaching polyethylene glycol directly to amino acid residues of proteins 
without an intervening linker employs tresylated MPEG, which is produced by the modification of 
5 monmethoxy polyethylene glycol (MPEG) using tresylchloride (CISO2CH2CF3). Upon reaction of 
protein with tresylated MPEG, polyethylene glycol is directly attached to amine groups of the 
protein. Thus, the invention includes protein-polyethylene glycol conjugates produced by reacting 
proteins of the invention with a polyethylene glycol molecule having a 2,2,2-trifluoreothane 
sulphonyl group. 

10 Polyethylene glycol can also be attached to proteins using a number of different 

intervening linkers. For example, U.S. Patent No. 5,612,460, the entire disclosure of which is 
incorporated herein by reference, discloses urethane linkers for connecting polyethylene glycol to 
proteins. Protein-polyethylene glycol conjugates wherein the polyethylene glycol is attached to 
the protein by a linker can also be produced by reaction of proteins with compounds such as 

15 MPEG-succinimidylsuccinate, MPEG activated with l.l'-carbonyldiimidazole, MPEG- 
2,4,5-trichloropenylcarbonate, MPEG-p-nitrophenolcarbonate, and various MPEG-succinate 
derivatives. A number of additional polyethylene glycol derivatives and reaction chemistries for 
attaching polyethylene glycol to proteins are described in International Publication No. 
WO 98/32466, the entire disclosure of which is incorporated herein by reference. Pegylated 

20 protein products produced using the reaction chemistries set out herein are included within the 
scope of the invention. 

The number of polyethylene glycol moieties attached to each protein of the invention (i.e., 
the degree of substitution) may also vary. For example, the pegylated proteins of the invention 
may be linked, on average, to 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 15, 17, 20, or more polyethylene 
25 glycol molecules. Similarly, the average degree of substitution within ranges such as 1-3, 2-4, 3-5, 
4-6, 5-7, 6-8, 7-9, 8-10, 9-11. 10-12, 11-13, 12-14, 13-15, 14-16, 15-17, 16-18, 17-19, or 18-20 
polyethylene glycol moieties per protein molecule. Methods for determining the degree of 
substitution are discussed, for example, in Delgado et al., Crit Rev. Thera. Drug Carrier Sys. 
9:249-304 (1992). 

30 The polypeptides of the invention can be recovered and purified from chemical synthesis 

and recombinant cell cultures by standard methods which include, but are not limited to, 
ammonium sulfate or ethanol precipitation, acid extraction, anion- or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and lectin chromatography. Most 

35 preferably, high performance liquid chromatography ( n HPLC") is employed for purification. Well 
known techniques for refolding protein may be employed to regenerate active conformation when 
the polypeptide is denatured during isolation and/or purification. 
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The polypeptides of the invention may be in monomers or multimers (i.e., dimers, trimers, 
tetramers and higher multimers). Accordingly, the present invention relates to monomers and 
multimers of the polypeptides of the invention, their preparation, and compositions (preferably, 
Therapeutics) containing them. Li specific embodiments, the polypeptides of the invention are 
5 monomers, dimers, trimers or tetramers. In additional embodiments, the multimers of the 
invention are at least dimers, at least trimers, or at least tetramers. 

Multimers encompassed by the invention may be homomers or heteromers. As used 
herein, the term homomer refers to a multimer containing only polypeptides corresponding to a 
protein of the invention (e.g., the amino acid sequence of SEQ ID NO:Y, an amino acid sequence 

10 encoded by SEQ ID NO:X or the complement of SEQ ID NO:X, the amino acid sequence encoded 
by the portion of SEQ ID NO:X as defined in columns 8 and 9 of Table 2, and/or an amino acid 
sequence encoded by cDNA contained in ATCC Deposit No:Z (including fragments, variants, 
splice variants, and fusion proteins, corresponding to these as described herein)). These homomers 
may contain polypeptides having identical or different amino acid sequences. In a specific 

15 embodiment, a homomer of the invention is a multimer containing only polypeptides having an 
identical amino acid sequence. In another specific embodiment, a homomer of the invention is a 
multimer containing polypeptides having different amino acid sequences. In specific 
embodiments, the multimer of the invention is a homodimer (e.g., containing two polypeptides 
having identical or different amino acid sequences) or a homotrimer (e.g., containing three 

20 polypeptides having identical and/or different amino acid sequences). In additional embodiments, 
the homomeric multimer of the invention is at least a homodimer, at least a homotrimer, or at least 
a homotetramer. 

As used herein, the term heteromer refers to a multimer containing one or more 
heterologous polypeptides (i.e., polypeptides of different proteins) in addition to the polypeptides 

25 of the invention. In a specific embodiment, the multimer of the invention is a heterodimer, a 
heterotrimer, or a heterotetramer. In additional embodiments, the heteromeric multimer of the 
invention is at least a heterodimer, at least a heterotrimer, or at least a heterotetramer. 

Multimers of the invention may be the result of hydrophobic, hydrophilic, ionic and/or 
covalent associations and/or may be indirectly linked by, for example, liposome formation. Thus, 

30 in one embodiment, multimers of the invention, such as, for example, homodimers or 
homotrimers, are formed when polypeptides of the invention contact one another in solution. In 
another embodiment, heteromultimers of the invention, such as, for example, heterotrimers or 
heterotetramers, are formed when polypeptides of the invention contact antibodies to the 
polypeptides of the invention (including antibodies to the heterologous polypeptide sequence in a 

35 fusion protein of the invention) in solution. In other embodiments, multimers of the invention are 
formed by covalent associations with and/or between the polypeptides of the invention. Such 
covalent associations may involve one or more amino acid residues contained in the polypeptide 
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sequence (e.g., that recited in SEQ ID NO:Y, encoded by the portion of SEQ ID NO:X as defined 
in columns 8 and 9 of Table 2, and/or encoded by the cDNA contained in ATCC Deposit No:Z). 
In one instance, the covalent associations are cross-linking between cysteine residues located 
within the polypeptide sequences which interact in the native (i.e., naturally occurring) 
polypeptide. In another instance, the covalent associations are the consequence of chemical or 
recombinant manipulation. Alternatively, such covalent associations may involve one or more 
amino acid residues contained in the heterologous polypeptide sequence in a fusion protein. In 
one example, covalent associations are between the heterologous sequence contained in a fusion 
protein of the invention (see, e.g., US Patent Number 5,478,925). In a specific example, the 
covalent associations are between the heterologous sequence contained in a Fc fusion protein of 
the invention (as described herein). In another specific example, covalent associations of fusion 
proteins of the invention are between heterologous polypeptide sequence from another protein that 
is capable of forming covalently associated multimers, such as for example, osteoprotegerin (see, 
e.g., International Publication NO: WO 98/49305, the contents of which are herein incorporated by 
reference in its entirety). In another embodiment, two or more polypeptides of the invention are 
joined through peptide linkers. Examples include those peptide linkers described in U.S. Pat. No. 
5,073,627 (hereby incorporated by reference). Proteins comprising multiple polypeptides of the 
invention separated by peptide linkers may be produced using conventional recombinant DNA 
technology. 

Another method for preparing multimer polypeptides of the invention involves use of 
polypeptides of the invention fused to a leucine zipper or isoleucine zipper polypeptide sequence. 
Leucine zipper and isoleucine zipper domains are polypeptides that promote multimerization of 
the proteins in which they are found. Leucine zippers were originally identified in several DNA- 
binding proteins (Landschulz et al., Science 240:1759, (1988)), and have since been found in a 
variety of different proteins. Among the known leucine zippers are naturally occurring peptides 
and derivatives thereof that dimerize or trirnerize. Examples of leucine zipper domains suitable 
for producing soluble multimeric proteins of the invention are those described in PCT application 
WO 94/10308, hereby incorporated by reference. Recombinant fusion proteins comprising a 
polypeptide of the invention fused to- a polypeptide sequence that dimerizes or trimerizes in 
solution are expressed in suitable host cells, and the resulting soluble multimeric fusion protein is 
recovered from the culture supernatant using techniques known in the art. 

Trimeric polypeptides of the invention may offer the advantage of enhanced biological 
activity. Preferred leucine zipper moieties and isoleucine moieties are those that preferentially 
form trimers. One example is a leucine zipper derived from lung surfactant protein D (SPD), as 
described in Hoppe et al. (FEBS Letters 344:191, (1994)) and in U.S. patent application Ser. No. 
08/446,922, hereby incorporated by reference. Other peptides derived from naturally occurring 
trimeric proteins may be employed in preparing trimeric polypeptides of the invention. 
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In another example, proteins of the invention are associated by interactions between 
Flag® polypeptide sequence contained in fusion proteins of the invention containing Flag® 
polypeptide sequence. In a further embodiment, proteins of the invention are associated by 
interactions between heterologous polypeptide sequence contained in Flag® fusion proteins of the 
5 invention and anti-Rag® antibody. 

The multimers of the invention may be generated using chemical techniques known in the 
art. For example, polypeptides desired to be contained in the multimers of the invention may be 
chemically cross-linked using linker molecules and linker molecule length optimization techniques 
known in the art (see, e.g., US Patent Number 5,478,925, which is herein incorporated by 

10 reference in its entirety). Additionally, multimers of the invention may be generated using 
techniques known in the art to form one or more inter-molecule cross-links between the cysteine 
residues located within the sequence of the polypeptides desired to be contained in the multimer 
(see, e.g., US Patent Number 5,478,925, which is herein incorporated by reference in its entirety). 
Further, polypeptides of the invention may be routinely modified by the addition of cysteine or 

15 biotin to the C-tenninus or N-terminus of the polypeptide and techniques known in the art may be 
applied to generate multimers containing one or more of these modified polypeptides (see, e.g., US 
Patent Number 5,478,925, which is herein incorporated by reference in its entirety). Additionally, 
techniques known in the art may be applied to generate liposomes containing the polypeptide 
components desired to be contained in the multimer of the invention (see, e.g., US Patent Number 

20 5,478,925, which is herein incorporated by reference in its entirety). 

Alternatively, multimers of the invention may be generated using genetic engineering 
techniques known in the art In one embodiment, polypeptides contained in multimers of the 
invention are produced recombinantly using fusion protein technology described herein or 
otherwise known in the art (see, e.g., US Patent Number 5,478,925, which is herein incorporated 

25 by reference in its entirety). In a specific embodiment, polynucleotides coding for a homodimer of 
the invention are generated by ligating a polynucleotide sequence encoding a polypeptide of the 
invention to a sequence encoding a linker polypeptide and then further to a synthetic 
polynucleotide encoding the translated product of the polypeptide in the reverse orientation from 
the original C-terminus to the N^erminus (lacking the leader sequence) (see, e.g., US Patent 

30 Number 5,478,925, which is herein incorporated by reference in its entirety). In another 
embodiment, recombinant techniques described herein or otherwise known in the art are applied to 
generate recombinant polypeptides of the invention which contain a transmembrane domain (or 
hydrophobic or signal peptide) and which can be incorporated by membrane reconstitution 
techniques into liposomes (see, e.g., US Patent Number 5,478,925, which is herein incorporated by 

35 reference in its entirety). 

Antibodies 
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Further polypeptides of the invention relate to antibodies and T-cell antigen receptors 
(TCR) which irnmunospecifically bind a polypeptide, polypeptide fragment, or variant of the 
invention (e.g., a polypeptide or fragment or variant of the amino acid sequence of SEQ ID NO:Y 
or a polypeptide encoded by the cDNA contained in ATCC Deposit No:Z, and/or an epitope, of 
5 the present invention) as determined by immunoassays well known in the art for assaying specific 
antibody-antigen binding. Antibodies of the invention include, but are not limited to, polyclonal, 
monoclonal, multispecific, human, humanized or chimeric antibodies, single chain antibodies, Fab 
fragments, F(ab') fragments, fragments produced by a Fab expression library, anti-idiotypic (anti- 
Id) antibodies (including, e.g., anti-Id antibodies to antibodies of the invention), intracellularly- 

10 made antibodies (i.e., intrabodies), and epitope-binding fragments of any of the above. The term 
"antibody," as used herein, refers to immunoglobulin molecules and immunologically active 
portions of immunoglobulin molecules, i.e., molecules that contain an antigen binding site that 
irnmunospecifically binds an antigen. The immunoglobulin molecules of the invention can be of 
any type (e.g., IgG, IgE, IgM, IgD, IgA and IgY), class (e.g., IgGl, IgG2, IgG3, IgG4, IgAl and 

15 IgA2) or subclass of immunoglobulin molecule. In preferred embodiments, the immunoglobulin 
molecules of the invention are IgGl. In other preferred embodiments, the immunoglobulin 
molecules of the invention are IgG4. 

Most preferably the antibodies are human antigen-binding antibody fragments of the 
present invention and include, but are not limited to, Fab, Fab' and F(ab')2, Fd, single-chain Fvs 

20 (scFv), single-chain antibodies, disulfide-linked Fvs (sdFv) and fragments comprising either a VL 
or VH domain. Antigen-binding antibody fragments, including single-chain antibodies, may 
comprise the variable region(s) alone or in combination with the entirety or a portion of the 
following: hinge region, CHI, CH2, and CH3 domains. Also included in the invention are 
antigen-binding fragments also comprising any combination of variable region(s) with a hinge 

25 region, CHI, CH2, and CH3 domains. The antibodies of the invention may be from any animal 
origin including birds and mammals. Preferably, the antibodies are human, murine (e.g., mouse 
and rat), donkey, ship rabbit, goat, guinea pig, camel, horse, or chicken. As used herein, "human" 
antibodies include antibodies having the amino acid sequence of a human immunoglobulin and 
include antibodies isolated from human immunoglobulin libraries or from animals transgenic for 

30 one or more human immunoglobulin and that do not express endogenous immunoglobulins, as 
described infra and, for example in, U.S. Patent No. 5,939,598 by Kucherlapati et al. 

The antibodies of the present invention may be monospecific, bispecific, trispecific or of 
greater multispecificity. Multispecific antibodies may be specific for different epitopes of a 
polypeptide of the present invention or may be specific for both a polypeptide of the present 

35 invention as well as for a heterologous epitope, such as a heterologous polypeptide or solid support 
material. See, e.g., PCT publications WO 93/17715; WO 92/08802; WO 91/00360; WO 92/05793; 
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Tutt, et al., J. Immunol. 147:60-69 (1991); U.S. Patent Nos. 4,474,893; 4,714,681; 4,925,648; 
5,573,920; 5,601,819; Kostelny et al., J. Immunol. 148:1547-1553 (1992). 

Antibodies of the present invention may be described or specified in terms of the 
epitope(s) or portion(s) of a polypeptide of the present invention which they recognize or 
5 specifically bind. The epitope(s) or polypeptide portion(s) may be specified as described herein, 
e.g., by N-terminal and C-terrninal positions, or by size in contiguous amino acid residues, or 
listed in the Tables and Figures. Preferred epitopes of the invention include the predicted epitopes 
shown in column 7 of Table 1B.1, as well as polynucleotides that encode these epitopes. 
Antibodies which specifically bind any epitope or polypeptide of the present invention may also be 

10 excluded. Therefore, the present invention includes antibodies that specifically bind polypeptides 
of the present invention, and allows for the exclusion of the same. 

Antibodies of the present invention may also be described or specified in terms of their 
cross-reactivity. Antibodies that do not bind any other analog, ortholog, or homolog of a 
polypeptide of the present invention are included. Antibodies that bind polypeptides with at least 

15 95%, at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least 65%, at least 
60%, at least 55%, and at least 50% identity (as calculated using methods known in the art and 
described herein) to a polypeptide of the present invention are also included in the present 
invention. In specific embodiments, antibodies of the present invention cross-react with murine, 
rat and/or rabbit homologs of human proteins and the corresponding epitopes thereof. Antibodies 

20 that do not bind polypeptides with less than 95%, less than 90%, less than 85%, less than 80%, less 
than 75%, less than 70%, less than 65%, less than 60%, less than 55%, and less than 50% identity 
(as calculated using methods known in the art and described herein) to a polypeptide of the present 
invention are also included in the present invention. In a specific embodiment, the above- 
described cross-reactivity is with respect to any single specific antigenic or immunogenic 

25 polypeptide, or combination(s) of 2, 3, 4, 5, or more of the specific antigenic and/or immunogenic 
polypeptides disclosed herein. Further included in the present invention are antibodies which bind 
polypeptides encoded by polynucleotides which hybridize to a polynucleotide of the present 
invention under stringent hybridization conditions (as described herein). Antibodies of the present 
invention may also be described or specified in terms of their binding affinity to a polypeptide of 

30 the invention. Preferred binding affinities include those with a dissociation constant or Kd less 
than 5 X 10" 2 M, 10" 2 M, 5 X 10* 3 M, 10 3 M, 5 X 10" 4 M, 10" 4 M, 5 X 10" 5 M, 10 s M, 5 X 10* M, 
10-*M, 5 X 10' 7 M, 10 7 M, 5 X 10" 8 M, 10^ M, 5 X 1(T 9 M, 10" 9 M, 5 X I0' i0 M, 10" 10 M, 5 X Iff 11 
M, 10 n M, 5 X 10 12 M, 10* 12 M, 5 X 10' 13 M, 10' 13 M, 5 X 10' 14 M, 10 14 M, 5 X 10* 15 M, or 10" ls 
M. 

35 The invention also provides antibodies that competitively inhibit binding of an antibody to 

an epitope of the invention as determined by any method known in the art for detennining 
competitive binding, for example, the immunoassays described herein. In preferred embodiments, 
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the antibody competitively inhibits binding to the epitope by at least 95%, at least 90%, at least 85 
%, at least 80%, at least 75%, at least 70%, at least 60%, or at least 50%. 

Antibodies of the present invention may act as agonists or antagonists of the polypeptides 
of the present invention. For example, the present invention includes antibodies which disrupt the 
5 receptor/ligand interactions with the polypeptides of the invention either partially or fully. 
Preferably, antibodies of the present invention bind an antigenic epitope disclosed herein, or a 
portion thereof. The invention features both receptor-specific antibodies and ligand-specific 
antibodies. The invention also features receptor-specific antibodies which do not prevent ligand 
binding but prevent receptor activation. Receptor activation (i.e., signaling) may be deterrnined by 

10 techniques described herein or otherwise known in the art. For example, receptor activation can 
be determined by detecting the phosphorylation (e.g., tyrosine or serine/threonine) of the receptor 
or its substrate by immunoprecipitation followed by western blot analysis (for example, as 
described supra). In specific embodiments, antibodies are provided that inhibit ligand activity or 
receptor activity by at least 95%, at least 90%, at least 85%, at least 80%, at least 75%, at least 

15 70%, at least 60%, or at least 50% of the activity in absence of the antibody. 

The invention also features receptor-specific antibodies which both prevent ligand binding 
and receptor activation as well as antibodies that recognize the receptor-ligand complex, and, 
preferably, do not specifically recognize the unbound receptor or the unbound ligand. Likewise, 
included in the invention are neutralizing antibodies which bind the ligand and prevent binding of 

20 the ligand to the receptor, as well as antibodies which bind the ligand, thereby preventing receptor 
activation, but do not prevent the ligand from binding the receptor. Further included in the 
invention are antibodies which activate the receptor. These antibodies may act as receptor 
agonists, i.e., potentiate or activate either all or a subset of the biological activities of the ligand- 
mediated receptor activation, for example, by inducing dimerization of the receptor. The 

25 antibodies may be specified as agonists, antagonists or inverse agonists for biological activities 
comprising the specific biological activities of the peptides of the invention disclosed herein. The 
above antibody agonists can be made using methods known in the art. See, e.g., PCT publication 
WO 96/40281; U.S. Patent No. 5,811,097; Deng et aL, Blood 92(6): 1981-1988 (1998); Chen et aL, 
Cancer Res. 58(16):3668-3678 (1998); Harrop et aL, J. Immunol. 161(4): 1786-1794 (1998); Zhu 

30 et al., Cancer Res. 58(15):3209-3214 (1998); Yoon et aL, J. Immunol. 160(7):3 170-3 179 (1998); 
Prat et al., J. Cell. Sci. lll(Pt2):237-247 (1998); Pitard et al., J. Immunol. Methods 205(2):177- 
190 (1997); Liautard et al., Cytokine 9(4):233-241 (1997); Carlson et al., J. Biol. Chera. 
272(17): 11295-1 1301 (1997); Taryman et al., Neuron 14(4):755-762 (1995); Muller et al., 
Structure 6(9): 1153-1 167 (1998); Bartunek et al., Cytokine 8(l):14-20 (1996) (which are all 

35 incorporated by reference herein in their entireties). 

Antibodies of the present invention may be used, for example, to purify, detect, and target 
the polypeptides of the present invention, including both in vitro and in vivo diagnostic and 
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therapeutic methods. For example, the antibodies have utility in immunoassays for qualitatively 
and quantitatively measuring levels of the polypeptides of the present invention in biological 
samples. See, e.g., Harlow et al., Antibodies: A Laboratory Manual, (Cold Spring Harbor 
Laboratory Press, 2nd ed. 1988); incorporated by reference herein in its entirety. 

As discussed in more detail below, the antibodies of the present invention may be used 
either alone or in combination with other compositions. The antibodies may further be 
recombinantly fused to a heterologous polypeptide at the N- or C^enninus or chemically 
conjugated (including covalent and non-covalent conjugations) to polypeptides or other 
compositions. For example, antibodies of the present invention may be recombinantly fused or 
conjugated to molecules useful as. labels in detection assays and effector molecules such as 
heterologous polypeptides, drugs, radionuclides, or toxins. See, e.g., PCT publications WO 
92/08495; WO 91/14438; WO 89/12624; U.S. Patent No. 5,314,995; and EP 396,387; the 
disclosures of which are incorporated herein by reference in their entireties. 

The antibodies of the invention include derivatives that are modified, i.e, by the covalent 
attachment of any type of molecule to the antibody such that covalent attachment does not prevent 
the antibody from generating an anti-idiotypic response. For example, but not by way of 
limitation, the antibody derivatives include antibodies that have been modified, e.g., by 
glycosylation, acetylation, pegylation, phosphylation, amidation, derivatization by known 
protecting/blocking groups, proteolytic cleavage, linkage to a cellular ligand or other protein, etc. 
Any of numerous chemical modifications may be carried out by known techniques, including, but 
not limited to specific chemical cleavage, acetylation, formylation, metabolic synthesis of 
tunicamycin, etc. Additionally, the derivative may contain one or more non-classical amino acids. 

The antibodies of the present invention may be generated by any suitable method known 
in the art Polyclonal antibodies to an antigen-of- interest can be produced by various procedures 
well known in the art. For example, a polypeptide of the invention can be administered to various 
host animals including, but not limited to, rabbits, mice, rats, etc. to induce the production of sera 
containing polyclonal antibodies specific for the antigen. Various adjuvants may be used to 
increase the immunological response, depending on the host species, and include but are not 
limited to, Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface 
active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 
keyhole limpet hemocyanins, dinitrophenol, and potentially useful human adjuvants such as BCG 
(bacille Calmette-Guerin) and corynebacterium parvum. Such adjuvants are also well known in 
the art. 

Monoclonal antibodies can be prepared using a wide variety of techniques known in the 
art including the use of hybridoma, recombinant, and phage display technologies, or a combination 
thereof. For example, monoclonal antibodies can be produced using hybridoma techniques 
including those known in the art and taught, for example, in Harlow et al, Antibodies: A 
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Laboratory Manual, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988); Hamrnerling, et aL, in: 
Monoclonal Antibodies and T-Cell Hybridomas 563-681 (Elsevier, N.Y., 1981) (said references 
incorporated by reference in their entireties). The term "monoclonal antibody" as used herein is 
not limited to antibodies produced through hybridoma technology. The term "monoclonal 
antibody" refers to an antibody that is derived from a single clone, including any eukaryotic, 
prokaryotic, or phage clone, and not the method by which it is produced. 

Methods for producing and screening for specific antibodies using hybridoma technology 
are routine and well known in the art and are discussed in detail in the Examples. In a non- 
hmiting example, mice can be immunized with a polypeptide of the invention or a cell expressing 
such peptide. Once an immune response is detected, e.g., antibodies specific for the antigen are 
detected in the mouse serum, the mouse spleen is harvested and splenocytes isolated. The 
splenocytes are then fused by well known techniques to any suitable myeloma cells, for example 
cells from cell line SP20 available from the ATCC. Hybridomas are selected and cloned by 
limited dilution. The hybridoma clones are then assayed by methods known in the art for cells that 
secrete antibodies capable of binding a polypeptide of the invention. Ascites fluid, which 
generally contains high levels of antibodies, can be generated by immunizing mice with positive 
hybridoma clones. 

Accordingly, the present invention provides methods of generating monoclonal antibodies 
as well as antibodies produced by the method comprising culturing a hybridoma cell secreting an 
antibody of the invention wherein, preferably, the hybridoma is generated by fusing splenocytes 
isolated from a mouse immunized with an antigen of the invention with myeloma cells and then 
screening the hybridomas resulting from the fusion for hybridoma clones that secrete an antibody 
able to bind a polypeptide of the invention. 

Another well known method for producing both polyclonal and monoclonal human B cell 
lines is transformation using Epstein Barr Virus (EBV). Protocols for generating EBV- 
transformed B cell lines are commonly known in the art, such as, for example, the protocol 
outlined in Chapter 7.22 of Current Protocols in Immunology, Coligan et aL, Eds., 1994, John 
Wiley & Sons, NY, which is hereby incorporated in its entirety by reference. The source of B 
cells for transformation is commonly human peripheral blood, but B cells for transformation may 
also be derived from other sources including, but not limited to, lymph nodes, tonsil, spleen, tumor 
tissue, and infected tissues. Tissues are generally made into single cell suspensions prior to EBV 
transformation. Additionally, steps may be taken to either physically remove or inactivate T cells 
(e.g., by treatment with cyclosporin A) in B cell-containing samples, because T ceDs from 
individuals seropositive for anti-EBV antibodies can suppress B cell immortalization by EBV. 

In general, the sample containing human B cells is innoculated with EBV, and cultured for 
3-4 weeks. A typical source of EBV is the culture supernatant of the B95-8 cell line (ATCC #VR- 
1492). Physical signs of EBV transformation can generally be seen towards the end of the 
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week culture period. By phase-contrast microscopy, transformed cells may appear large, clear, 
hairy and tend to aggregate in tight clusters of cells. Initially, EBV lines are generally polyclonal. 
However, over prolonged periods of cell cultures, EBV lines may become monoclonal or 
polyclonal as a result of the selective outgrowth of particular B cell clones. Alternatively, 
5 polyclonal EBV transformed lines may be subcloned (e.g., by limiting dilution culture) or fused 
with a suitable fusion partner and plated at limiting dilution to obtain monoclonal B cell lines. 
Suitable fusion partners for EBV transformed cell lines include mouse myeloma cell lines (e.g., 
SP2/0, X63-Ag8.653), heteromyeloma cell lines (human x mouse; e.g, SPAM-8, SBC-H20, and 
CB-F7), and human cell lines (e.g., GM 1500, SKO-007, RPMI 8226, and KR-4). Thus, the 

10 present invention also provides a method of generating polyclonal or monoclonal human 
antibodies against polypeptides of the invention or fragments thereof, comprising EBV- 
transformation of human B cells. 

Antibody fragments which recognize specific epitopes may be generated by known 
techniques. For example, Fab and F(ab')2 fragments of the invention may be produced by 

15 proteolytic cleavage of immunoglobulin molecules, using enzymes such as papain (to produce Fab 
fragments) or pepsin (to produce F(ab*)2 fragments). F(ab')2 fragments contain the variable 
region, the light chain constant region and the CHI domain of the heavy chain. 

For example, the antibodies of the present invention can also be generated using various 
phage display methods known in the art. In phage display methods, functional antibody domains 

20 are displayed on the surface of phage particles which carry the polynucleotide sequences encoding 
them. In a particular embodiment, such phage can be utilized to display antigen binding domains 
expressed from a repertoire or combinatorial antibody library (e.g., human or murine). Phage 
expressing an antigen binding domain that binds the antigen of interest can be selected or 
identified with antigen, e.g., using labeled antigen or antigen bound or captured to a solid surface 

25 or bead. Phage used in these methods are typically filamentous phage including fd and M13 
binding domains expressed from phage with Fab, Fv or disulfide stabilized Fv antibody domains 
recombinantly fused to either the phage gene III or gene VIII protein. Examples of phage display 
methods that can be used to make the antibodies of the present invention include those disclosed 
in Brinkman et al., J. Immunol. Methods 182:41-50 (1995); Ames et al., J. Immunol. Methods 

30 184: 177-186 (1995); Ketdeborough et al., Eur. J. Immunol. 24:952-958 (1994); Persic et al., Gene 
187 9-18 (1997); Burton et al., Advances in Immunology 57:191-280 (1994); PCT application 
No. PCT/GB91/01134;.PCT publications WO 90/02809; WO 91/10737; WO 92/01047; WO 
92/18619; WO 93/11236; WO 95/15982; WO 95/20401; and U.S. Patent Nos. 5,698,426; 
5,223,409; 5,403,484; 5,580,717; 5,427,908; 5,750,753; 5,821,047; 5,571,698; 5,427,908; 

35 5,516,637; 5,780,225; 5,658,727; 5,733,743 and 5,969,108; each of which is incorporated herein 
by reference in its entirety. 
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As described in the above references, after phage selection, the antibody coding regions 
from the phage can be isolated and used to generate whole antibodies, including human antibodies, 
or any other desired antigen binding fragment, and expressed in any desired host, including 
mammalian cells, insect cells, plant cells, yeast, and bacteria, e.g., as described in detail below. 
5 For example, techniques to recombinantly produce Fab, Fab and F(ab*)2 fragments can also be 
employed using methods known in the art such as those disclosed in PCT publication WO 
92/22324; Mullinax et al., BioTechniques 12(6):864-869 (1992); and Sawai et al., AJRI 34:26-34 
(1995); and Better et al., Science 240:1041-1043 (1988) (said references incorporated by reference 
in their entireties). 

10 Examples of techniques which can be used to produce single-chain Fvs and antibodies 

include those described in U.S. Patents 4,946,778 and 5,258,498; Huston et al., Methods in 
Enzymology 203:46-88 (1991); Shu et al., PNAS 90:7995-7999 (1993); and Skerra et al., Science 
240:1038-1040 (1988). For some uses, including in vivo use of antibodies in humans and in vitro 
detection assays, it may be preferable to use chimeric, humanized, or human antibodies. A 

15 chimeric antibody is a molecule in which different portions of the antibody are derived from 
different animal species, such as antibodies having a variable region derived from a murine 
monoclonal antibody and a human immunoglobulin constant region. Methods for producing 
chimeric antibodies are known in the art. See e.g., Morrison, Science 229:1202 (1985); Oi et al., 
BioTechniques 4:214 (1986); Gillies et al., (1989) J. Immunol. Methods 125:191-202; U.S. Patent 

20 Nos. 5,807,715; 4,816,567; and 4,816397, which are incorporated herein by reference in their 
entirety. Humanized antibodies are antibody molecules from non-human species antibody that 
binds the desired antigen having one or more complementarity determining regions (CDRs) from 
the non-human species and a framework regions from a human immunoglobulin molecule. Often, 
framework residues in the human framework regions will be substituted with the corresponding 

25 residue from the CDR donor antibody to alter, preferably improve, antigen binding. These 
framework substitutions are identified by methods well known in the art, e.g., by modeling of the 
interactions of the CDR and framework residues to identify framework residues important for 
antigen binding and sequence comparison to identify unusual framework residues at particular 
positions. (See, e.g., Queen et al., U.S. Patent No. 5,585,089; Riechmann et al., Nature 332:323 

30 . (1988), which are incorporated herein by reference in their entireties.) Antibodies can be 
humanized using a variety of techniques known in the art including, for example, CDR-grafting 
(EP 239,400; PCT publication WO 91/09967; U.S. Patent Nos. 5,225,539; 5,530,101; and 
5,585,089), veneering or resurfacing (EP 592,106; EP 519,596; Padlan, Molecular Immunology 
28(4/5):489-498 (1991); Studnicka et al., Protein Engineering 7(6):805-814 (1994); Roguska. et 

35 al., PNAS 91:969-973 (1994)), and chain shuffling (U.S. Patent No. 5,565,332). 

Completely human antibodies are particularly desirable for therapeutic treatment of human 
patients. Human antibodies can be made by a variety of methods known in the art including phage 
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display methods described above using antibody libraries derived from human immunoglobulin 
sequences. See also, U.S. Patent Nos. 4,444,887 and 4,716,111; and PCT publications WO 
98/46645, WO 98/50433, WO 98/24893, WO 98/16654, WO 96/34096, WO 96/33735, and WO 
91/10741; each of which is incorporated herein by reference in its entirety. 
5 Human antibodies can also be produced using transgenic mice which are incapable of 

expressing functional endogenous immunoglobulins, but which can express human 
immunoglobulin genes. For example, the human heavy and light chain immunoglobulin gene 
complexes may be introduced randomly or by homologous recombination into mouse embryonic 
stem cells. Alternatively, the human variable region, constant region, and diversity region may be 

10 introduced into mouse embryonic stem cells in addition to the human heavy and light chain genes. 
The mouse heavy and light chain immunoglobulin genes may be rendered non-functional 
separately or simultaneously with the introduction of human immunoglobulin loci by homologous 
recombination. In particular, homozygous deletion of the JH region prevents endogenous antibody 
production. The modified embryonic stem cells are expanded and microinjected into blastocysts 

15 to produce cliimeric mice. The chimeric mice are then bred to produce homozygous offspring 
which express human antibodies. The transgenic mice are immunized in the normal fashion with a 
selected antigen, e.g., all or a portion of a polypeptide of the invention. Monoclonal antibodies 
directed against the antigen can be obtained from the immunized, transgenic mice using 
conventional hybridoma technology. The human immunoglobulin transgenes harbored by the 

20 transgenic mice rearrange during B cell differentiation, and subsequently undergo class switching 
and somatic mutation. Thus, using such a technique, it is possible to produce therapeutically 
useful IgG, IgA, IgM and IgE antibodies. For an overview of this technology for producing 
human antibodies, see Lonberg and Huszar, Int. Rev. Immunol. 13:65-93 (1995). For a detailed 
discussion of this technology for producing human antibodies and human monoclonal antibodies 

25 and protocols for producing such antibodies, see, e.g., PCT publications WO 98/24893; WO 
92/01047; WO 96/34096; WO 96/33735; European Patent No. 0 598 877; U.S. Patent Nos. 
5,413,923; 5,625,126; 5,633,425; 5,569,825; 5,661,016; 5,545,806; 5,814,318; 5,885,793; 
5,916,771; 5,939,598; 6,075,181; and 6,114,598, which are incoiporated by reference herein in 
their entirety. In addition, companies such as Abgenix, Inc. (Freemont, CA) and Genpharm (San 

30 Jose, CA) can be engaged to provide human antibodies directed against a selected antigen using 
technology similar to that described above. 

Completely human antibodies which recognize a selected epitope can be generated using a 
technique referred to as "guided selection." In this approach a selected non-human monoclonal 
antibody, e.g., a mouse antibody, is used to guide the selection of a completely human antibody 

35 recognizing the same epitope. (Jespers et al., Bio/technology 12:899-903 (1988)). 

Further, antibodies to the polypeptides of the invention can, in turn, be utilized to generate 
anti-idiotype antibodies that "mimic" polypeptides of the invention using techniques well known 
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to those skilled in the art. (See, e.g., Greenspan & Bona, FASEB J. 7(5):437^44; (1989) and 
Nissinoff, J. Immunol. 147(8):2429-2438 (1991)). For example, antibodies which bind to and 
competitively inhibit polypeptide multimerization and/or binding of a polypeptide of the invention 
to a ligand can be used to generate anti-idiotypes that "mimic" the polypeptide multimerization 
5 and/or binding domain and, as a consequence, bind to and neutralize polypeptide and/or its ligand. 
Such neutralizing anti-idiotypes or Fab fragments of such anti-idiotypes can be used in therapeutic 
regimens to neutralize polypeptide ligand(s)/receptor(s). For example, such anti-idiotypic 
antibodies can be used to bind a polypeptide of the invention and/or to bind its 
ligand(s)/receptor(s), and thereby block its biological activity. Alternatively, antibodies which 

10 bind to and enhance polypeptide multimerization and/or binding, and/or receptor/ligand 
multimerization, binding and/or signaling can be used to generate anti-idiotypes that function as 
agonists of a polypeptide of the invention and/or its ligand/receptor. Such agonistic anti-idiotypes 
or Fab fragments of such anti-idiotypes can be used in therapeutic regimens as agonists of the 
polypeptides of the invention or its ligand(s)/receptor(s). For example, such anti-idiotypic 

15 antibodies can be used to bind a polypeptide of the invention and/or to bind its 
ligand(s)/receptor(s), and thereby promote or enhance its biological activity. 

Intrabodies of the invention can be produced using methods known in the art, such as 
those disclosed and reviewed in Chen et al., Hum. Gene Then 5:595-601 (1994); Marasco, W.A., 
Gene Then 4:11-15 (1997); Rondon and Marasco, Annu. Rev. Microbiol. 51:257-283 (1997); 

20 Proba et al., J. Mol. Biol. 275:245-253 (1998); Cohen et al., Oncogene 17:2445-2456 (1998); 
Ohage and Steipe, J. Mol. Biol. 291:1119-1128 (1999); Ohage et al., J. Mol. Biol. 291:1129-1134 
(1999); Wirtz and Steipe, Protein Sci. 8:2245-2250 (1999); Zhu et al., J. Immunol. Methods 
231:207-222 (1999); and references cited therein. 

25 Polynucleotides Encoding Antibodies 

The invention further provides polynucleotides comprising a nucleotide sequence 
encoding an antibody of the invention and fragments thereof. The invention also encompasses 
polynucleotides that hybridize under stringent or alternatively, under lower stringency 
hybridization conditions, e.g., as defined supra, to polynucleotides that encode an antibody, 

30 preferably, that specifically binds to a polypeptide of the invention, preferably, an antibody that 
binds to a polypeptide having the amino acid sequence of SEQ ID NO: Y, to a polypeptide encoded 
by a portion of SEQ ID NO:X as defined in columns 8 and 9 of Table 2, and/or to a polypeptide 
encoded by the cDNA contained in ATCC Deposit No:Z. 

The polynucleotides may be obtained, and the nucleotide sequence of the polynucleotides 

35 determined, by any method known in the art. For example, if the nucleotide sequence of the 
antibody is known, a polynucleotide encoding the antibody may be assembled from chemically 
synthesized oligonucleotides (e.g., as described in Kutmeier et al., BioTechniques 17:242 (1994)), 
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which, briefly, involves the synthesis of overlapping oligonucleotides containing portions of the 
sequence encoding the antibody, annealing and ligating of those oligonucleotides, and then 
amplification of the ligated oligonucleotides by PCR. 

Alternatively, a polynucleotide encoding an antibody may be generated from nucleic acid 
5 from a suitable source. If a clone containing a nucleic acid encoding a particular antibody is not 
available, but the sequence of the antibody molecule is known, a nucleic acid encoding the 
immunoglobulin may be chemically synthesized or obtained from a suitable source (e.g., an 
antibody cDNA library, or a cDNA library generated from, or nucleic acid, preferably poly A+ 
RNA, isolated from, any tissue or cells expressing the antibody, such as hybridoma cells selected 

10 to express an antibody of the invention) by PCR amplification using synthetic primers 
hybridizable to the 3' and 5' ends of the sequence or by cloning using an oligonucleotide probe 
specific for the particular gene sequence to identify, e.g., a cDNA clone from a cDNA library that 
encodes the antibody. Amplified nucleic acids generated by PCR may then be cloned into 
replicable cloning vectors using any method well known in the art. 

15 Once the nucleotide sequence and corresponding amino acid sequence of the antibody is 

determined, the nucleotide sequence of the antibody may be manipulated using methods well 
known in the art for the manipulation of nucleotide sequences, e.g., recombinant DNA techniques, 
site directed mutagenesis, PCR, etc. (see, for example, the techniques described in Sambrook et 
aL, 1990, Molecular Cloning, A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory, Cold 

20 Spring Harbor, NY and Ausubel et aL, eds., 1998, Current Protocols in Molecular Biology, John 
Wiley & Sons, NY, which are both incorporated by reference herein in their entireties ), to 
generate antibodies having a different amino acid sequence, for example to create amino acid 
substitutions, deletions, and/or insertions. 

In a specific embodiment, the amino acid sequence of the heavy and/or light chain 

25 variable domains may be inspected to identify the sequences of the complementarity determining 
regions (CDRs) by methods that are well know in the art, e.g., by comparison to known amino 
acid sequences of other heavy and light chain variable regions to determine the regions of 
sequence hypervariability. Using routine recombinant DNA techniques, one or more of the CDRs 
may be inserted within framework regions, e.g., into human framework regions to humanize a 

30 non-human antibody, as described supra. The framework regions may be naturally occurring or 
consensus framework regions, and preferably human framework regions (see, e.g., Chothia et aL, 
J. Mol. Biol. 278: 457-479 (1998) for a listing of human framework regions). Preferably, the 
polynucleotide generated by the combination of the framework regions and CDRs encodes an 
antibody that specifically binds a»polypeptide of the invention. Preferably, as discussed supra, 

35 one or more amino acid substitutions may be made within the framework regions, and, preferably, 
the amino acid substitutions improve binding of the antibody to its antigen. Additionally, such 
methods may be used to make amino acid substitutions or deletions of one or more variable region 
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cysteine residues participating in an intrachain disulfide bond to generate antibody molecules 
lacking one or more intrachain disulfide bonds. Other alterations to the polynucleotide are 
encompassed by the present invention and within the skill of the art. 

In addition, techniques developed for the production of "chimeric antibodies" (Morrison et 

5 al., Proc. Natl. Acad. Sci. 81:851-855 (1984); Neuberger et al., Nature 312:604-608 (1984); 
Takeda et al., Nature 314:452-454 (1985)) by splicing genes from a mouse antibody molecule of 
appropriate antigen specificity together with genes from a human antibody molecule of 
appropriate biological activity can be used. As described supra, a chimeric antibody is a molecule 
in which different portions are derived from different animal species, such as those having a 

10 variable region derived from a murine mAb and a human immunoglobulin constant region, e.g., 
humanized antibodies. 

Alternatively, techniques described for the production of single chain antibodies (U.S. 
Patent No. 4,946,778; Bird, Science 242:423- 42 (1988); Huston et al., Proc. Natl. Acad. Sci. USA 
85:5879-5883 (1988); and Ward et al., Nature 334:544-54 (1989)) can be adapted to produce 
15 single chain antibodies. Single chain antibodies are formed by linking the heavy and light chain 
fragments of the Fv region via an amino acid bridge, resulting in a single chain polypeptide. 
Techniques for the assembly of functional Fv fragments in E. coli may also be used (Skerra et al., 
Science 242: 1038- 1041 (1988)). 

20 Methods of Producing Antibodies 

The antibodies of the invention can be produced by any method known in the art for the 
synthesis of antibodies, in particular, by chemical synthesis or preferably, by recombinant 
expression techniques. Methods of producing antibodies include, but are not limited to, hybridoma 
technology, EBV transformation, and other methods discussed herein as well as through the use 

25 recombinant DNA technology, as discussed below. 

Recombinant expression of an antibody of the invention, or fragment, derivative or analog 
thereof, (e.g., a heavy or light chain of an antibody of the invention or a single chain antibody of 
the invention), requires construction of an expression vector containing a polynucleotide that 
encodes the antibody. Once a polynucleotide encoding an antibody molecule or a heavy or light 

30 chain of an antibody, or portion thereof (preferably containing the heavy or light chain variable 
domain), of the invention has been obtained, the vector for the production of the antibody 
molecule may be produced by recombinant DNA technology using techniques well known in the 
art. Thus, methods for preparing a protein by expressing a polynucleotide containing an antibody 
encoding nucleotide sequence are described herein. Methods which are well known to those 

35 skilled in the art can be used to construct expression vectors containing antibody coding 
sequences and appropriate transcriptional and translational control signals. These methods 
include, for example, in vitro recombinant DNA techniques, synthetic techniques, and in vivo 
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genetic recombination. The invention, thus, provides replicable vectors comprisiDg a nucleotide 
sequence encoding an antibody molecule of the invention, or a heavy or light chain thereof, or a 
heavy or light chain variable domain, operably linked to a promoter. Such vectors may include 
the nucleotide sequence encoding the constant region of the antibody molecule (see, e.g., PCT 
5 Publication WO 86/05807; PCT Publication WO 89/01036; and U.S. Patent No. 5,122,464) and 
the variable domain of the antibody may be cloned into such a vector for expression of the entire 
heavy or light chain. 

The expression vector is transferred to a host cell by conventional techniques and the 
transfected cells are then cultured by conventional techniques to produce an antibody of the 

10 invention. Thus, the invention includes host cells containing a polynucleotide encoding an 
antibody of the invention, or a heavy or light chain thereof, or a single chain antibody of the 
invention, operably linked to a heterologous promoter. In preferred embodiments for the 
expression of double-chained antibodies, vectors encoding both the heavy and light chains may be 
co-expressed in the host cell for expression of the entire immunoglobulin molecule, as detailed 

15 below. 

A variety of host-expression vector systems may be utilized to express the antibody 
molecules of the invention. Such host-expression systems represent vehicles by which the coding 
sequences of interest may be produced and subsequently purified, but also represent cells which 
may, when transformed or transfected with the appropriate nucleotide coding sequences, express 

20 an antibody molecule of the invention in situ. These include but are not limited to 
microorganisms such as bacteria (e.g., E. coli, B. subtilis) transformed with recombinant 
bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors containing antibody 
coding sequences; yeast (e.g., Saccharomyces, Pichia) transformed with recombinant yeast 
expression vectors containing antibody coding sequences; insect cell systems infected with 

25 recombinant virus expression vectors (e.g., baculovirus) containing antibody coding sequences; 
plant cell systems infected with recombinant virus expression vectors (e.g., cauliflower mosaic 
virus, CaMV; tobacco mosaic virus, TMV) or transformed with recombinant plasmid expression 
vectors (e.g., Ti plasmid) containing antibody coding sequences; or mammalian cell systems (e.g., 
COS, CHO, BHK, 293, 3T3 cells) harboring recombinant expression constructs containing 

30 promoters derived from the genome of mammalian cells (e.g., metallothionein promoter) or from 
mammalian viruses (e.g., the adenovirus late promoter; the vaccinia virus 7.5K promoter). 
Preferably, bacterial cells such as Escherichia coli, and more preferably, eukaryotic cells, 
especially for the expression of whole recombinant antibody molecule, are used for the expression 
of a recombinant antibody molecule. For example, mammalian cells such as Chinese hamster 

35 ovary cells (CHO), in conjunction with a vector such as the major intermediate early gene 
promoter element from human cytomegalovirus is an effective expression system for antibodies 
(Foecking et al., Gene 45:101 (1986); Cockett et al., Bio/Technology 8:2 (1990)). 
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In bacterial systems, a number of expression vectors may be advantageously selected 
depending upon the use intended for the antibody molecule being expressed. For example, when 
a large quantity of such a protein is to be produced, for the generation of pharmaceutical 
compositions of an antibody molecule, vectors which direct the expression of high levels of fusion 
5 protein products that are readily purified may be desirable. Such vectors include, but are not 
limited, to the E. coli expression vector pUR278 (Ruther et al., EMBO J. 2:1791 (1983)), in which 
the antibody coding sequence may be ligated individually into the vector in frame with the lac Z 
coding region so that a fusion protein is produced; pIN vectors (Inouye & Inouye, Nucleic Acids 
Res. 13:3101-3109 (1985); Van Heeke & Schuster, J. Biol. Chem. 24:5503-5509 (1989)); and the 

10 like. pGEX vectors may also be used to express foreign polypeptides as fusion proteins with 
glutathione S-transferase (GST). In general, such fusion proteins are soluble and can easily be 
purified from lysed cells by adsorption and binding to matrix glutathione-agarose beads followed 
by elution in the presence of free glutathione. The pGEX vectors are designed to include 
thrombin or factor Xa protease cleavage sites so that the cloned target gene product can be 

1 5 released from the GST moiety. 

In an insect system, Autographa californica nuclear polyhedrosis virus (AcNPV) is used as 
a vector to express foreign genes. The virus grows in Spodoptera frugiperda cells. The antibody 
coding sequence may be cloned individually into non-essential regions (for example the 
polyhedrin gene) of the virus and placed under control of an AcNPV promoter (for example the 

20 polyhedrin promoter). 

In mammalian host cells, a number of viral-based expression systems may be utilized. In 
cases where an adenovirus is used as an expression vector, the antibody coding sequence of 
interest may be ligated to an adenovirus transcription/translation control complex, e.g., the late 
promoter and tripartite leader sequence. This chimeric gene may then be inserted in the 

25 adenovirus genome by in vitro or in vivo recombination. Insertion in a non- essential region of the 
viral genome (e.g., region El or E3) will result in a recombinant virus that is viable and capable of 
expressing the antibody molecule in infected hosts, (e.g., see Logan & Shenk, Proc. Natl. Acad. 
Sci. USA 81:355-359 (1984)). Specific initiation signals may also be required for efficient 
translation of inserted antibody coding sequences. These signals include the ATG initiation codon 

30 and adjacent sequences. Furthermore, the initiation codon must be in phase with the reading 
frame of the desired coding sequence to ensure translation of the entire insert These exogenous 
translational control signals and initiation codons can be of a variety of origins, both natural and 
synthetic. The efficiency of expression may be enhanced by the inclusion of appropriate 
transcription enhancer elements, transcription terminators, etc. (see Bittner et al., Methods in 

35 Enzymol. 153:51-544 (1987)). 

In addition, a host cell strain may be chosen which modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific fashion desired. 
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Such modifications (e.g., glycosylation) and processing (e.g., cleavage) of protein products may be 
important for the function of the protein. Different host cells have characteristic and specific 
mechanisms for the post-translational processing and modification of proteins and gene products. 
Appropriate cell lines or host systems can be chosen to ensure the correct modification and 
5 processing of the foreign protein expressed. To this end, eukaryotic host cells which possess the 
cellular machinery for proper processing of the primary transcript, glycosylation, and 
phosphorylation of the gene product may be used. Such mammalian host cells include but are not 
limited to CHO, VERY, BHK, Hela, COS, MDCK, 293, 3T3, WI38, and in particular, breast 
cancer cell lines such as, for example, BT483, Hs578T, HTB2, BT20 and T47D, and normal 

10 mammary gland cell line such as, for example, CRL7030 and Hs578Bst. 

For long-term, high-yield production of recombinant proteins, stable expression is 
preferred. For example, cell lines which stably express the antibody molecule may be engineered. 
Rather than using expression vectors which contain viral origins of replication, host cells can be 
transformed with DNA controlled by appropriate expression control elements (e.g., promoter, 

15 enhancer, sequences, transcription terminators, polyadenylation sites, etc.), and a selectable 
marker. Following the introduction of the foreign DNA, engineered cells may be allowed to grow 
for 1-2 days in an enriched media, and then are switched to a selective media. The selectable 
marker in the recombinant plasmid confers resistance to the selection and allows cells to stably 
integrate the plasmid into their chromosomes and grow to form foci which in turn can be cloned 

20 and expanded into cell lines. This method may advantageously be used to engineer cell lines 
which express the antibody molecule. Such engineered cell lines may be particularly useful in 
screening and evaluation of compounds that interact directly or indirectly with the antibody 
molecule. 

A number of selection systems may be used, including but not limited to the herpes 
25 simplex virus thymidine kinase (Wigler et al., Cell 11:223 (1977)), hypoxanthine-guanine 
phosphoribosyltransferase (Szybalska & Szybalski, Proc. Natl. Acad. Sci. USA 48:202 (1992)), 
and adenine phosphoribosyltransferase (Lowy et al., Cell 22:817 (1980)) genes can be employed 
in tk-, hgprt- or aprt- cells, respectively. Also, antimetabolite resistance can be used as the basis of 
selection for the following genes: dhfr, which confers resistance to methotrexate (Wigler et al., 
30 Natl. Acad. Sci. USA 77:357 (1980); OHare et al., Proc. Natl. Acad. Sci. USA 78:1527 (1981)); 
gpt, which confers resistance to mycophenolic acid (Mulligan & Berg, Proc. Natl. Acad. Sci. USA 
78:2072 (1981)); neo, which confers resistance to the aminoglycoside G-418 Clinical Pharmacy 
12:488-505; Wu and Wu, Biotherapy 3:87-95 (1991); Tolstoshev, Ann. Rev. Pharmacol. Toxicol. 
32:573-596 (1993); Mulligan, Science 260:926-932 (1993); and Morgan and Anderson, Ann. Rev. 
35 Biochera 62:191-217 (1993); May, 1993, TIB TECH 11(5):155-215 (1993)); and hygro, which 
confers resistance tohygromycin (Santerre et al., Gene 30:147 (1984)). Methods commonly 
known in the art of recombinant DNA technology may be routinely applied to select the desired 

566 



t 

I 

WO 02/076488 PCT/US02/08276 

recombinant clone, and such methods are described, for example, in Ausubel et al. (eds.)» Current 
Protocols in Molecular Biology, John Wiley & Sons, NY (1993); Kriegler, Gene Transfer and 
Expression, A Laboratory Manual, Stockton Press, NY (1990); and in Chapters 12 and 13, 
Dracopoli et al. (eds), Current Protocols in Human Genetics, John Wiley & Sons, NY (1994); 
5 Colberre-Gaiapin et al., J. Mol. Biol. 150:1 (1981), which are incorporated by reference herein in 
their entireties. 

The expression levels of an antibody molecule can be increased by vector amplification 
(for a review, see Bebbington and Hentschel, The use of vectors based on gene amplification for 
the expression of cloned genes in mammalian cells in DNA cloning, VoL3. (Academic Press, New 

10 York, 1987)). When a marker in the vector system expressing antibody is amplifiable, increase in 
the level of inhibitor present in culture of host cell will increase the number of copies of the 
marker gene. Since the amplified region is associated with the antibody gene, production of the 
antibody will also increase (Crouse et al., Mol. Cell. Biol. 3:257 (1983)). 

Vectors which use glutamine synthase (GS) or DHFR as the selectable markers can be 

15 amplified in the presence of the drugs methionine sdphoximine or methotrexate, respectively. An 
advantage of glutamine synthase based vectors are the availabilty of cell lines (e.g., the murine 
myeloma cell line, NS0) which are glutamine synthase negative. Glutamine synthase expression 
systems can also function in glutamine synthase expressing cells (e.g. Chinese Hamster Ovary 
(CHO) cells) by providing additional inhibitor to prevent the functioning of the endogenous gene. 

20 A glutamine synthase expression system and components thereof are detailed in PCT publications: 
WO87/04462; WO86/05807; WO89/01036; WO89/10404; and WO91/06657 which are 
incorporated in their entireties by reference herein. Additionally, glutamine synthase expression 
vectors that may be used according to the present invention are commercially available from 
suplliers, including, for example Lonza Biologies, Inc. (Portsmouth, NH). Expression and 

25 production of monoclonal antibodies using a GS expression system in murine myeloma cells is 
described in Bebbington et al., Bio/technology 10:169(1992) and in Biblia and Robinson 
Biotechnol Prog. 11:1 (1995) which are incorporated in their entirities by reference herein. 

The host cell may be co-transfected with two expression vectors of the invention, the first 
vector encoding a heavy chain derived polypeptide and the second vector encoding a light chain 

30 derived polypeptide. The two vectors may contain identical selectable markers which enable equal 
expression of heavy and light chain polypeptides. Alternatively, a single vector may be used 
which encodes, and is capable of expressing, both heavy and light chain polypeptides. In such 
situations, the light chain should be placed before the heavy chain to avoid an excess of toxic free 
heavy chain (Proudfoot, Nature 322:52 (1986); Kohler, Proc. Natl. Acad. Sci. USA 77:2197 

35 (1980)). The coding sequences for the heavy and light chains may comprise cDNA or genomic 
DNA. 



567 



WO 02/076488 PCT/US02/08276 

Once an antibody molecule of the invention has been produced by an animal, chemically 
synthesized, or recombinantly expressed, it may be purified by any method known in the art for 
purification of an immunoglobulin molecule, for example, by chromatography (e.g., ion exchange, 
affinity, particularly by affinity for the specific antigen after Protein A, and sizing column 
5 chromatography), centrifugation, differential solubility, or by any other standard technique for the 
purification of proteins. In addition, the antibodies of the present invention or fragments thereof 
can be fused to heterologous polypeptide sequences described herein or otherwise known in the 
art, to facilitate purification. 

The present invention encompasses antibodies recombinantly fused or chemically 

10 conjugated (including both covalently and non-covalently conjugations) to a polypeptide (or 
portion thereof, preferably at least 10, 20, 30, 40, 50, 60, 70, 80, 90 or 100 amino acids of the 
polypeptide) of the present invention to generate fusion proteins. The fusion does not necessarily 
need to be direct, but may occur through linker sequences. The antibodies may be specific for 
antigens other than polypeptides (or portion thereof, preferably at least 10, 20, 30, 40, 50, 60, 70, 

15 80, 90 or 100 amino acids of the polypeptide) of the present invention. For example, antibodies 
may be used to target the polypeptides of the present invention to particular cell types, either in 
vitro or in vivo, by fusing or conjugating the polypeptides of the present invention to antibodies 
specific for particular cell surface receptors. Antibodies fused or conjugated to the polypeptides of 
the present invention may also be used in in vitro immunoassays and purification methods using 

20 methods known in the art. See e.g., Harbor et aL, supra, and PCT publication WO 93/21232; EP 
439,095; Naramura et aL, Immunol. Lett. 39:91-99 (1994); U.S. Patent 5,474,981; Gillies et aL, 
PNAS 89:1428-1432 (1992); Fell et aL, J. Immunol. 146:2446-2452 (1991), which are 
incorporated by reference in their entireties. 

The present invention further includes compositions comprising the polypeptides of the 

25 present invention fused or conjugated to antibody domains other than the variable regions. For 
example, the polypeptides of the present invention may be fused or conjugated to an antibody Fc 
region, or portion thereof. The antibody portion fused to a polypeptide of the present invention 
may comprise the constant region, hinge region, CHI domain, CH2 domain, and CH3 domain or 
any combination of whole domains or portions thereof. The polypeptides may also be fused or 

30 conjugated to the above antibody portions to form multimers. For example, Fc portions fused to 
the polypeptides of the present invention can form dimers through disulfide bonding between the 
Fc portions. Higher multimeric forms can be made by fusing the polypeptides to portions of IgA 
and IgM. Methods for fusing or conjugating the polypeptides of the present invention to antibody 
portions are known in the art. See, e.g., U.S. Patent Nos. 5,336,603; 5,622,929; 5,359,046; 

35 5,349,053; 5,447,851; 5,112,946; EP 307,434; EP 367,166; PCT publications WO 96/04388; WO 
91/06570; Ashkenazi et aL, Proc. Natl. Acad. Sci. USA 88:10535-10539 (1991); Zheng et aL, J. 
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Immunol. 154:5590-5600 (1995); and Vil et al., Proc. Nad. Acad. Sci. USA 89:11337- 11341 
(1992) (said references incorporated by reference in their entireties). 

As discussed, supra, the polypeptides corresponding to a polypeptide, polypeptide 
fragment, or a variant of SEQ ID NO: Y may be fused or conjugated to the above antibody portions 
to increase the in vivo half life of the polypeptides or for use in immunoassays using methods 
known in the art. Further, the polypeptides corresponding to SEQ ID NO:Y may be fused or 
conjugated to the above antibody portions to facilitate purification. One reported example 
describes chimeric proteins consisting of the first two domains of the human CD4~polypeptide and 
various domains of the constant regions of the heavy or light chains of mammalian 
immunoglobulins. See EP 394,827; and Traunecker et al., Nature 331:84-86 (1988). The 
polypeptides of the present invention fused or conjugated to an antibody having disulfide- linked 
dimeric structures (due to the IgG) may also be more efficient in binding and neutralizing other 
molecules, than the monomelic secreted protein or protein fragment alone. See, for example, 
Fountoulakis et al., J. Biochem. 270:3958-3964 (1995). In many cases, the Fc part in a fusion 
protein is beneficial in therapy and diagnosis, and thus can result in, for example, improved 
pharmacokinetic properties. See, for example, EP A 232,262. Alternatively, deleting the Fc part 
after the fusion protein has been expressed, detected, and purified, would be desired. For example, 
the Fc portion may hinder therapy and diagnosis if the fusion protein is used as an antigen for 
immunizations. In drug discovery, for example, human proteins, such as hIL-5, have been fused 
with Fc portions for the purpose of high-throughput screening assays to identify antagonists of 
hIL-5. (See, Bennett et al, J. Molecular Recognition 8:52-58 (1995); Johanson et al., J. Biol. 
Chenx 270:9459-9471 (1995)). 

Moreover, the antibodies or fragments thereof of the present invention can be fused to 
marker sequences, such as a peptide to facilitate purification. In preferred embodiments, the 
marker amino acid sequence is a hexa-histidine peptide, such as the tag provided in a pQE vector 
(QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311), among others, many of which are 
commercially available. As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 
(1989), for instance, hexa-histidine provides for convenient purification of the fusion protein. 
Other peptide tags useful for purification include, but are not limited to, the "HA" tag, which 
corresponds to an epitope derived from the influenza hemagglutinin protein (Wilson et al., Cell 
37:767 (1984)) and the "flag" tag. 

The present invention further encompasses antibodies or fragments thereof conjugated to a 
diagnostic or therapeutic agent. The antibodies can be used diagnostically to, for example, 
monitor the development or progression of a tumor as part of a clinical testing procedure to, e.g., 
determine the efficacy of a given treatment regimen. Detection can be facilitated by coupling the 
antibody to a detectable substance. Examples of detectable substances include various enzymes, 
prosthetic groups, fluorescent materials, luminescent materials, bioluminescent materials, 
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radioactive materials, positron emitting metals using various positron emission tomographies, and 
nonradioactive paramagnetic metal ions. The detectable substance may be coupled or conjugated 
either directly to the antibody (or fragment thereof) or indirectly, through an intermediate (such as, 
for example, a linker known in the art) using techniques known in the art. See, for example, U.S. 
5 Patent No. 4,741,900 for metal ions which can be conjugated to antibodies for use as diagnostics 
according to the present invention. Examples of suitable enzymes include horseradish peroxidase, 
alkaline phosphatase, beta-galactosidase, or acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidWbiotin and avidin/biotin; examples of suitable fluorescent 
materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 

10 dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent 
material includes lunoinol; examples of biolurninescent materials include luciferase, luciferin, and 
aequorin; and examples of suitable radioactive material include 1251, 1311, 1 1 lln or 99Tc. 

Further, an antibody or fragment thereof may be conjugated to a therapeutic moiety such 
as a cytotoxin, e.g., a cytostatic or cytocidal agent, a therapeutic agent or a radioactive metal ion, 

15 e.g., alpha-emitters such as, for example, 213Bi. A cytotoxin or cytotoxic agent includes any 
agent that is detrimental to cells. Examples include paclitaxol, cytochalasin B, gramicidin D, 
ethidium bromide, emetine, mitomycin, etoposide, tenoposide, vincristine, vinblastine, colchicin, 
doxorubicin, daunorubicin, dihydroxy anthracin dione, mitoxantrone, nnthramycin, actinomycin 
D, 1-dehydrotestosterone, glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and 

20 puromycin and analogs or homologs thereof. Therapeutic agents include, but are not limited to, 
antimetabolites (e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil 
decarbazine), alkylating agents (e.g., mechloretharnine, thioepa chlorambucd, melphalan, 
carmustine (BSNU) and lornustine (CCNU), cyclothosphamide, busulfan, dibromomannitol, 
streptozotocin, mitomycin C, and cis- dicUorcwiiarnine platinum (H) (DDP) cisplatin), 

25 anthracyclines (e.g., daunorubicin (formerly daunomycin) and doxorubicin), antibiotics (e.g., 
dactinomycin (formerly actinomycin), bleomycin, mithramycin, and anthramycin (AMC)), and 
anti-mitotic agents (e.g., vincristine and vinblastine). 

The conjugates of the invention can be used for modifying a given biological response, the 

■ 

therapeutic agent or drug moiety is not to be construed as Hmited to classical chemical therapeutic 
30 agents. For example, the drug moiety may be a protein or polypeptide possessing a desired 
biological activity. Such proteins may include, for example, a toxin such as abrin, ricin A, 
pseudomonas exotoxin, or diphtheria toxin; a protein such as tumor necrosis factor, a-interferon, 
8-interferon, nerve growth factor, platelet derived growth factor, tissue plasrninogen activator, an 
apoptotic agent, e.g., TNF-alpha, TNF-beta, AIM I (See, International Publication No. WO 
35 97/33899), AIM H (See, Intemational Publication No. WO 97/3491 1), Fas Ligand (Takahashi et 
at, Int. Immunol, 6:1567-1574 (1994)), VEGI (See, Intemational Publication No. WO 99/23105), 
a thrombotic agent or an anti- angiogenic agent, e.g., angiostatin or endostatin; or, biological 
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response modifiers such as, for example, lymphokines, interleukin-1 ("IL-l"), interleukin-2 ("IL- 
2"), interleukin-6 ( n IL-6 n ), granulocyte macrophage colony stimulating factor ("GM-CSF"), 
granulocyte colony stimulating factor ("G-CSF"), or other growth factors. 

Antibodies may also be attached to solid supports, which are particularly useful for 
5 immunoassays or purification of the target antigen. Such solid supports include, but are not 
limited to, glass, cellulose, polyacrylamide, nylon, polystyrene, polyvinyl chloride or 
polypropylene. 

Techniques for conjugating such therapeutic moiety to antibodies are well known. See, for 
example, Arnon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer 

10 Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. (eds.), pp. 243-56 (Alan 
R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug Delivery", in Controlled Drug 
Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53 (Marcel Dekker, Inc. 1987); Thorpe, 
"Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A Review", in Monoclonal 
Antibodies '84: Biological And Clinical Applications, Pinchera et al. (eds.), pp. 475-506 (1985); 

15 "Analysis, Results, And Future Prospective Of The Therapeutic Use Of Radiolabeled Antibody In 
Cancer Therapy", in Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et al. 
(eds.), pp. 303-16 (Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic 
Properties Of Antibody-Toxin Conjugates", Immunol. Rev. 62:119-58 (1982). 

Alternatively, an antibody can be conjugated to a second antibody to form an antibody 

20 heteroconjugate as described by Segal in U.S. Patent No. 4,676,980, which is incorporated herein 
by reference in its entirety. 

An antibody, with or without a therapeutic moiety conjugated to it, administered alone or 
in combination with cytotoxic factors) and/or cytokine(s) can be used as a therapeutic. 

25 ImniunopJienotyping 

The antibodies of the invention may be utilized for immunophenotyping of cell lines and 
biological samples. Translation products of the gene of the present invention may be useful as cell- 
specific markers, or more specifically as cellular markers that are differentially expressed at 
various stages of differentiation and/or maturation of particular cell types. Monoclonal antibodies 

30 directed against a specific epitope, or combination of epitopes, will allow for the screening of 
ceUular populations expressing the marker. Various techniques can be utilized using monoclonal 
antibodies to screen for cellular populations expressing the markers), and include magnetic 
separation using antibody-coated magnetic beads, "panning" with antibody attached to a solid 
matrix (i.e., plate), and flow cytometry (See, e.g., U.S. Patent 5,985,660; and Morrison et al., Cell, 

35 96:737^9 (1999)). 

These techniques allow for the screening of particular populations of cells, such as might 
be found with hematological malignancies (i.e. mimmal residual disease (MRD) in acute leukemic 
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patients) and "non-self 1 cells in transplantations to prevent Graft-versus-Host Disease (GVHD). 
Alternatively, these techniques allow for the screening of hematopoietic stem and progenitor cells 
capable of undergoing proliferation and/or differentiation, as might be found in human umbilical 
cord blood. 

5 

Assays For Antibody Binding 

The antibodies of the invention may be assayed for imrnunospecific binding by any 
method known in the art The immunoassays which can be used include but are not limited to 
competitive and non-competitive assay systems using techniques such as western blots, 

10 radioimmunoassays, ELISA (enzyme linked immunosorbent assay), "sandwich" immunoassays, 
immunoprecipitation assays, precipitin reactions, gel diffusion precipitin reactions, 
immunodiffusion assays, agglutination assays, complement-fixation assays, immunoradiometric 
assays, fluorescent immunoassays, and protein A immunoassays, to name but a few. Such assays 
are routine and well known in the art (see, e.g., Ausubel et al, eds, 1994, Current Protocols in 

15 Molecular Biology, Vol. 1, John Wiley & Sons, Inc., New York, which is incorporated by 
reference herein in its entirety). Exemplary immunoassays are described briefly below (but are 
not intended by way of limitation). 

Immunoprecipitation protocols generally comprise lysing a population of cells in a lysis 
buffer such as RIPA buffer (1% NP-40 or Triton X- 100, 1% sodium deoxycholate, 0.1% SDS, 

20 0.15 M NaCl, 0.01 M sodium phosphate at pH 7.2, 1% Trasylol) supplemented with protein 
phosphatase and/or protease inhibitors (e.g., EDTA, PMSF, aprotinin, sodium vanadate), adding 
the antibody of interest to the cell lysate, incubating for a period of time (e.g., 1-4 hours) at 4° C, 
adding protein A and/or protein G sepharose beads to the cell lysate, incubating for about an hour 
or more at 4° C, washing the beads in lysis buffer and resuspending the beads in SDS/sample 

25 buffer. The ability of the antibody of interest to immunoprecipitate a particular antigen can be 
assessed by, e.g., western blot analysis. One of skill in the art would be knowledgeable as to the 
parameters that can be modified to increase the binding of the antibody to an antigen and decrease 
the background (e.g., pre-clearing the cell lysate with sepharose beads). For further discussion 
regarding immunoprecipitation protocols see, e.g., Ausubel et al., eds., (1994), Current Protocols 

30 in Molecular Biology, Vol. 1, John Wiley & Sons, Inc., New York, section 10.16.1. 

Western blot analysis generally comprises preparing protein samples, electrophoresis of 
the protein samples in a polyacrylarnide gel (e.g., 8%- 20% SDS-PAGE depending on the 
molecular weight of the antigen), transferring the protein sample from the polyacrylarnide gel to a 
membrane such as nitrocellulose, PVDF or nylon, blocking the membrane in blocking solution 

35 (e.g., PBS with 3% BSA or non-fat milk), washing the membrane in washing buffer (e.g., PBS- 
Tween 20), blocking the membrane with primary antibody (the antibody of interest) diluted in 
blocking buffer, washing the membrane in washing buffer, blocking the membrane with a 
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secondary antibody (which recognizes the primary antibody, e.g., an anti-human antibody) 
conjugated to an enzymatic substrate (e.g., horseradish peroxidase or alkaline phosphatase) or 
radioactive molecule (e.g., 32P or 1251) diluted in blocking buffer, washing the membrane in 
wash buffer, and detecting the presence of the antigen. One of skill in the art would be 
knowledgeable as to the parameters that can be modified to increase the signal detected and to 
reduce the background noise. For further discussion regarding western blot protocols see, e.g., 
Ausubel et al, eds, (1994), Current Protocols in Molecular Biology, Vol 1, John Wiley & Sons, 
Inc., New York, section 10.8.1. 

ELISAs comprise preparing antigen, coating the well of a 96 well microtiter plate with the 
antigen, adding the antibody of interest conjugated to a detectable compound such as an 
enzymatic substrate (e.g., horseradish peroxidase or alkaline phosphatase) to the well and 
incubating for a period of time, and detecting the presence of the antigen. In ELISAs the antibody 
of interest does not have to be conjugated to a detectable compound; instead, a second antibody 
(which recognizes the antibody of interest) conjugated to a detectable compound may be added to 
the well. Further, instead of coating the well with the antigen, the antibody may be coated to the 
well. In this case, a second antibody conjugated to a detectable compound may be added 
following the addition of the antigen of interest to the coated well. One of skill in the art would be 
knowledgeable as to the parameters that can be modified to increase the signal detected as well as 
other variations of ELISAs known in the art. For further discussion regarding ELISAs see, e.g., 
Ausubel et al, eds, (1994), Current Protocols in Molecular Biology, Vol. 1, John Wiley & Sons, 
Inc., New York, section 11.2.1. 

The binding affinity of an antibody to an antigen and the off-rate of an antibody-antigen 
interaction can be determined by competitive binding assays. One example of a competitive 
binding assay is a radioimmunoassay comprising the incubation of labeled antigen (e.g., 3H or 
1251) with the antibody of interest in the presence of increasing amounts of unlabeled antigen, and 
the detection of the antibody bound to the labeled antigen. The affinity of the antibody of interest 
for a particular antigen and the binding off-rates can be determined from the data by scatchard plot 
analysis. Competition with a second antibody can also be determined using radioimmunoassays. 
In this case, the antigen is incubated with antibody of interest conjugated to a labeled compound 
(e.g., 3H or 1251) in the presence of increasing amounts of an unlabeled second antibody. 

Antibodies of the invention may be characterized using immunocytochenristy methods on 
cells (e.g., mammalian cells, such as CHO cells) transfected with a vector enabling the expression 
of an antigen or with vector alone using techniques commonly known in the art. Antibodies that 
bind antigen transfected cells, but not vector-only transfected cells, are antigen specific. 

Therapeutic Uses 
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Table ID also provides information regarding biological activities and preferred 
therapeutic uses (i.e. see, "Preferred Indications" column) for polynucleotides and polypeptides of 
the invention (including antibodies, agonists, and/or antagonists thereof). Table ID also provides 
information regarding assays which may be used to test polynucleotides and polypeptides of the 
5 invention (including antibodies, agonists, and/or antagonists thereof) for the corresponding 
biological activities. The first column ("Gene No.") provides the gene number in the application 
for each clone identifier. The second column ("cDNA ATCC Deposit No:Z") provides the unique 
clone identifier for each clone as previously described and indicated in Table 1A, Table IB, and 
Table 1C The third column ("AA SEQ ID NO:Y") indicates the Sequence Listing SEQ ID 

10 Number for polypeptide sequences encoded by the corresponding cDNA clones (also as indicated 
in Table 1A, Table IB, and Table 2). The fourth column ("Biological Activity") indicates a 
biological activity corresponding to the indicated polypeptides (or polynucleotides encoding said 
polypeptides). The fifth column ( ,r Exemplary Activity Assay") further describes the 
corresponding biological activity and also provides mfonnation pertaining to the various types of 

15 assays which may be performed to test, demonstrate, or quantify the corresponding biological 
activity. 

The present invention is further directed to antibody-based therapies which involve 
administering antibodies of the invention to an animal, preferably a mammal, and most preferably 
a human, patient for treating one or more of the disclosed diseases, disorders, or conditions. 

20 Therapeutic compounds of the invention include, but are not limited to, antibodies of the 
invention (including fragments, analogs and derivatives thereof as described herein) and nucleic 
acids encoding antibodies of the invention (including fragments, analogs and derivatives thereof 
and anti-idiotypic antibodies as described herein). The antibodies of the invention can be used to 
detect, prevent, diagnose, prognosticate, treat, and/or ameliorate diseases, disorders or conditions 

25 associated with aberrant expression and/or activity of a polypeptide of the invention, including, but 
not limited to, gastrointestinal diseases and disorders. The treatment and/or prevention of 
gastrointestinal diseases and disorders associated with aberrant expression and/or activity of a 
polypeptide of the invention includes, but is not limited to, alleviating symptoms associated with 
gastrointestinal diseases and disorders. Antibodies of the invention may be provided in 

30 pharmaceutically acceptable compositions as known in the art or as described herein. 

In a specific and preferred embodiment, the present invention is directed to antibody- 
based therapies which involve adrninistering antibodies of the invention to an animal, preferably a 
mammal, and most preferably a human, patient for treating gastrointestinal diseases and disorders. 
Therapeutic compounds of the invention include, but are not limited to, antibodies of the 

35 invention (e.g., antibodies directed to the full length protein expressed on the cell surface of a 
mammalian cell; antibodies directed to an epitope of a polypeptide of the invention (such as, for 
example, a predicted linear epitope shown in column 7 of Table 1B.1 ; or a conformational epitope, 
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including fragments, analogs and derivatives thereof as described herein) and nucleic acids 
encoding antibodies of the invention (including fragments, analogs and derivatives thereof and 
anti-idiotypic antibodies as described herein). The antibodies of the invention can be used to 
detect, diagnose, prevent, treat, prognosticate, and/or ameliorate gastrointestinal diseases, 
disorders or conditions associated with aberrant expression and/or activity of a polypeptide of the 
invention. The treatment and/or prevention of gastrointestinal diseases, disorders, or conditions 
associated with aberrant expression and/or activity of a polypeptide of the invention includes, but 
is not limited to, alleviating symptoms associated with those diseases, disorders or conditions. 
Antibodies of the invention may be provided in pharmaceutically acceptable compositions as 
known in the art or as described herein. 

A summary of the ways in which the antibodies of the present invention may be used 
therapeutically includes binding polynucleotides or polypeptides of the present invention locally 
or systemically in the body or by direct cytotoxicity of the antibody, e.g. as mediated by 
complement (CDC) or by effector cells (ADCC). Some of these approaches are described in 
more detail below. Armed with the teachings provided herein, one of ordinary skill in the art will 
know how to use the antibodies of the present invention for diagnostic, monitoring or therapeutic 
purposes without undue experimentation. 

The antibodies of this invention may be advantageously utilized in combination with other 
monoclonal or chimeric antibodies, or with lymphokines or hematopoietic growth factors (such 
as, e.g., IL-2, EL-3 and IL-7), for example, which serve to increase the number or activity of 
effector cells which interact with the antibodies. 

The antibodies of the invention may be administered alone or in combination with other 
types of treatments (e.g., radiation therapy, chemotherapy, hormonal therapy, immunotherapy and 
anti-tumor agents). Generally, administration of products of a species origin or species reactivity 
(in the case of antibodies) that is the same species as that of the patient is preferred. Thus, in a 
preferred embodiment, human antibodies, fragments derivatives, analogs, or nucleic acids, are 
administered to a human patient for therapy or prophylaxis. 

It is preferred to use high affinity and/or potent in vivo inhibiting and/or neutralizing 
antibodies against polypeptides or polynucleotides of the present invention, fragments or regions 
thereof, for both immunoassays directed to and therapy of gastrointestinal diseases and disorders 
related to polynucleotides or polypeptides, including fragments thereof, of the present invention. 
Such antibodies, fragments, or regions, will preferably have an affinity for polynucleotides or 
polypeptides of the invention, including fragments thereof. Preferred binding affinities include 
those with a dissociation constant or Kd less than 5 X 10' 2 M, 10" 2 M, 5 X 10" 3 M, 10" 3 M, 5 X 10" 4 
M, 10- 4 M, 5 X 10" 5 M, 10" 5 M, 5 X 10* 6 M, lO^ 6 M, 5 X 10' 7 M, 10 7 M, 5 X 10' 8 M, 10" 8 M, 5 X 
10" 9 M, 10' 9 M, 5 X 10 10 M, 10 10 M,5X 10" u M, Iff" M, 5 X 10' 12 M, 10 12 M, 5 X 10 -13 M, 10" 13 
M, 5 X 10 M M, 10 14 M, 5 X 10" 15 M, and 10 15 M. 
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Gene Therapy 

In a specific embodiment, nucleic acids comprising sequences encoding antibodies or 
functional derivatives thereof, are administered to treat, inhibit or prevent a gastrointestinal 
5 disease or disorder associated with aberrant expression and/or activity of a polypeptide of the 
invention, by way of gene therapy. Gene therapy refers to therapy performed by the 
administration to a subject of an expressed or expressible nucleic acid. In this embodiment of the 
invention, the nucleic acids produce their encoded protein that mediates a therapeutic effect. 

Any of the methods for gene therapy available in the art can be used according to the 
10 present invention. Exemplary methods are described below. 

For general reviews of the methods of gene therapy, see Goldspiel et al., Clinical 
Pharmacy 12:488-505 (1993); Wu and Wu, Biotherapy 3:87-95 (1991); Tolstoshev, Ann. Rev. 
Pharmacol. Toxicol. 32:573-596 (1993); Mulligan, Science 260:926-932 (1993); and Morgan and 
Anderson, Ann. Rev. Biochem. 62:191-217 (1993); May, TTBTECH 11(5):155-215 (1993). 
15 Methods commonly known in the art of recombinant DNA technology which can be used are 
described in Ausubel et al. (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, 
NY (1993); and Kriegler, Gene Transfer and Expression, A Laboratory Manual, Stockton Press, 
NY (1990). 

In a preferred embodiment, the compound comprises nucleic acid sequences encoding an 

20 antibody, said nucleic acid sequences being part of expression vectors that express the antibody or 
fragments or chimeric proteins or heavy or light chains thereof in a suitable host. In particular, 
such nucleic acid sequences have promoters operably linked to the antibody coding region, said 
promoter being inducible or constitutive, and, optionally, tissue-specific. In another particular 
embodiment, nucleic acid molecules are used in which the antibody coding sequences and any 

25 other desired sequences are flanked by regions that promote homologous recombination at a 
desired site in the genome, thus providing for intrachromosomal expression of the antibody 
encoding nucleic acids (Koller and Smithies, Proc. Nad. Acad. Sci. USA 86:8932-8935 (1989); 
Zijlstra et al., Nature 342:435-438 (1989). In specific embodiments, the expressed antibody 
molecule is a single chain antibody; alternatively, the nucleic acid sequences include sequences 

30 encoding both the heavy and light chains, or fragments thereof, of the antibody. 

Delivery of the nucleic acids into a patient may be either direct, in which case the patient 
is directly exposed to the nucleic acid or nucleic acid- carrying vectors, or indirect, in which case, 
cells are first transformed with the nucleic acids in vitro, then transplanted into the patient These 
two approaches are known, respectively, as in vivo or ex vivo gene therapy. 

35 In a specific embodiment, the nucleic acid sequences are directly administered in vivo, 

where it is expressed to produce the encoded product. This can be accomplished by any of 
numerous methods known in the art, e.g., by constructing them as part of an appropriate nucleic 
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acid expression vector and administering it so that they become intracellular, e.g., by infection 
using defective or attenuated retrovirals or other viral vectors (see U.S. Patent No. 4,980,286), or 
by direct injection of naked DNA, or by use of microparticle bombardment (e.g., a gene gun; 
Biolistic, Dupont), or coating with lipids or cell-surface receptors or transfecting agents, 
5 encapsulation in liposomes, microparticles, or microcapsules, or by administering them in linkage 
to a peptide which is known to enter the nucleus, by administering it in linkage to a ligand subject 
to receptor-mediated endocytosis (see, e.g., Wu and Wu, J. Biol. Chenx 262:4429-4432 (1987)) 
(which can be used to target cell types specifically expressing the receptors), etc. In another 
embodiment, nucleic acid-ligand complexes can be formed in which the ligand comprises a 

10 fusogenic viral peptide to disrupt endosomes, allowing the nucleic acid to avoid lysosomal 
degradation. In yet another embodiment, the nucleic acid can be targeted in vivo for cell specific 
uptake and expression, by targeting a specific receptor (see, e.g., PCT Publications WO 92/06180; 
WO 92/22635; WO92/20316; W093/14188, WO 93/20221). Alternatively, the nucleic acid can 
be introduced intracellularly and incorporated within host cell DNA for expression, by 

15 homologous recombination (Koller and Smithies, Proc. Natl. Acad Sci. USA 86:8932-8935 
(1989); Zijlstra et al, Nature 342:435-438 (1989)). 

In a specific embodiment, viral vectors that contains nucleic acid sequences encoding an 
antibody of the invention are used. For example, a retroviral vector can be used (see Miller et al., 
Meth. Enzymoi. 217:581-599 (1993)). These retroviral vectors contain the components necessary 

20 for the correct packaging of the viral genome and integration into the host cell DNA. The nucleic 
acid sequences encoding the antibody to be used in gene therapy are cloned into one or more 
vectors, which facilitates delivery of the gene into a patient. More detail about retroviral vectors 
can be found in Boesen et al., Biotherapy 6:291-302 (1994), which describes the use of a 
retroviral vector to deliver the mdrl gene to hematopoietic stem cells in order to make the stem 

25 cells more resistant to chemotherapy. Other references illustrating the use of retroviral vectors in 
gene therapy are: Clowes et al., J. Clin. Invest. 93:644-651 (1994); Kiem et al., Blood 83:1467- 
1473 (1994); Salmons and Gunzberg, Human Gene Therapy 4:129-141 (1993); and Grossman and 
Wilson, Curr. Opin. in Genetics and Devel. 3:110-114 (1993). 

Adenoviruses are other viral vectors that can be used in gene therapy. Adenoviruses are 

30 especially attractive vehicles for delivering genes to respiratory epithelia. Adenoviruses naturally 
infect respiratory epithelia where they cause a mild disease. Other targets for adenovirus-based 
delivery systems are liver, the central nervous system, endothelial cells, and muscle. 
Adenoviruses have the advantage of being capable of infecting non-dividing cells. Kozarsky and 
Wilson, Current Opinion in Genetics and Development 3:499-503 (1993) present a review of 

35 adenovirus-based gene therapy. Bout et al., Human Gene Therapy 5:3-10 (1994) demonstrated 
the use of adenovirus vectors to transfer genes to the respiratory epithelia of rhesus monkeys. 
Other instances of the use of adenoviruses in gene therapy can be found in Rosenfeld et al., 
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Science 252:431-434 (1991); Rosenfeld et al., Cell 68:143- 155 (1992); Mastrangeli et al., J. Clin. 
Invest 91:225-234 (1993); PCT Publication W094/12649; and Wang, et al., Gene Therapy 2:775- 
783 (1995). In a preferred embodiment, adenovirus vectors are used. 

Adeno-associated virus (AAV) has also been proposed for use in gene therapy (Walsh et 
5 al., Proc. Soc. Exp. Biol. Med. 204:289-300 (1993); U.S. Patent No. 5,436,146). 

Another approach to gene therapy involves transferring a gene to cells in tissue culture by 
such methods as electroporation, lipofection, calcium phosphate mediated transfection, or viral 
infection. Usually, the method of transfer includes the transfer of a selectable marker to the cells. 
The cells are then placed under selection to isolate those cells that have taken up and are 

10 expressing the transferred gene. Those cells are then delivered to a patient. 

In this embodiment, the nucleic acid is introduced into a cell prior to administration in vivo 
of the resulting recombinant cell. Such introduction can be carried out by any method known in 
the art, including but not limited to transfection, electroporation, microinjection, infection with a 
viral or bacteriophage vector containing the nucleic acid sequences, cell fusion, chromosome- 

15 mediated gene transfer, microceil-mediated gene transfer, spheroplast fusion, etc. Numerous 
techniques are known in the art for the introduction of foreign genes into cells (see, e.g., LoefBer 
and Behr, Meth. Enzymol. 217:599-618 (1993)"; Cohen et al., Meth. Enzymol. 217:618-644 
(1993); Cline, Pharmac. Ther. 29:69-92m (1985) and may be used in accordance with the present 
invention, provided that the necessary developmental and physiological functions of the recipient 

20 cells are not disrupted. The technique should provide for the stable transfer of the nucleic acid to 
the cell, so that the nucleic acid is expressible by the cell and preferably heritable and expressible 
by its cell progeny. 

The resulting recombinant cells can be delivered to a patient by various methods known in 
the art. Recombinant blood cells (e.g., hematopoietic stem or progenitor cells) are preferably 
25 administered intravenously. The amount of cells envisioned for use depends on the desired effect, 
patient state, etc., and can be deteranned by one skilled in the art. 

Cells into which a nucleic acid can be introduced for purposes of gene therapy encompass 
any desired, available cell type, and include but are not limited to epithelial cells, endothelial 
cells, keratinocytes, fibroblasts, muscle cells, hepatocytes; blood cells such as T lymphocytes, B 
30 lymphocytes, monocytes, macrophages, neutrophils, eosinophils, megakaryocytes, granulocytes; 
various stem or progenitor cells, in particular hematopoietic stem or progenitor cells, e.g., as 
obtained from bone marrow, umbilical cord blood, peripheral blood, fetal liver, etc. 

In a preferred embodiment, the cell used for gene therapy is autologous to the patient 
In an embodiment in which recombinant cells are used in gene therapy, nucleic acid 
35 sequences encoding an antibody are introduced into the cells such that they are expressible by the 
cells or their progeny, and the recombinant cells are then administered in vivo for therapeutic 
effect. In a specific embodiment, stem or progenitor cells are used. Any stem and/or progenitor 
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cells which can be isolated and maintained in vitro can potentially be used in accordance with this 
embodiment of the present invention (see e.g. PCT Publication WO 94/08598; Stemple and 
Anderson, Cell 71:973-985 (1992); Rheinwald, Meth. Cell Bio. 21A:229 (1980); and Pittelkow 
and Scott, Mayo Clinic Proc. 61:771 (1986)). 
5 In a specific embodiment, the nucleic acid to be introduced for purposes of gene therapy 

comprises an inducible promoter operably linked to the coding region, such that expression of the 
nucleic acid is controllable by the presence or absence of an appropriate inducer of transcription. 

Demonstration of Therapeutic or Prophylactic Activity 

10 The compounds or pharmaceutical compositions of the invention are preferably tested in 

vitro, and then in vivo for the desired therapeutic or prophylactic activity, prior to use in humans. 
For example, in vitro assays to demonstrate the therapeutic or prophylactic utility of a compound 
or pharmaceutical composition include, the effect of a compound on a cell line or a patient tissue 
sample. The effect of the compound or composition on die cell line and/or tissue sample can be 

15 detennined utilizing techniques known to those of skill in the art including, but not limited to, 
rosette formation assays and cell lysis assays. In accordance with the invention, in vitro assays 
which can be used to detennine whether administration of a specific compound is indicated, 
include in vitro cell culture assays in which a patient tissue sample is grown in culture, and 
exposed to or otherwise administered a compound, and the effect of such compound upon the 

20 tissue sample is observed. 



Therapeutic/Prophylactic Administration and Composition 

The invention provides methods of treatment, inhibition and prophylaxis by administration 
to a subject of an effective amount of a compound or pharmaceutical composition of the invention, 
25 preferably a polypeptide or antibody of the invention. In a preferred embodiment, the compound 
is substantially purified (e.g., substantially free from substances that limit its effect or produce 
undesired side-effects). The subject is preferably an animal, including but not limited to animals 
such as cows, pigs, horses, chickens, cats, dogs, etc., and is preferably a mammal, and most 
preferably human. 

30 Formulations and methods of administration that can be employed when the compound 

comprises a nucleic acid or an immunoglobulin are described above; additional appropriate 
formulations and routes of adrninistration can be selected from among those described herein 
below. 

Various delivery systems are known and can be used to administer a compound of the 
35 invention, e.g., encapsulation in liposomes, microparticles, microcapsules, recombinant cells 
capable of expressing the compound, receptor-mediated endocytosis (see, e.g., Wu and Wu, J. 
Biol. Cbem. 262:4429-4432 (1987)), construction of a nucleic acid as part of a retroviral or other 
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vector, etc. Methods of introduction include but are not limited to intradermal, intramuscular, 
intraperitoneal, intravenous, subcutaneous, intranasal, epidural, and oral routes. The compounds 
or compositions may be administered by any convenient route, for example by infusion or bolus 
injection, by absorption through epithelial or mucocutaneous linings (e.g., oral mucosa, rectal and 

■ 

5 intestinal mucosa, etc.) and may be administered together with other biologically active agents. 
Administration can be systemic or local. In addition, it may be desirable to introduce the 
pharmaceutical compounds or compositions of the invention into the central nervous system by 
any suitable route, including intraventricular and intrathecal injection; intraventricular injection 
may be facilitated by an intraventricular catheter, for example, attached to a reservoir, such as an 

10 Ommaya reservoir. Pulmonary administration can also be employed, e.g., by use of an inhaler or 
nebulizer, and formulation with an aerosolizing agent 

In a specific embodiment, it may be desirable to administer the pharmaceutical 
compounds or compositions of the invention locally to the area in need of treatment; this may be 
achieved by, for example, and not by way of limitation, local infusion during surgery, topical 

15 application, e.g., in conjunction with a wound dressing after surgery, by injection, by means of a 
catheter, by means of a suppository, or by means of an implant, said implant being of a porous, 
non-porous, or gelatinous material, including membranes, such as sialastic membranes, or fibers. 
Preferably, when administering a protein, including an antibody, of the invention, care must be 
taken to use materials to which the protein does not absorb. 

20 In another embodiment, the compound or composition can be delivered in a vesicle, in 

particular a liposome (see Langer, Science 249:1527-1533 (1990); Treat et al., in Liposomes in 
the Therapy of Infectious Disease and Cancer, Lopez-Berestein and Fidler (eds.), Liss, New York, 
pp. 353- 365 (1989); Lopez-Berestein, ibid., pp. 317-327; see generally ibid.) 

In yet another embodiment, the compound or composition can be delivered in a controlled 

25 release system. In one embodiment, a pump may be used (see Langer, supra; Sefton, CRC Crit 
Ref. Biomed. Eng. 14:201 (1987); Buchwald et al., Surgery 88:507 (1980); Saudeket al., N. Engl. 
J. Med. 321:574 (1989)). In another embodiment, polymeric materials can be used (see Medical 
Applications of Controlled Release, Langer and Wise (eds.), CRC Pres., Boca Raton, Florida 
(1974); Controlled Drug Bioavailability, Drug Product Design and Performance, Smolen and Ball 

30 (eds.), Wiley, New York (1984); Ranger and Peppas, J., Macromol. Sci. Rev. Macromol. Chem. 
23:61 (1983); see also Levy et al., Science 228:190 (1985); During et al., Ann. Neurol. 25:351 
(1989); Howard et al., LNeurosurg. 71:105 (1989)). In yet another embodiment, a controlled 
release system can be placed in proximity of the therapeutic target, e.g., the brain, thus requiring 
only a fraction of the systemic dose (see, e.g., Goodson, in Medical Applications of Controlled 

35 Release, supra, vol. 2, pp. 115-138 (1984)). 

Other controlled release systems are discussed in the review by Langer (Science 
249:1527-1533 (1990)). 
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In a specific embodiment where the compound of the invention is a nucleic acid encoding 
a protein, the nucleic acid can be administered in vivo to promote expression of its encoded 
protein, by constructing it as part of an appropriate nucleic acid expression vector and 
administering it so that it becomes intracellular, e.g., by use of a retroviral vector (see U.S. Patent 
5 No. 4,980,286), or by direct injection, or by use of microparticle bombardment (e.g., a gene gun; 
Biolistic, Dupont), or coating with lipids or cell-surface receptors or transfecting agents, or by 
administering it in linkage to a homeobox- like peptide which is known to enter the nucleus (see 
e.g., Joliot et al., Proc. Nad. Acad. Sci. USA 88:1864-1868 (1991)), etc. Alternatively, a nucleic 
acid can be introduced intracellularly and incorporated within host cell DNA for expression, by 

10 homologous recombination. 

The present invention also provides pharmaceutical compositions. Such compositions 
comprise a therapeutically effective amount of a compound, and a pharmaceutically acceptable 
carrier. In a specific embodiment, the term "pharmaceutically acceptable" means approved by a 
regulatory agency of the Federal or a state government or listed in the U.S. Pharmacopeia or other 

15 generally recognized pharmacopeia for use in animals, and more particularly in humans. The 
term "carrier" refers to a diluent, adjuvant, excipient, or vehicle with which the therapeutic is 
administered. Such pharmaceutical carriers can be sterile liquids, such as water and oils, 
including those of petroleum, animal, vegetable or synthetic origin, such as peanut oil, soybean 
oil, mineral oil, sesame oil and the like. Water is a preferred carrier when the pharmaceutical 

20 composition is administered intravenously. Saline solutions and aqueous dextrose and glycerol 
solutions can also be employed as liquid carriers, particularly for injectable solutions. Suitable 
pharmaceutical excipients include starch, glucose, lactose, sucrose, gelatin, malt, rice, flour, chalk, 
silica gel, sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim milk, 
glycerol, propylene, glycol, water, ethanol and the like. The composition, if desired, can also 

25 contain minor amounts of wetting or emulsifying agents, or pH buffering agents. These 
compositions can take the form of solutions, suspensions, emulsion, tablets, pills, capsules, 
powders, sustained-release formulations and the like. The composition can be formulated as a 
suppository, with traditional binders and carriers such as triglycerides. Oral formulation can 
include standard carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium 

30 stearate, sodium saccharine, cellulose, magnesium carbonate, etc. Examples of suitable 
pharmaceutical carriers are described in "Remington's Pharmaceutical Sciences" by E.W. Martin. 
Such compositions will contain a therapeutically effective amount of the compound, preferably in 
purified form, together with a suitable amount of carrier so as to provide the form for proper 
administration to the patient The formulation should suit the mode of administration. 

35 In a preferred embodiment, the composition is formulated in accordance with routine 

procedures as a pharmaceutical composition adapted for intravenous administration to human 
beings. Typically, compositions for intravenous administration are solutions in sterile isotonic 
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aqueous buffer. Where necessary, the composition may also include a solubilizing agent and a 
local anesthetic such as lignocaine to ease pain at the site of the injection. Generally, the 
ingredients are supplied either separately or mixed together in unit dosage form, for example, as a 
dry lyophilized powder or water free concentrate in a hermetically sealed container such as an 
5 ampoule or sachette indicating the quantity of active agent. Where the composition is to be 
administered by infusion, it can be dispensed with an infusion bottle containing sterile 
pharmaceutical grade water or saline. Where the composition is administered by injection, an 
ampoule of sterile water for injection or saline can be provided so that the ingredients may be 
mixed prior to administration. 

10 The compounds of the invention can be formulated as neutral or salt forms. 

Pharmaceuticaily acceptable salts include those formed with anions such as those derived from 
hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., and those formed with cations such as 
those derived from sodium, potassium, ammonium, calcium, ferric hydroxides, isopropylamine, 
triethylamine, 2^thylamino ethanol, histidine, procaine, etc. 

15 The amount of the compound of the invention which will be effective in the treatment, 

inhibition and prevention of a disease or disorder associated with aberrant expression and/or 
activity of a polypeptide of the invention can be determined by standard clinical techniques. In 
addition, in vitro assays may optionally be employed to help identify optimal dosage ranges. The 
precise dose to be employed in the formulation will also depend on the route of administration, 

20 and the seriousness of the disease or disorder, and should be decided according to the judgment of 
the practitioner and each patient's circumstances. Effective doses may be extrapolated from dose- 
response curves derived from in vitro or animal model test systems. 

For antibodies, the dosage administered to a patient is typically 0.1 mg/kg to 100 mg/kg of 
the patient's body weight. Preferably, the dosage administered to a patient is between 0.1 mg/kg 

25 and 20 mg/kg of the patient's body weight, more preferably 1 mg/kg to 10 mg/kg of the patient's 
body weight Generally, human antibodies have a longer half-life within the human body than 
antibodies from other species due to the immune response to the foreign polypeptides. Thus, 
lower dosages of human antibodies and less frequent administration is often possible. Further, the 
dosage and frequency of administration of antibodies of the invention may be reduced by 

30 enhancing uptake and tissue penetration (e.g., into the brain) of the antibodies by modifications 
such as, for example, lipidation. 

The invention also provides a pharmaceutical pack or kit comprising one or more 
containers filled with one or more of the ingredients of the pharmaceutical compositions of the 
invention. Optionally associated with such container(s) can be a notice in the form prescribed by a 

35 governmental agency regulating the manufacture, use or sale of pharmaceuticals or biological 
products, which notice reflects approval by the agency of manufacture, use or sale for human 
administration. 
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Diagnosis and Imaging 

Labeled antibodies, and derivatives and analogs thereof, which specifically bind to a 
polypeptide of interest can be used for diagnostic purposes to detect, diagnose, prognosticate, or 
5 monitor gastrointestinal diseases, disorders, and/or conditions associated with the aberrant 
expression and/or activity of a polypeptide of the invention. The invention provides for the 
detection of aberrant expression of a polypeptide of interest, comprising (a) assaying the 
expression of the polypeptide of interest in cells or body fluid of an individual using one or more 
antibodies specific to the polypeptide interest and (b) comparing the level of gene expression with 

10 a standard gene expression level, whereby an increase or decrease in the assayed polypeptide gene 
expression level compared to the standard expression level is indicative of aberrant expression. 

The invention provides a diagnostic assay for diagnosing a gastrointestinal disease or 
disorder, comprising (a) assaying the expression of the polypeptide of interest in cells or body 
fluid of an individual using one or more antibodies specific to the polypeptide interest and (b) 

15 comparing the level of gene expression with a standard gene expression level, whereby an increase 
or decrease in the assayed polypeptide gene expression level compared to the standard expression 
level is indicative of a particular gastrointestional disease or disorder. With respect to 

i 

gastrointestinal cancers, the presence of a relatively high amount of transcript in biopsied tissue 

from an individual may indicate a predisposition for the development of the disease, or may 
20 provide a means for detecting the disease prior to the appearance of actual clinical symptoms. A 

more definitive diagnosis of this type may allow health professionals to employ preventative 

measures or aggressive treatment earlier thereby preventing the development or further 

progression of the gastrointestinal cancer. 

Antibodies of the invention can be used to assay protein levels in a biological sample 
25 using classical immunohistological methods known to those of skill in the art (e.g., see Jalkanen 

et aL, J. Cell. Biol. 101:976-985 (1985); Jalkanen et ah, J. Cell . Biol. 105:3087-3096 (1987)). 

Other antibody-based methods useful for detecting protein gene expression include immunoassays, 

such as the enzyme linked immunosorbent assay (ELISA) and the radioimmunoassay (RIA). 

Suitable antibody assay labels are known in the art and include enzyme labels, such as, glucose 
30 oxidase; radioisotopes, such as iodine (1251, 1211), carbon (14C), sulfur (35S), tritium (3H), 

indium (112In), and technetium (99Tc); luminescent labels, such as luminol; and fluorescent 

labels, such as fluorescein and rhcriamine, and biotin. 

One facet of the invention is the detection and diagnosis of a disease or disorder associated 

with aberrant expression of a polypeptide of interest in an animal, preferably a mammal and most 
35 preferably a human. In one embodiment, diagnosis comprises: a) administering (for example, 

parenterally, subcutaneously, or intraperitoneally) to a subject an effective amount of a labeled 

molecule which specifically binds to the polypeptide of interest; b) waiting for a time interval 
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following the administering for permitting the labeled molecule to preferentially concentrate at 
sites in the subject where the polypeptide is expressed (and for unbound labeled molecule to be 
cleared to background level); c) determining background level; and d) detecting the labeled 
molecule in the subject, such that detection of labeled molecule above the background level 
5 indicates that the subject has a particular disease or disorder associated with aberrant expression 
of the polypeptide of interest. Background level can be determined by various methods including, 
comparing the amount of labeled molecule detected to a standard value previously determined for 
a particular system. 

It will be understood in the art that the size of the subject and the imaging system used will 

10 determine the quantity of imaging moiety needed to produce diagnostic images. In the case of a 
radioisotope moiety, for a human subject, the quantity of radioactivity injected will normally range 
from about 5 to 20 millicuries of 99mTc. The labeled antibody or antibody fragment will then 
preferentially accumulate at the location of cells which contain the specific protein. In vivo tumor 
imaging is described in S.W. Burchiel et al., "Immunophanracokinetics of Radiolabeled 

15 Antibodies and Their Fragments." (Chapter 13 in Tumor Imaging: The Radiochemical Detection 
of Cancer, S.W. Burchiel and B. A. Rhodes, eds., Masson Publishing Inc. (1982)). 

Depending on several variables, including the type of label used and the mode of 
administration, the time interval foDowing the administration for permitting the labeled molecule 
to preferentially concentrate at sites in the subject and for unbound labeled molecule to be cleared 

20 to background level is 6 to 48 hours or 6 to 24 hours or 6 to 12 hours. In another embodiment the 
time interval following administration is 5 to 20 days or 5 to 10 days. 

In an embodiment, monitoring of the disease or disorder is carried out by repeating the 
method for diagnosing the disease or disease, for example, one month after initial diagnosis, six 
months after initial diagnosis, one year after initial diagnosis, etc. 

25 Presence of the labeled molecule can be detected in the patient using methods known in 

the art for in vivo scanning. These methods depend upon the type of label used. Skilled artisans 
will be able to determine the appropriate method for detecting a particular label. Methods and 
devices that may be used in the diagnostic methods of the invention include, but are not limited to, 
computed tomography (CT), whole body scan such as position emission tomography (PET), 

30 magnetic resonance imaging (MRI), and sonography. 

In a specific embodiment, the molecule is labeled with a radioisotope and is detected in 
the patient using a radiation responsive surgical instrument (Thurston et al., U.S. Patent No. 
5,441,050). In another embodiment, the molecule is labeled with a fluorescent compound and is 
detected in the patient using a fluorescence responsive scanning instrument In another 

35 embodiment, the molecule is labeled with a positron emitting metal and is detected in the patent 
using positron ernission-tornography. In yet another embodiment, the molecule is labeled with a 
paramagnetic label and is detected in a patient using magnetic resonance imaging (MRI). 
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Kits 

The present invention provides kits that can be used in the above methods. In one 
embodiment, a kit comprises an antibody of the invention, preferably a purified antibody, in one or 
5 more containers. In a specific embodiment, the kits of the present invention contain a substantially 
isolated polypeptide comprising an epitope which is specifically immunoreactive with an antibody 
included in the kit Preferably, the kits of the present invention further comprise a control 
antibody which does not react with the polypeptide of interest. In another specific embodiment, 
the kits of the present invention contain a means for detecting the binding of an antibody to a 
10 polypeptide of interest (e.g., the antibody may be conjugated to a detectable substrate such as a 
fluorescent compound, an enzymatic substrate, a radioactive compound or a luminescent 
compound, or a second antibody which recognizes the first antibody may be conjugated to a 
detectable substrate). 

In another specific embodiment of the present invention, the kit is a diagnostic kit for use 

15 in screening serum containing antibodies specific against proliferative and/or cancerous 
polynucleotides and polypeptides. Such a kit may include a control antibody that does not react 
with the polypeptide of interest. Such a kit may include a substantially isolated polypeptide 
antigen comprising an epitope which is specifically immunoreactive with at least one anti- 
polypeptide antigen antibody. Further, such a kit includes means for detecting the binding of said 

20 antibody to the antigen (e.g., the antibody may be conjugated to a fluorescent compound such as 
fluorescein or rhodamine which can be detected by flow cytometry). In specific embodiments, the 
kit may include a recombinantly produced or chemically synthesized polypeptide antigen. The 
polypeptide antigen of the kit may also be attached to a solid support. 

In a more specific embodiment the detecting means of the above-<!escribed kit includes a 

25 solid support to which said polypeptide antigen is attached. Such a kit may also include a non- 
attached reporter-labeled anti-human antibody. In this embodiment, binding of the antibody to the 
polypeptide antigen can be detected by binding of the said reporter-labeled antibody. 

In an additional embcKliment, the invention includes a diagnostic kit for use in screening 
serum containing antigens of the polypeptide of the invention. The diagnostic kit includes a 

30 substantially isolated antibody specifically immunoreactive with polypeptide or polynucleotide 
antigens, and means for detecting the binding of the polynucleotide or polypeptide antigen to the 
antibody. In one embodiment, the antibody is attached to a solid support. In a specific 
embodiment, the antibody may be a monoclonal antibody. The detecting means of the kit may 
include a second, labeled monoclonal antibody. Alternatively, or in addition, the detecting means 

35 may include a labeled, competing antigen. 

In one diagnostic configuration, test serum is reacted with a solid phase reagent having a 
surface-bound antigen obtained by the methods of the present invention. After binding with 
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specific antigen antibody to the reagent and removing unbound serum components by washing, the 
reagent is reacted with reporter-labeled anti-human antibody to bind reporter to the reagent in 
proportion to the amount of bound anti-antigen antibody on the solid support The reagent is again 
washed to remove unbound labeled antibody, and the amount of reporter associated with the 
reagent is determined. Typically, the reporter is an enzyme which is detected by incubating the 
solid phase in the presence of a suitable fluorometric, luminescent or colorimetric substrate 
(Sigma, St Louis, MO). 

The solid surface reagent in the above assay is prepared by known techniques for attaching 
protein material to solid support material, such as polymeric beads, dip sticks, 96-well plate or 
filter material. These attachment methods generally include non-specific adsorption of the protein 
to the support or covalent attachment of the protein, typically through a free amine group, to a 
chemically reactive group on the solid support, such as an activated carboxyl, hydroxyl, or 
aldehyde group. Alternatively, streptavidin coated plates can be used in conjunction with 
biotinylated antigen(s). 

Thus, the invention provides an assay system or kit for carrying out this diagnostic 
method. The kit generally includes a support with surface- bound recombinant antigens, and a 
reporter-labeled anti-human antibody for detecting surface-bound anti-antigen antibody. 



20 Uses of the Polynucleotides 

Each of the polynucleotides identified herein can be used in numerous ways as reagents. 
The following description should be considered exemplary and utilizes known techniques. 

The polynucleotides of the present invention are useful for chromosome identification. 
There exists an ongoing need to identify new chromosome markers, since few chromosome 

25 marking reagents, based on actual sequence data (repeat polymorphisms), are presently available. 
Each sequence is specifically targeted to and can hybridize with a particular location on an 
individual human chromosome, thus each polynucleotide of the present invention can routinely be 
used as a chromosome marker using techniques known in the art. Table 1B.1, column 8 provides 
the chromosome location of some of the polynucleotides of the invention. 

30 Briefly, sequences can be mapped to chromosomes by preparing PCR primers (preferably 

at least 15 bp (e.g., 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can optionally 
be selected using computer analysis so that primers do not span more than one predicted exon in 
the genomic DNA. These primers are then used for PCR screening of somatic cell hybrids 
containing individual human chromosomes. Only those hybrids containing the human gene 

35 corresponding to SEQ ID NO:X will yield an amplified fragment. 

Similarly, somatic hybrids provide a rapid method of PCR mapping the polynucleotides to 
particular chromosomes. Three or more clones can be assigned per day using a single thermal 
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cycler. Moreover, sublocalization of the polynucleotides can be achieved with panels of specific 
chromosome fragments. Other gene mapping strategies that can be used include in situ 
hybridization, prescreening with labeled flow-sorted chromosomes, preselection by hybridization 
to construct chromosome specific-cDNA libraries, and computer mapping techniques (See, e.g., 
5 Shuler, Trends Biotechnol 16:456-459 (1998) which is hereby incorporated by reference in its 
entirety). 

Precise chromosomal location of the polynucleotides can also be achieved using 
fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. This technique 
uses polynucleotides as short as 500 or 600 bases; however, polynucleotides 2,000-4,000 bp are 
10 preferred. For a review of this technique, see Verma et al, "Human Chromosomes: a Manual of 
Basic Techniques," Pergamon Press, New York (1988). 

For chromosome mapping, the polynucleotides can be used individually (to mark a single 

§ 

chromosome or a single site on that chromosome) or in panels (for marking multiple sites and/or 
multiple chromosomes). 

15 Thus, the present invention also provides a method for chromosomal localization which 

involves (a) preparing PCR primers from the polynucleotide sequences in Table IB and/or Table 2 
and SEQ ID NO:X and (b) screening somatic cell hybrids containing individual chromosomes. 

The polynucleotides of the present invention would likewise be useful for radiation hybrid 
mapping, HAPPY mapping, and long range restriction mapping. For a review of these techniques 

20 and others known in the art, see, e.g. Dear, "Genome Mapping: A Practical Approach," IRL Press 
at Oxford University Press, London (1997); Aydin, J. Mol. Med. 77:691-694 (1999); Hacia et al., 
Mol. Psychiatry 3:483^92 (1998); Herrick et al., Chromosome Res. 7:409-423 (1999); Hamilton 
et al., Methods Cell Biol. 62:265-280 (2000); and/or Ott, J. Hered. 90:68-70 (1999) each of which 
is hereby incorporated by reference in its entirety. 

25 Once a polynucleotide has been mapped to a precise chromosomal location, the physical 

position of the polynucleotide can be used in linkage analysis. Linkage analysis establishes 
coinheritance between a chromosomal location and presentation of a particular disease. (Disease 
mapping data are found, for example, in V. McKusick, Mendelian Inheritance in Man (available 
on line through Johns Hopkins University Welch Medical Library)). Column 9 of Table 1B.1 

30 provides an OMIM reference identification number of diseases associated with the cytologic band 
disclosed in column 8 of Table 1B.1, as determined using techniques described herein and by 
reference to Table 5. Assuming 1 megabase mapping resolution and one gene per 20 kb, a cDNA 
precisely localized to a chromosomal region associated with the disease could be one of 50-500 
potential causative genes. 

35 Thus, once coinheritance is established, differences in a polynucleotide of the invention 

and the corresponding gene between affected and unaffected individuals can be examined. First, 
visible structural alterations in the chromosomes, such as deletions or translocations, are examined 
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in chromosome spreads or by PCR. If no structural alterations exist, the presence of point 
mutations are ascertained. Mutations observed in some or all affected individuals, but not in 
normal individuals, indicates that the mutation may cause the disease. However, complete 
sequencing of the polypeptide and the corresponding gene from several normal individuals is 
5 required to distinguish the mutation from a polymorphism. If a new polymorphism is identified, 
this polymorphic polypeptide can be used for further linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected individuals as 
compared to unaffected individuals can be assessed using the polynucleotides of the invention. 
Any of these alterations (altered expression, chromosomal rearrangement, or mutation) can be 
10 used as a diagnostic or prognostic marker. Diagnostic and prognostic methods, kits and reagents 
encompassed by the present invention are briefly described below and more thoroughly elsewhere 
herein (see e.g., the sections labeled "Antibodies", 'Diagnostic Assays", and 'Methods for 
Detecting Diseases"). 

Thus, the invention also provides a diagnostic method useful during diagnosis of a 
15 disorder, involving measuring the expression level of polynucleotides of the present invention in 
cells or body fluid from an individual and comparing the measured gene expression level with a 
standard level of polynucleotide expression level, whereby an increase or decrease in the gene 
expression level compared to the standard is indicative of a disorder. Additional non-limiting 
examples of diagnostic methods encompassed by the present invention are more thoroughly 
20 described elsewhere herein (see, e.g., Example 12). 

In still another embodiment, the invention includes a kit for analyzing samples for the 
presence of proliferative and/or cancerous polynucleotides derived from a test subject In a general 
embodiment, the kit includes at least one polynucleotide probe containing a nucleotide sequence 
that will specifically hybridize with a polynucleotide of the invention and a suitable container. In a 
25 specific embodiment, the kit includes two polynucleotide probes defining an internal region of the 
polynucleotide of the invention, where each probe has one strand containing a 31'mer-end internal 
to the region. In a further embodiment, the probes may be useful as primers for polymerase chain 
reaction amplification. 

Where a diagnosis of a related disorder, including, for example, diagnosis of a tumor, has 
30 already been made according to conventional methods, the present invention is useful as a 
prognostic indicator, whereby patients exhibiting enhanced or depressed polynucleotide of the 
invention expression will experience a worse clinical outcome relative to patients expressing the 
gene at a level nearer the standard level. 

By "measuring the expression level of polynucleotides of the invention" is intended 
35 qualitati vely or quantitatively measuring or estimating the level of the polypeptide of the invention 
or the level of the rnRNA encoding the polypeptide of the invention in a first biological sample 
either directly (e.g., by deterrnining or estimating absolute protein level or rnRNA level) or 
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relatively (e.g., by comparing to the polypeptide level or mRNA level in a second biological 
sample). Preferably, the polypeptide level or mRNA level in the first biological sample is 
measured or estimated and compared to a standard polypeptide level or mRNA level, the standard 
being taken from a second biological sample obtained from an individual not having the related 
5 disorder or being determined by averaging levels from a population of individuals not having a 
related disorder. As will be appreciated in the art, once a standard polypeptide level or mRNA 
level is known, it can be used repeatedly as a standard for comparison. 

By "biological sample" is intended any biological sample obtained from an individual, 
body fluid, cell line, tissue culture, or other source which contains polypeptide of the present 

10 invention or the corresponding mRNA. As indicated, biological samples include body fluids (such 
as semen, lymph, vaginal pool, sera, plasma, urine, synovial fluid and spinal fluid) which contain 
the polypeptide of the present invention, and tissue sources found to express the polypeptide of the 
present invention. Methods for obtaining tissue biopsies and body fluids from mammals are well 
known in the art. W^here the biological sample is to include mRNA, a tissue biopsy is the 

15 preferred source. 

The method(s) provided above may preferably be applied in a diagnostic method and/or 
kits in which polynucleotides and/or polypeptides of the invention are attached to a solid support. 
In one exemplary method, the support may be a "gene chip" or a "biological chip" as described in 
US Patents 5,837,832, 5,874,219, and 5,856,174. Further, such a gene chip with polynucleotides 

20 of the invention attached may be used to identify polymorphisms between the isolated 
polynucleotide sequences of the invention, with polynucleotides isolated from a test subject. The 
knowledge of such polymorphisms (i.e. their location, as well as, their existence) would be 
beneficial in identifying disease loci for many disorders, such as for example, in neural disorders, 
immune system disorders, muscular disorders, reproductive disorders, gastrointestinal disorders, 

25 pulmonary disorders, digestive disorders, metabolic disorders, cardiovascular disorders, renal 
disorders, proliferative disorders, and/or cancerous diseases and conditions. Such a method is 
described in US Patents 5,858,659 and 5,856,104. The US Patents referenced supra are hereby 
incorporated by reference in their entirety herein. 

The present invention encompasses polynucleotides of the present invention that are 

30 chemically synthesized, or reproduced as peptide nucleic acids (PNA), or according to other 
methods known in the art The use of PNAs would serve as the preferred form if the 
polynucleotides of the invention are incorporated onto a solid support, or gene chip. For the 
purposes of the present invention, a peptide nucleic acid (PNA) is a polyamide type of DNA 
analog and the monomeric units for adenine, guanine, mymine and cytosine are available 

35 commercially (Perceptive Biosystems). Certain components of DNA, such as phosphorus, 
phosphorus oxides, or deoxyribose derivatives, are not present in PNAs. As disclosed by Nielsen 
et al., Science 254, 1497 (1991); and Egholm et aL, Nature 365, 666 (1993), PNAs bind 
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specifically and tightly to complementary DNA strands and are not degraded by nucleases. In fact, 
PNA binds more strongly to DNA than DNA itself does. This is probably because there is no 
electrostatic repulsion between the two strands, and also the polyamide backbone is more flexible. 
Because of this, PNA/DNA duplexes bind under a wider range of stringency conditions than 
5 DNA/DNA duplexes, making it easier to perform multiplex hybridization. Smaller probes can be 
used than with DNA due to the strong binding. In addition, it is more likely that single base 
mismatches can be determined with PNA/DNA hybridization because a single mismatch in a 
PNA/DNA 15-mer lowers the melting point (T.sub.m) by 8°-20° C, vs. 4°-16° C for the 
DNA/DNA 15-mer duplex. Also, the absence of charge groups in PNA means that hybridization 

10 can be done at low ionic strengths and reduce possible interference by salt during the analysis. 

The compounds of the present invention have uses which include, but are not limited to, 
detecting cancer in mammals. In particular the invention is useful during diagnosis of pathological 
cell proliferative neoplasias which include, but are not limited to: acute myelogenous leukemias 
including acute monocytic leukemia, acute myeloblastic leukemia, acute promyelocytic leukemia, 

15 acute myeiomonocytic leukemia, acute erythroleukemia, acute megakaryocyte leukemia, and 
acute undifferentiated leukemia, etc.; and chronic myelogenous leukemias including chronic 
myeiomonocytic leukemia, chronic granulocytic leukemia, etc. Preferred mammals include 
monkeys, apes, cats, dogs, cows, pigs, horses, rabbits and humans. Particularly preferred are 
humans. 

20 Pathological cell proliferative disorders are often associated with inappropriate activation 

of proto-oncogenes. (Gelmann, E. P. et aL, "The Etiology of Acute Leukemia: Molecular Genetics 
and Viral Oncology," in Neoplastic Diseases of the Blood, Vol L, Wiernik, P. H. et al. eds., 161- 
182 (1985)). Neoplasias are now believed to result from the qualitative alteration of a normal 
cellular gene product, or from the quantitative modification of gene expression by insertion into 

25 the chromosome of a viral sequence, by chromosomal translocation of a gene to a more actively 
transcribed region, or by some other mechanism. (Gelmann et al., supra) It is likely that mutated 
or altered expression of specific genes is involved in the pathogenesis of some leukemias, among 
other tissues and ceil types. (Gelmann et al., supra) Indeed, the human counterparts of the 
oncogenes involved in some animal neoplasias have been amplified or translocated in some cases 

30 of human leukemia and carcinoma. (Gelmann et al., supra) 

For example, c-myc expression is highly amplified in the non-lymphocytic leukemia cell 
line HL-60. When HL-60 cells are chemically induced to stop proliferation, the level of c-myc is 
found to be downregulated. (International Publication Number WO 91/15580). However, it has 
been shown that exposure of HL-60 cells to a DNA construct that is complementary to the 5' end 

35 of c-myc or c-myb blocks translation of the corresponding mRNAs which downregulates 
expression of the c-myc or c-myb proteins and causes arrest of cell proliferation and differentiation 
of the treated cells. (International Publication Number WO 91/15580; Wickstrom et al., Proc. 
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Natl. Acad. Sci. 85:1028 (1988); Anfossi et aL, Proc. Natl. Acad. Sci. 86:3379 (1989)). However, 
the skilled artisan would appreciate the present invention's usefulness is not be limited to 
treatment, prevention, and/or prognosis of proliferative disorders of cells and tissues of 
hematopoietic origin, in light of the numerous cells and cell types of varying origins which are 
5 known to exhibit proliferative phenotypes. 

In addition to the foregoing, a polynucleotide of the present invention can be used to 
control gene expression through triple helix formation or through antisense DNA or RNA. 
Antisense techniques are discussed, for example, in Okano, J. Neurochem. 56: 560 (1991); 
"Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL 

10 (1988). Triple helix formation is discussed in, for instance Lee et al., Nucleic Acids Research 6: 
3073 (1979); Cooney et al., Science 241: 456 (1988); and Dervan et al., Science 251: 1360 (1991). 
Both methods rely on binding of the polynucleotide to a complementary DNA or RNA. For these 
techniques, preferred polynucleotides are usually oligonucleotides 20 to 40 bases in length and 
complementary to either the region of the gene involved in transcription (triple helix - see Lee et 

15 al, Nucl. Acids Res. '6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et al., 
Science 251: 1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem 56:560 (1991); 
Oligodeoxy-nucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL 
(1988)). Triple helix formation optimally results in a shut-off of RNA transcription from DNA, 
while antisense RNA hybridization blocks translation of an mRNA molecule into polypeptide. The 

20 oligonucleotide described above can also be delivered to cells such that the antisense RNA or 
DNA may be expressed in vivo to inhibit production of polypeptide of the present invention 
antigens. Both techniques are effective in model systems, and the information disclosed herein 
can be used to design antisense or triple helix polynucleotides in an effort to treat disease, and in 
particular, for the treatment of proliferative diseases and/or conditions. Non-limiting antisense and 

25 triple helix methods encompassed by the present invention are more thoroughly described 
elsewhere herein (see, e.g., the section labeled "Antisense and Ribozyme (Antagonists)"). 

Polynucleotides of the present invention are also useful in gene therapy. One goal of gene 
therapy is to insert a normal gene into an organism having a defective gene, in an effort to correct 
the genetic defect. The polynucleotides disclosed in the present invention offer a means of 

30 targeting such genetic defects in a highly accurate manner. Another goal is to insert a new gene 
that was not present in the host genome, thereby producing a new trait in the host cell. Additional 
non-limiting examples of gene therapy methods encompassed by the present invention are more 
thoroughly described elsewhere herein (see, e.g., the sections labeled "Gene Therapy Methods", 
and Examples 16, 17 and 18). 

35 The polynucleotides are also useful for identifying individuals from minute biological 

samples. The United States military, for example, is considering the use of restriction fragment 
length polymorphism (RFLP) for identification of its personnel. In this technique, an individual's 
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genomic DNA is digested with one or more restriction enzymes, and probed on a Southern blot to 
yield unique bands for identifying personnel. This method does not suffer from the current 
limitations of "Dog Tags" which can be lost, switched, or stolen, making positive identification 
difficult. The polynucleotides of the present invention can be used as additional DNA markers for 
5 RFLP. 

The polynucleotides of the present invention can also be used as an alternative to RFLP, 
by determining the actual base-by-base DNA sequence of selected portions of an individual's 
genome. These sequences can be used to prepare PCR primers for amplifying and isolating such 
selected DNA, which can then be sequenced. Using this technique, individuals can be identified 

10 because each individual will have a unique set of DNA sequences. Once an unique ID database is 
established for an individual, positive identification of that individual, living or dead, can be made 
from extremely small tissue samples. 

Forensic biology also benefits from using DNA-based identification techniques as 
disclosed herein. DNA sequences taken from very small biological samples such as tissues, e.g., 

15 hair or skin, or body fluids, e.g., blood, saliva, semen, synovial fluid, amniotic fluid, breast milk, 
lymph; pulmonary sputum or surfactant, urine, fecal matter, etc., can be amplified using PCR. Li 
one prior art technique, gene sequences amplified from polymorphic loci, such as DQa class II 
HLA gene, are used in forensic biology to identify individuals. (Erlich, H., PCR Technology, 
Freeman and Co. (1992)). Once these specific polymorphic loci are amplified, they are digested 

20 with one or more restriction enzymes, yielding an identifying set of bands on a Southern blot 
probed with DNA corresponding to the DQa class II HLA gene. Similarly, polynucleotides of the 
present invention can be used as polymorphic markers for forensic purposes. 

There is also a need for reagents capable of identifying the source of a particular tissue. 
Such need arises, for example, in forensics when presented with tissue of unknown origin. 

25 Appropriate reagents can comprise, for example, DNA probes or primers prepared from the 
sequences of the present invention, specific to tissues, including but not limited to those shown in 
Table IB. Panels of such reagents can identify tissue by species and/or by organ type. In a similar 
fashion, these reagents can be used to screen tissue cultures for contamination. Additional non- 
limiting examples of such uses are further described herein. 

30 The polynucleotides of the present invention are also useful as hybridization probes for 

differential identification of the tissues) or cell type(s) present in a biological sample. Similarly, 
polypeptides and antibodies directed to polypeptides of the present invention are useful to provide 
immunological probes for differential identification of the tissue(s) (e.g., irnmunohistochemistry 
assays) or cell type(s) (e.g., irnmunocytochemistry assays). In addition, for a number of disorders 

35 of the above tissues or cells, significantly higher or lower levels of gene expression of the 
polynucleotides/polypeptides of the present invention may be detected in certain tissues (e.g., 
tissues expressing polypeptides and/or polynucleotides of the present invention, for example, those 

592 



» 



WO 02/076488 



PCT/US02/08276 



10 



15 



disclosed in Table IB, and/or cancerous and/or wounded tissues) or bodily fluids (e.g., semen, 
lymph, vaginal pool, serum, plasma, urine, synovial fluid or spinal fluid) taken from an individual 
having such a disorder, relative to a "standard" gene expression level, i.e., the expression level in 
healthy tissue from an individual not having the disorder. 

Thus, the invention provides a diagnostic method of a disorder, which involves: (a) 
assaying gene expression level in cells or body fluid of an individual; (b) comparing the gene 
expression level with a standard gene expression level, whereby an increase or decrease in the 
assayed gene expression level compared to the standard expression level is indicative of a 
disorder. 

In the very least, the polynucleotides of the present invention can be used as molecular 
weight markers on Southern gels, as diagnostic probes for the presence of a specific mRNA in a 
particular cell type, as a probe to "subtract-out" known sequences in the process of discovering 
novel polynucleotides, for selecting and making oligomers for attachment to a "gene chip" or other 
support, to raise anti-DNA antibodies using DNA immunization techniques, and as an antigen to 
elicit an immune response. 



Uses of the Polypeptides 

Each of the polypeptides identified herein can be used in numerous ways. The following 
description should be considered exemplary and utilizes known techniques. 

20 Polypeptides and antibodies directed to polypeptides of the present invention are useful to 

provide immunological probes for differential identification of the tissue(s) (e.g., 
immunohistochemistry assays such as, for example, ABC irnmunoperoxidase (Hsu et al., J. 
Histochem. Cytochem. 29:577-580 (1981)) or cell type(s) (e.g., immunocytochemistry assays). 

Antibodies can be used to assay levels of polypeptides encoded by polynucleotides of the 

25 invention in a biological sample using classical immunohistological methods known to those of 
skill in the art (e.g., see Jalkanen, et al., J. Cell. BioL 101:976-985 (1985); Jalkanen, et al., J. Cell. 
Biol. 105:3087-3096 (1987)). Other antibody-based methods useful for detecting protein gene 
expression include immunoassays, such as the enzyme linked immunosorbent assay (ELISA) and 
the radioimmunoassay (RIA). Suitable antibody assay labels are known in the art and include 

30 enzyme labels, such as, glucose oxidase; radioisotopes, such as iodine ( 13I I, 1, 123 I, 121 I), carbon 
( 14 C), sulfur ( 35 S), tritium ( 3 H), indium ( n5m w, n3m In, 112 In, m In), and technetium ("Tc, ""Tc), 
thallium (™Ti), gallium (^Ga, 67 Ga), palladium ( 103 Pd), molybdenum ("Mo), xenon ( 133 Xe), 
fluorine ( )8 F), ,53 Sm, ,77 Lu, ,59 Gd, 149 Pm, l40 La, ,7S Yb, 146 Ho, S0 Y, 47 Sc, 186 Re, 188 Re, 142 Pr, 105 Rh, 
"Ru; luminescent labels, such as luminol; and fluorescent labels, such as fluorescein and 

35 rhodamine, and biotin. 

In addition to assaying levels of polypeptide of the present invention in a biological 
sample, proteins can also be detected in vivo by imaging. Antibody labels or markers for in vivo 
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imaging of protein include those detectable by X-radiography, NMR or ESR. For X-radiography, 
suitable labels include radioisotopes such as barium or cesium, which emit detectable radiation but 
are not overtly harmful to the subject. Suitable markers for NMR and ESR include those with a 
detectable characteristic spin, such as deuterium, which may be incorporated into the antibody by 
5 labeling of nutrients for the relevant hybridoma. 

A protein-specific antibody or antibody fragment which has been labeled with an 
appropriate detectable imaging moiety, such as a radioisotope (for example, 131 I, 112 In, ""TTc, ( 131 I, 
^ 123 I, 121 I), carbon ( 14 Q, sulfur ( 35 S), tritium fK), indium ( 115m In, 113m In, n V m In), and 
technetium ("Tc, ""Tc), thalhum (^Ti), gallium (^Ga, 67 Ga), palladium ( 103 Pd), molybdenum 

10 ("Mo), xenon ( 133 Xe), fluorine ( l8 F, 153 Sm, 177 Lu, 159 Gd, 149 Pm, 140 La, l75 Yb, 166 Ho, 47 Sc, 186 Re, 
l88 Re } 142 Pr, 105 Rh, ^Ru), a radio-opaque substance, or a material detectable by nuclear magnetic 
resonance, is introduced (for example, parenterally, subcutaneously or intraperitoneally) into the 
mammal to be examined for immune system disorder. It will be understood in the art that the size 
of the subject and the imaging system used will determine the quantity of imaging moiety needed 

15 to produce diagnostic images. In the case of a radioisotope moiety, for a human subject, the 
quantity of radioactivity injected will normally range from about 5 to 20 millicuries of ""Tc. The 
labeled antibody or antibody fragment will then preferentially accumulate at the location of cells 
which express the polypeptide encoded by a polynucleotide of the invention. In vivo tumor 
imaging is described in S.W. Burchiel et al., "Immunopharmacokinetics of Radiolabeled 

20 Antibodies and Their Fragments" (Chapter 13 in Tumor Imaging: The Radiochemical Detection of 
Cancer, S.W. Burchiel and B. A. Rhodes, eds., Masson Publishing Inc. (1982)). 

In one embodiment, the invention provides a method for the specific delivery of 
compositions of the invention to cells by administering polypeptides of the invention (e.g., 
polypeptides encoded by polynucleotides of the invention and/or antibodies) that are associated 

25 with heterologous polypeptides or nucleic acids. In one example, the invention provides a method 
for delivering a therapeutic protein into the targeted cell. In another example, the invention 
provides a method for delivering a single stranded nucleic acid (e.g., antisense or ribozymes) or 
double stranded nucleic acid (e.g., DNA that can integrate into the cell's genome or replicate 
episomally and that can be transcribed) into the targeted cell. 

30 In another embodiment, the invention provides a method for the specific destruction of 

cells (e.g., the destruction of tumor cells) by administering polypeptides of the invention in 
association with toxins or cytotoxic prodrugs. 

By "toxin" is meant one or more compounds that bind and activate endogenous cytotoxic 
effector systems, radioisotopes, holotoxins, modified toxins, catalytic subunits of toxins, or any 

35 molecules or enzymes not normally present in or on the surface of a cell that under defined 
conditions cause the cell's death. Toxins that may be used according to the methods of the 
invention include, but are not limited to, radioisotopes known in the art, compounds such as, for 
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example, antibodies (or complement fixing containing portions thereof) that bind an inherent or 
induced endogenous cytotoxic effector system, thymidine kinase, endonuclease, RNAse, alpha 
toxin, ricin, abrin, Pseudomonas exotoxin A, diphtheria toxin, saporin, momordin, gelonin, 
pokeweed antiviral protein, alpha-sarcin and cholera toxin. 'Toxin" also includes a cytostatic or 
5 cytocidal agent, a therapeutic agent or a radioactive metal ion, e.g., alpha-emitters such as, for 
example, 213 Bi, or other radioisotopes such as, for example, I03 Pd, 333 Xe, 131 I, ^Ge, ^Co, ^Zn, 85 Sr, 
32 P, 35 S, 90 Y, 1S3 Sm, 153 Gd, 169 Yb, 51 Cr, *Mn, 75 Se, U3 Sn, "Yttrium. U7 Tin, 186 Rhenium, 
I66 Holmium, and 188 Rhenium; luminescent labels, such as luminol; and fluorescent labels, such as 
fluorescein and rhodarnine, and biotin. In a specific embodiment, the invention provides a method 

10 for the specific destruction of cells (e.g., the destruction of tumor cells) by adrninistering 
polypeptides of the invention or antibodies of the invention in association with the radioisotope 
90 Y. In another specific embodiment, the invention provides a method for the specific destruction 
of cells (e.g., the destruction of tumor cells) by administering polypeptides of the invention or 
antibodies of the invention in association with the radioisotope lu In. In a further specific 

15 embodiment, the invention provides a method for the specific destruction of cells (e.g., the 
destruction of tumor cells) by administering polypeptides of the invention or antibodies of the 
invention in association with the radioisotope 131 L 

Techniques known in the art may be applied to label polypeptides of the invention 
(including antibodies). Such techniques include, but are not limited to, the use of bifunctional 

20 conjugating agents (see e.g., U.S. Patent Nos. 5,756,065; 5,714,631; 5,696,239; 5,652,361; 
5,505,931; 5,489,425; 5,435,990; 5,428,139; 5,342,604; 5,274,119; 4,994,560; and 5,808,003; the 
contents of each of which are hereby incorporated by reference in its entirety). 

Thus, the invention provides a diagnostic method of a disorder, which involves (a) 
assaying the expression level of a polypeptide of the present invention in cells or body fluid of an 

25 individual; and (b) comparing the assayed polypeptide expression level with a standard 
polypeptide expression level, whereby an increase or decrease in the assayed polypeptide 
expression level compared to the standard expression level is indicative of a disorder. With respect 
to cancer, the presence of a relatively high amount of transcript in biopsied tissue from an 
individual may indicate a predisposition for the development of the disease, or may provide a 

30 means for detecting the disease prior to the appearance of actual clinical symptoms. A more 
definitive diagnosis of this type may allow health professionals to employ preventative measures 
or aggressive treatment earlier thereby preventing the development or further progression of the 
cancer. 

Moreover, polypeptides of the present invention can be used to treat or prevent diseases or 
35 conditions such as, for example, neural disorders, immune system disorders, muscular disorders, 
reproductive disorders, gastrointestinal disorders, pulmonary disorders, cardiovascular disorders, 
renal disorders, proliferative disorders, and/or cancerous diseases and conditions. For example, 
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patients can be administered a polypeptide of the present invention in an effort to replace absent 
or decreased levels of the polypeptide (e.g., insulin), to supplement absent or decreased levels of a 
different polypeptide (e.g., hemoglobin S for hemoglobin B, SOD, catalase, DNA repair proteins), 
to inhibit the activity of a polypeptide (e.g., an oncogene or tumor supressor), to activate the 
5 activity of a polypeptide (e.g., by binding to a receptor), to reduce the activity of a membrane 
bound receptor by competing with it for free ligand (e.g., soluble TNF receptors used in reducing 
inflammation), or to bring about a desired response (e.g., blood vessel growth inhibition, 
enhancement of the immune response to proliferative cells or tissues). 

Similarly, antibodies directed to a polypeptide of the present invention can also be used to 
10 treat disease (as described supra, and elsewhere herein). For example, administration of an 
antibody directed to a polypeptide of the present invention can bind, and/or neutralize the 
polypeptide, and/or reduce overproduction of the polypeptide. Similarly, administration of an 
antibody can activate the polypeptide, such as by binding to a polypeptide bound to a membrane 
(receptor). 

15 At the very least, the polypeptides of the present invention can be used as molecular 

weight markers on SDS-PAGE gels or on molecular sieve gel filtration columns using methods 
well known to those of skill in the art Polypeptides can also be used to raise antibodies, which in 
turn are used to measure protein expression from a recombinant cell, as a way of assessing 
transformation of the host cell. Moreover, the polypeptides of the present invention can be used to 

20 test the biological activities described herein. 



Diagnostic Assays 

The compounds of the present invention are useful for diagnosis, treatment, prevention 
and/or prognosis of various disorders in mammals, preferably humans. Such disorders include, but 

25 are not limited to, those related to biological activities described in Table ID and, also as described 
herein under the section heading "Biological Activities". 

For a number of disorders, substantially altered (increased or decreased) levels of gene 
expression can be detected in tissues, cells or bodily fluids (e.g., sera, plasma, urine, semen, 
synovial fluid or spinal fluid) taken from an individual having such a disorder, relative to a 

30 "standard" gene expression level, that is, the expression level in tissues or bodily fluids from an 
individual not having the disorder. Thus, the invention provides a diagnostic method useful during 
diagnosis of a disorder, which involves measuring the expression level of the gene encoding the 
polypeptide in tissues, cells or body fluid from an individual and comparing the measured gene 
expression level with a standard gene expression level, whereby an increase or decrease in the 

35 gene expression level(s) compared to the standard is indicative of a disorder. These diagnostic 
assays may be performed in vivo or in vitro, such as, for example, on blood samples, biopsy tissue 
or autopsy tissue. 
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The present invention is also useful as a prognostic indicator, whereby patients exhibiting 
enhanced or depressed gene expression will experience a worse clinical outcome relative to 
patients expressing the gene at a level nearer the standard level. 

In certain embodiments, a polypeptide of the invention, or polynucleotides, antibodies, 
* 5 agonists, or antagonists corresponding to that polypeptide, may be used to diagnose and/or 
prognosticate diseases and/or disorders associated with the tissue(s) in which the polypeptide of 
the invention is expressed, including one, two, three, four, five, or more tissues disclosed in Table 
IB .2, column 5 (Tissue Distribution Library Code). 

By "assaying the expression level of the gene encoding the polypeptide" is intended 

10 qualitatively or quantitatively measuring or estimating the level of the polypeptide of the invention 
or the level of the niRNA encoding the polypeptide of the invention in a first biological sample 
either directly (e.g., by determining or estimating absolute protein level or mRNA level) ot 
relatively (e.g., by comparing to the polypeptide level or mRNA level in a second biological 
sample). Preferably, the polypeptide expression level or mRNA level in the first biological sample 

15 is measured or estimated and compared to a standard polypeptide level or mRNA level, the 
standard being taken from a second biological sample obtained from an individual not having the 
disorder or being determined by averaging levels from a population of individuals not having the 
disorder. As will be appreciated in the art, once a standard polypeptide level or mRNA level is 
known, it can be used repeatedly as a standard for comparison. 

20 By "biological sample" is intended any biological sample obtained from an individual, cell 

line, tissue culture, or other source containing polypeptides of the invention (including portions 
thereof) or mRNA. As indicated, biological samples include body fluids (such as sera, plasma, 
urine, synovial fluid and spinal fluid) and tissue sources found to express the full length or 
fragments thereof of a polypeptide or mRNA. Methods for obtaining tissue biopsies and body 

25 fluids from mammals are well known in the art. Where the biological sample is to include mRNA, 
a tissue biopsy is the preferred source. 

Total cellular RNA can be isolated from a biological sample using any suitable technique 
such as the single-step guanidinium-thiocyanate-phenol-chloroform method described in 
Chomczynski and Sacchi, Anal. Biochem. 162:156-159 (1987). Levels of mRNA encoding the 

30 polypeptides of the invention are then assayed using any appropriate method. These include 
Northern blot analysis, SI nuclease mapping, the polymerase chain reaction (PCR), reverse 
transcription in combination with the polymerase chain reaction (RT-PCR), and reverse 
transcription in combination with the ligase chain reaction (RT-LCR). 

The present invention also relates to diagnostic assays such as quantitative and diagnostic 

35 assays for detecting levels of polypeptides of the invention, in a biological sample (e.g., cells and 
tissues), including determination of normal and abnormal levels of polypeptides. Thus, for 
instance, a diagnostic assay in accordance with the invention for detecting over-expression of 
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polypeptides of the invention compared to normal control tissue samples may be used to detect the 
presence of tumors. Assay techniques that can be used to determine levels of a polypeptide, such 
as a polypeptide of the present invention in a sample derived from a host are well-known to those 
of skill in the art. Such assay methods include radioimmunoassays, competitive-binding assays, 
5 Western Blot analysis and ELISA assays. Assaying polypeptide levels in a biological sample can 
occur using any art-known method. 

Assaying polypeptide levels in a biological sample can occur using antibody-based 
techniques. For example, polypeptide expression in tissues can be studied with classical 
immunohistological methods (Jalkanen et al., J. Cell. Biol. 101:976-985 (1985); Jalkanen, M., et 

10 al., J. Cell . Biol. 105:3087-3096 (1987)). Other antibody-based methods useful for detecting 
polypeptide gene expression include immunoassays, such as the enzyme linked immunosorbent 
assay (ELISA) and the radioimmunoassay (RIA). Suitable antibody assay labels are known in the 
art and include enzyme labels, such as, glucose oxidase, and radioisotopes, such as iodine ( ,25 I, 
121 I), carbon ( l4 C), sulfur ( 35 S), tritium ( 3 H), indium ( m In), and technetium (""Tc), and fluorescent 

15 labels, such as fluorescein and rhodamine, and biotin. 

The tissue or cell type to be analyzed will generally include those which are known, or 
suspected, to express the gene of inteest (such as, for example, cancer). The protein isolation 
methods employed herein may, for example, be such as those described in Harlow and Lane 

■ 

(Harlow, E. and Lane, D., 1988, "Antibodies: A Laboratory Manual", Cold Spring Harbor 
20 Laboratory Press, Cold Spring Harbor, New York), which is incorporated herein by reference in its 
entirety. The isolated cells can be derived from cell culture ox from a patient. The analysis of 
cells taken from culture may be a necessary step in the assessment of cells that could be used as 
part of a cell-based gene therapy technique or, alternatively, to test the effect of compounds on the 
expression of the gene. 

25 For example, antibodies, or fragments of antibodies, such as those described herein, may 

be used to quantitatively or qualitatively detect the presence of gene products or conserved 
variants or peptide fragments thereof. This can be accomplished, for example, by 
immunofluorescence techniques employing a fluorescendy labeled antibody coupled with light 
microscopic, flow cytometric, or fluorimetric detection. 

30 In a preferred embodiment, antibodies, or fragments of antibodies directed to any one or 

all of the predicted epitope domains of the polypeptides of the invention (shown in column 7 of 
Table 1B.1) may be used to quantitatively or qualitatively detect the presence of gene products or 
conserved variants or peptide fragments thereof. This can be accomplished, for example, by 
immunofluorescence techniques employing a fluorescently labeled antibody coupled with light 

35 microscopic, flow cytometric, or fluorimetric detection. 

In an additional preferred embodiment, antibodies, or fragments of antibodies directed to a 
conformational epitope of a polypeptide of the invention may be used to quantitatively or 
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qualitatively detect the presence of gene products or conserved variants or peptide fragments 
thereof. This can be accomplished, for example, by immunofluorescence techniques employing a 
fluGrescently labeled antibody coupled with light microscopic, flow cytometric, or fluorimetric 
detection. 

5 The antibodies (or fragments thereof), and/or polypeptides of the present invention may, 

additionally, be employed histologically, as in immunofluorescence, irnmunoelectron microscopy 
or non-immunological assays, for in situ detection of gene products or conserved variants or 
peptide fragments thereof. In situ detection may be accomplished by removing a histological 
specimen from a patient, and applying thereto a labeled antibody or polypeptide of the present 

10 invention. The antibody (or fragment thereof) or polypeptide is preferably applied by overlaying 
the labeled antibody (or fragment) onto a biological sample. Through the use of such a procedure, 
it is possible to determine not only the presence of the gene product, or conserved variants or 
peptide fragments, or polypeptide binding, but also its distribution in the examined tissue. Using 
the present invention, those of ordinary skill will readily perceive that any of a wide variety of 

15 histological methods (such as staining procedures) can be modified in order to achieve such in situ 
detection. 

Immunoassays and non-immunoassays for gene products or conserved variants or peptide 
fragments thereof will typically comprise incubating a sample, such as a biological fluid, a tissue 
extract, freshly harvested cells, or lysates of cells which have been incubated in cell culture, in the 
20 presence of a detectably labeled antibody capable of binding gene products or conserved variants 
or peptide fragments thereof, and detecting the bound antibody by any of a number of techniques 
well-known in the art. 

The biological sample may be brought in contact with and immobilized onto a solid phase 
support or carrier such as nitrocellulose, or other solid support which is capable of immobilizing 

25 cells, cell particles or soluble proteins. The support may then be washed with suitable buffers 
followed by treatment with the detectably labeled antibody or detectable polypeptide of the 
invention. The solid phase support may then be washed with the buffer a second time to remove 
unbound antibody or polypeptide. Optionally the antibody is subsequently labeled. The amount 
of bound label on solid support may then be detected by conventional means. 

30 By "solid phase support or carrier ,, is intended any support capable of binding an antigen 

or an antibody. Well-known supports or carriers include glass, polystyrene, polypropylene, 
polyethylene, dextran, nylon, amylases, natural and modified celluloses, polyacrylamides, gabbros, 
and magnetite. The nature of the carrier can be either soluble to some extent or insoluble for the 
purposes of the present invention. The support material may have virtually any possible structural 

35 configuration so long as the coupled molecule is capable of binding to an antigen or antibody. 
Thus, the support configuration may be spherical, as in a bead, or cylindrical, as in the inside 
surface of a test tube, or the external surface of a rod. Alternatively, the surface may be flat such 
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as a sheet, test strip, etc. Preferred supports include polystyrene beads. Those skilled in the art 
will know many other suitable carriers for binding antibody or antigen, or will be able to ascertain 
the same by use of routine experimentation. 

The binding activity of a given lot of antibody or antigen polypeptide may be detennined 
5 according to well known methods. Those skilled in the art will be able to determine operative and 
optimal assay conditions for each deterrnination by employing routine experimentation. 

In addition to assaying polypeptide levels or polynucleotide levels in a biological sample 
obtained from an individual, polypeptide or polynucleotide can also be detected in vivo by 
imaging. For example, in one embodiment of the invention, polypeptides and/or antibodies of the 

10 invention are used to image diseased cells, such as neoplasms. In another embodiment, 
polynucleotides of the invention (e.g., polynucleotides complementary to all or a portion of an 
mRNA) and/or antibodies (e.g., antibodies directed to any one or a combination of the epitopes of 
a polypeptide of the invention, antibodies directed to a conformational epitope of a polypeptide of 
the invention, or antibodies directed to the full length polypeptide expressed on the cell surface of 

15 a mammalian cell) are used to image diseased or neoplastic cells. 

Antibody labels or markers for in vivo imaging of polypeptides of the invention include 
those detectable by X-radiography, NMR, MRI, CAT-scans or ESR. For X-radiography, suitable 
labels include radioisotopes such as barium or cesium, which emit detectable radiation but are not 
overtly harmful to the subject. Suitable markers for NMR and ESR include those with a detectable 

20 characteristic spin, such as deuterium, which may be incorporated into the antibody by labeling of 
nutrients for the relevant hybridoma. Where in vivo imaging is used to detect enhanced levels of 
polypeptides for diagnosis in humans, it may be preferable to use human antibodies or 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced using techniques 
described herein or otherwise known in the art. For example methods for producing chimeric 

25 antibodies are known in the art See, for review, Morrison, Science 229:1202 (1985); Oi et al., 
BioTechniques 4:214 (1986); Cabilly et al., U.S. Patent No. 4,816,567; Taniguchi et al., EP 
171496; Morrison et al., EP 173494; Neuberger et al., WO 8601533; Robinson et al, WO 
8702671; Boulianne et al, Nature 312:643 (1984); Neuberger et al, Nature 314:26% (1985). 

Additionally, any polypeptides of the invention whose presence can be detected, can be 

30 administered. For example, polypeptides of the invention labeled with a radio-opaque or other 
appropriate compound can be administered and visualized in vivo, as discussed, above for labeled 
antibodies. Further, such polypeptides can be utilized for in vitro diagnostic procedures. 

A polypeptide-specific antibody or antibody fragment which has been labeled with an 
appropriate detectable imaging moiety, such as a radioisotope (for example, l31 I, n2 In, ""Tc), a 

35 radio-opaque substance, or a material detectable by nuclear magnetic resonance, is introduced (for 
example, parenterally, subcutaneously or intraperitoneally) into the mammal to be examined for a 
disorder. It will be understood in the art that the size of the subject and the imaging system used 
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will determine the quantity of imaging moiety needed to produce diagnostic images. In the case of 
a radioisotope moiety for a human subject, the quantity of radioactivity injected will normally 
range from about 5 to 20 millicuries of Wm Tc. The labeled antibody or antibody fragment will then 
preferentially accumulate at the location of cells which contain the antigenic protein. In vivo 
5 tumor imaging is described in S.W. Burchiel et ah, 'Immunopharrnacokinetics of Radiolabeled 
Antibodies and Their Fragments" (Chapter 13 in Tumor Imaging: The Radiochemical Detection of 
Cancer, S.W. Burchiel and B. A. Rhodes, eds., Masson Publishing Inc. (1982)). 

With respect to antibodies, one of the ways in which an antibody of the present invention 
can be detectably labeled is by linking the same to a reporter enzyme and using the linked product 

10 in an enzyme immunoassay (EIA) (Voller, A., "The Enzyme Linked Immunosorbent Assay 
(ELISA)", 1978, Diagnostic Horizons 2:1-7, Microbiological Associates Quarterly Publication, 
Walkersville, MD); Voller et al., J. Clin. Pathol 31:507-520 (1978); Butler, J.E., Meth. EnzymoL 
73:482-523 (1981); Maggio, E. (ed.), 1980, Enzyme Immunoassay, CRC Press, Boca Raton, FL,; 
Ishikawa, E. et al, (eds.), 1981, Enzyme Immunoassay, Kgaku Shoin, Tokyo). The reporter 

15 enzyme which is bound to the antibody will react with an appropriate substrate, preferably a 
chromogenic substrate, in such a manner as to produce a chemical moiety which can be detected, 
for example, by spectrophotometric, fluorimetric or by visual means. Reporter enzymes which can 
be used to detectably label the antibody include, but are not limited to, malate dehydrogenase, 
staphylococcal nuclease, deIta-5-steroid isomerase, yeast alcohol dehydrogenase, alpha- 

20 glycerophosphate, dehydrogenase, triose phosphate isomerase, horseradish peroxidase, alkaline 
phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, 
glucose-6-phosphate dehydrogenase, glucoamylase and acetylcholinesterase. Additionally, the 
detection can be accomplished by colorirnetric methods which employ a chromogenic substrate for 
the reporter enzyme. Detection may also be accomplished by visual comparison of the extent of 

25 enzymatic reaction of a substrate in comparison with similarly prepared standards. 

Detection may also be accomplished using any of a variety of other immunoassays. For 
example, by radioactively labeling the antibodies or antibody fragments, it is possible to detect 
polypeptides through the use of a radioimmunoassay (RIA) (see, for example, Weintraub, B., 
Principles of Radioimmunoassays, Seventh Training Course on Radioligand Assay Techniques, 

30 The Endocrine Society, March, 1986, which is incorporated by reference herein). The radioactive 
isotope can be detected by means including, but not limited to, a gamma counter, a scintillation 
counter, or autoradiography. 

It is also possible to label the antibody with a fluorescent compound When the 
fluorescently labeled antibody is exposed to light of the proper wave length, its presence can then 

35 be detected due to fluorescence. Among the most commonly used fluorescent labeling compounds 
are fluorescein isothiocyanate, rhodarnine, phycoerythrin, phycocyanin, allophycocyanin, 
ophthaldehyde and fluoresc^mine. 
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The antibody can also be detectably labeled using fluorescence emitting metals such as 
152 Eu, or others of the lanthanide series. These metals can be attached to the antibody using such 
metal chelating groups as diethylenetriaminepentacetic acid (DTPA) or ethylenedianiinetetraacetic 
acid (EDTA). 

5 The antibody also can be detectably labeled by coupling it to a chemiluminescent 

compound. The presence of the chemUuminescent-tagged antibody is then determined by 
detecting the presence of luminescence that arises during the course of a chemical reaction. 
Examples of particularly useful chemiluminescent labeling compounds are luminol, isolurninol, 
theromatic acridinium ester, imidazole, acridinium salt and oxalate ester. 
10 Likewise, a bioluminescent compound may be used to label the antibody of the present 

invention. Bioluminescence is a type of chemiluminescence found in biological systems in, which 
a catalytic protein increases the efficiency of the chemiluminescent reaction. The presence of a 
bioluminescent protein is determined by detecting the presence of luminescence. Important 
bioluminescent compounds for purposes of labeling are luciferin, luciferase and aequorin. 

15 

Methods for Detecting Diseases 

In general, a disease may be detected in a patient based on the presence of one or more 
proteins of the invention and/or polynucleotides encoding such proteins in a biological sample (for 
example, blood, sera, urine, and/or tumor biopsies) obtained from the patient. In other words, such 

20 proteins may be used as markers to indicate the presence or absence of a disease or disorder, 
including cancer and/or as described elsewhere herein. In addition, such proteins may be useful for 
the detection of other diseases and cancers. The binding agents provided herein generally permit 
detection of the level of antigen that binds to the agent in the biological sample. Polynucleotide 
primers and probes may be used to detect the level of mRNA encoding polypeptides of the 

25 invention, which is also indicative of the presence or absence of a disease or disorder, including 
cancer. In general, polypeptides of the invention should be present at a level that is at least three 
fold higher in diseased tissue than in normal tissue. 

There are a variety of assay formats known to those of ordinary skill in the art for using a 
binding agent to detect polypeptide markers in a sample. See, e.g., Harlow and Lane, supra. In 

30 general, the presence or absence of a disease in a patient may be determined by (a) contacting a 
biological sample obtained from a patient with a binding agent; (b) detecting in the sample a level 
of polypeptide that binds to the binding agent; and (c) comparing the level of polypeptide with a 
predetermined cut-off value. 

In a preferred embodiment, the assay involves the use of a binding agent(s) immobilized 

35 on a solid support to bind to and remove the polypeptide of the invention from the remainder of 
the sample. The bound polypeptide may then be detected using a detection reagent that contains a 
reporter group and specifically binds to the binding agent/polypeptide complex. Such detection 
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reagents may comprise, for example, a binding agent that specifically binds to the polypeptide or 
an antibody or other agent that specifically binds to the binding agent, such as an anti- 
immunoglobulin, protein G, protein A or a lectin. Alternatively, a competitive assay may be 
utilized, in which a polypeptide is labeled with a reporter group and allowed to bind to the 
5 immobilized binding agent after incubation of the binding agent with the sample. The extent to 
which components of the sample inhibit the binding of the labeled polypeptide to the binding 
agent is indicative of the reactivity of the sample with the immobilized binding agent Suitable 
polypeptides for use within such assays include polypeptides of the invention and portions thereof, 
or antibodies, to which the binding agent binds, as described above. 

10 The solid support may be any material known to those of skill in the art to which 

polypeptides of the invention may be attached. For example, the solid support may be a test well in 
a micTotiteT plate or a nitrocellulose or other suitable membrane. Alternatively, the support may be 
a bead or disc, such as glass fiberglass, latex or a plastic material such as polystyrene or 
polyvinylchloride. The support may also be a magnetic particle or a fiber optic sensor, such as 

15 those disclosed, for example, in U.S. Patent No. 5,359,681. The binding agent may be immobilized 
on the solid support using a variety of techniques known to those of skill in the art, which are 
amply described in the patent and scientific literature. In the context of the present invention, the 
term "immobilization" refers to both noncovalent association, such as adsorption, and covalent 
attachment (which may be a direct linkage between the agent and functional groups on the support 

20 or may be a linkage by way of a cross-linking agent). Immobilization by adsorption to a well in a 
microtiter plate or to a membrane is preferred. In such cases, adsorption may be achieved by 
contacting the binding agent, in a suitable buffer, with the solid support for the suitable amount of 
time. The contact time varies with temperature, but is typically between about 1 hour and about 1 
day. In general, contacting a well of plastic microtiter plate (such as polystyrene or 

25 polyvinylchloride) with an amount of binding agent ranging from about 10 ng to about 10 ug, and 
preferably about 100 ng to about 1 ug, is sufficient to immobilize an adequate amount of binding 
agent. 

Covalent attachment of binding agent to a solid support may generally be achieved by first 
reacting the support with a bifunctional reagent that will react with both the support and a 
30 functional group, such as a hydroxyl or amino group, on the binding agent. For example, the 
binding agent may be covalently attached to supports having an appropriate polymer coating using 
benzoquinone or by condensation of an aldehyde group on the support with an amine and an active 
hydrogen on the binding partner (see, e.g., Pierce Immunotechnology Catalog and Handbook, 
1991,atA12-A13). 

35 
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Gene Therapy Methods 

Also encompassed by the invention are gene therapy methods for treating or preventing 
disorders, diseases and conditions. The gene therapy methods relate to the introduction of nucleic 
acid (DNA, RNA and antisense DNA or RNA) sequences into an animal to achieve expression of 
5 the polypeptide of the present invention. This method requires a polynucleotide which codes for a 
polypeptide of the present invention operatively linked to a promoter and any other genetic 
elements necessary for the expression of the polypeptide by the target tissue. Such gene therapy 
and delivery techniques are known in the art, see, for example, WO90/1 1092, which is herein 
incorporated by reference. 

10 Thus, for example, cells from a patient may be engineered with a polynucleotide (DNA or 

RNA) comprising a promoter operably linked to a polynucleotide of the present invention ex vivo, 
with the engineered cells then being provided to a patient to be treated with the polypeptide of the 
present invention. Such methods are well-known in the art For example, gee Beildegrun, A, et 
at., J. Natl. Cancer Inst. 85: 207-216 (1993); Ferrantini, M. et al., Cancer Research 53: 1107-1112 

15 (1993); Ferrantini, M. et al., J. Immunology 153: 4604-4615 (1994); Kaido, T., et al, Int. J. Cancer 
60: 221-229 (1995); Ogura, H., et al., Cancer Research 50: 5102-5106 (1990); Santodonato, L., et 
al., Human Gene Therapy 7:1-10 (1996); Santodonato, L., et al., Gene Therapy 4:1246-1255 
(1997); and Zhang, J.-F. et al., Cancer Gene Therapy 3: 31-38 (1996)), which are herein 
incorporated by reference. In one embodiment, the cells which are engineered are arterial cells. 

20 The arterial cells may be reintroduced into the patient through direct injection to the artery, the 
tissues surrounding the artery, or through catheter injection. 

As discussed in more detail below, the polynucleotide constructs can be delivered by any 
method that delivers injectable materials to the cells of an animal, such as, injection into the 
interstitial space of tissues (heart, muscle, skin, lung, liver, and the like). The polynucleotide 

25 constructs may be delivered in a pharmaceutically acceptable liquid or aqueous carrier. 

In one embodiment, the polynucleotide of the present invention is delivered as a naked 
polynucleotide. The term "naked" polynucleotide, DNA or RNA refers to sequences that are free 
from any delivery vehicle that acts to assist, promote or facilitate entry into the cell, including viral 
sequences, viral particles, liposome formulations, lipofectin or precipitating agents and the like. 

30 However, the polynucleotide of the present invention can also be delivered in liposome 
formulations and lipofectin formulations and the like can be prepared by methods well known to 
those skilled in the art Such methods are described, for example, in U.S. Patent Nos. 5,593,972, 
5,589,466, and 5,580,859, which are herein incorporated by reference. 

The polynucleotide vector constructs used in the gene therapy method are preferably 

35 constructs that will not integrate into the host genome nor will they contain sequences that allow 
for replication. Appropriate vectors include pWLNEO, pSV2CAT, pOG44, pXTl and pSG 
available from Stratagene; pSVK3, pBPV, pMSG and pSVL available from Pharmacia; and 
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pEFl/V5, pcDNA3.1, and pRc/CMV2 available from Invitrogen. Other suitable vectors will be 
readily apparent to the skilled artisan. 

Any strong promoter known to those skilled in the art can be used for driving the 
expression of the polynucleotide sequence. Suitable promoters include adenoviral promoters, such 
5 as the adenoviral major late promoter; or heterologous promoters, such as the cytomegalovirus 
(CMV) promoter; the respiratory syncytial virus (RSV) promoter, inducible promoters, such as the 
MMT promoter, the metallothionein promoter, heat shock promoters; the albumin promoter; the 
ApoAI promoter, human globin promoters; viral thymidine kinase promoters, such as the Herpes 
Simplex thymidine kinase promoter; retroviral LTRs; the b-actin promoter; and human growth 
10 hormone promoters. The promoter also may be the native promoter for the polynucleotide of the 
present invention. 

Unlike other gene therapy techniques, one major advantage of introducing naked nucleic 
acid sequences into target cells is the transitory nature of the polynucleotide synthesis in the cells. 
Studies have shown that non-replicating DNA sequences can be introduced into cells to provide 

15 production of the desired polypeptide for periods of up to six months. 

The polynucleotide construct can be delivered to the interstitial space of tissues within the 
an animal, including of muscle, skin, brain, lung, liver, spleen, bone marrow, thymus, heart, lymph, 
blood, bone, cartilage, pancreas, kidney, gall bladder, stomach, intestine, testis, ovary, uterus, 
rectum, nervous system, eye, gland, and connective tissue. Interstitial space of the tissues comprises 

20 the intercellular, fluid, mucopolysaccharide matrix among the reticular fibers of organ tissues, 
elastic fibers in the walls of vessels or chambers, collagen fibers of fibrous tissues, or that same 
matrix within connective tissue ensheathing muscle cells or in the lacunae of bone. It is similarly 
the space occupied by the plasma of the circulation and the lymph fluid of the lymphatic channels. 
Delivery to the interstitial space of muscle tissue is preferred for the reasons discussed below. They 

25 may be conveniently delivered by injection into the tissues comprising these cells. They are 
preferably delivered to and expressed in persistent, non-dividing cells which are differentiated, 
although delivery and expression may be achieved in non-differentiated or less completely 
differentiated cells, such as, for example, stem cells of blood or skin fibroblasts. In vivo muscle 
cells are particularly competent in their ability to take up and express polynucleotides. 

30 For the naked nucleic acid sequence injection, an effective dosage amount of DNA or RNA 

will be in the range of from about 0.05 mg/kg body weight to about 50 mg/kg body weight. 
Preferably the dosage will be from about 0.005 mg/kg to about 20 mg/kg and more preferably from 
about 0.05 mg/kg to about 5 mg/kg. Of course, as the artisan of ordinary skill will appreciate, this 
dosage will vary according to the tissue site of injection. The appropriate and effective dosage of 

35 nucleic acid sequence can readily be determined by those of ordinary skill in the art and may 
depend on the condition being treated and the route of administration. 
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The preferred route of administration is by the parenteral route of injection into the 
interstitial space of tissues. However, other parenteral routes may also be used, such as, inhalation 
of an aerosol formulation particularly for delivery to lungs or bronchial tissues, throat or mucous 
membranes of the nose. In addition, naked DNA constructs can be delivered to arteries during 
5 angioplasty by the catheter used in the procedure. 

The naked polynucleotides are delivered by any method known in the art, including, but 
not limited to, direct needle injection at the delivery site, intravenous injection, topical 
administration, catheter infusion, and so-called "gene guns". These delivery methods are known in 
the art. 

10 The constructs may also be delivered with delivery vehicles such as viral sequences, viral 

particles, liposome formulations, lipofectin, precipitating agents, etc. Such methods of delivery are 
known in the art. 

In certain embodiments, the polynucleotide constructs are complexed in a liposome 
preparation. Liposomal preparations for use in the instant invention include cationic (positively 

15 charged), anionic (negatively charged) and neutral preparations. However, cationic liposomes are 
particularly preferred because a tight charge complex can be formed between the cationic 
liposome and the polyanionic nucleic acid. Cationic liposomes have been shown to mediate 
intracellular delivery of plasmid DNA (Feigner et al., Proc. Natl. Acad. Sci. USA (1987) 
84:7413-7416, which is herein incorporated by reference); mRNA (Malone et al., Proc. Natl. 

20 Acad. Sci. USA (1989) 86:6077-6081, which is herein incorporated by reference); and purified 
transcription factors (Debs et al., J. Biol. Chem. (1990) 265:10189-10192, which is herein 
incorporated by reference), in functional form. 

Cationic liposomes are readily available. For example, 
N[l-2,3-moleyloxy)propyl]-N,N^-triemyIammonium (DOTMA) liposomes are particularly 

25 useful and are available under the trademark Lipofectin, from GIBCO BRL, Grand Island, N.Y. 
(See, also, Feigner et al., Proc. Natl Acad. Sci. USA (1987) 84:7413-7416, which is herein 
incorporated by reference). Other commercially available liposomes include transfectace 
(DDAB/DOPE) and DOTAP/DOPE (Boehringer). 

Other cationic liposomes can be prepared from readily available materials using 

30 techniques well known in the art. See, e.g. PCT Publication No. WO 90/11092 (which is herein 
incorporated by reference) for a description of the synthesis of DOTAP (l,2-bis(oleoyloxy)-3- 
(trimethylammonio)propane) liposomes. Preparation of DOTMA liposomes is explained in the 
literature, see, e.g., P. Feigner et al., Proc. Nad. Acad. Sci. USA 84:7413-7417, which is herein 
incorporated by reference. Similar methods can be used to prepare liposomes from other cationic 

35 lipid materials. 

Similarly, anionic and neutral liposomes are readily available, such as from Avanti Polar 
Lipids (Birmingham, Ala.), or can be easily prepared using readily available materials. Such 
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materials include phosphatidyl, choline, cholesterol, phosphatidyl ethanolamine, 
dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol (DOPG), 
dioleoylphoshatidyl ethanolamine (DOPE), among others. These materials can also be mixed with 
the DOTMA and DOTAP starting materials in appropriate ratios. Methods for making liposomes 
5 using these materials are well known in the art. 

For example, commercially dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl 
glycerol (DOPG), and dioleoylphosphatidyl ethanolamine (DOPE) can be used in various 
combinations to make conventional liposomes, with or without the addition of cholesterol. Thus, 
for example, DOPG/DOPC vesicles can be prepared by drying 50 mg each of DOPG and DOPC 

10 under a stream of nitrogen gas into a sonication vial. The sample is placed under a vacuum pump 
overnight and is hydrated the following day with deionized water. The sample is then sonicated for 
2 hours in a capped vial, using a Heat Systems model 350 sonicator equipped with an inverted cup 
(bath type) probe at the maximum setting while the bath is circulated at 15EC Alternatively, 
negatively charged vesicles can be prepared without sonication to produce multilamellar vesicles 

15 or by extrusion through nucleopore membranes to produce unilamellar vesicles of discrete size. 
Other methods are known and available to those of skill in the art 

The liposomes can comprise multilamellar vesicles (MLVs), small unilamellar vesicles 
(SUVs), or large unilamellar vesicles (LUVs), with SUVs being preferred. The various 
liposome-nucleic acid complexes are prepared using methods well known in the art. See, e.g., 

20 Straubinger et al., Methods of Immunology (1983), 101:512-527, which is herein incorporated by 
reference. For example, MLVs containing nucleic acid can be prepared by depositing a thin film 
of phospholipid on the walls of a glass tube and subsequently hydrating with a solution of the 
material to be encapsulated. SUVs are prepared by extended sonication of MLVs to produce a 
homogeneous population of unilamellar liposomes. The material to be entrapped is added to a 

25 suspension of preformed MLVs and then sonicated. When using liposomes containing cationic 
lipids, the dried lipid film is resuspended in an appropriate solution such as sterile water or an 
isotonic buffer solution such as 10 mM Tris/NaCI, sonicated, and then the preformed liposomes 
are mixed directly with the DNA. The liposome and DNA form a very stable complex due to 
binding of the positively charged liposomes to the cationic DNA. SUVs find use with small 

30 nucleic acid fragments. LUVs are prepared by a number of methods, well known in the art. 
Commonly used methods include Ca 2+ -EDTA chelation (Papahadjopoulos et al., Biochim. 
Biophys. Acta (1975) 394:483; Wilson et al., CeJl 17:77 (1979)); ether injection (Deamer, D. and 
Bangham, A., Biochim. Biophys. Acta 443:629 (1976); Ostro et al., Biochem. Biophys. Res. 
Commun. 76:836 (1977); Fraley et al., Proc. Natl. Acad. Sci. USA 76:3348 (1979)); detergent 

35 dialysis (Enoch, H. and Strittmatter, P., Proc. Nad. Acad. Sci. USA 76:145 (1979)); and 
reverse-phase evaporation (REV) (Fraley et al., J. Biol. Chem. 255:10431 (1980); Szoka, F. and 
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Papahadjopoulos, D., Proc. Natl. Acad. Sci. USA 75:145 (1978); Schaefer-Ridder et al., Science 
215:166 (1982)), which are herein incorporated by reference. 

Generally, the ratio of DNA to liposomes will be from about 10:1 to about 1:10. 
Preferably, the ration will be from about 5:1 to about 1:5. More preferably, the ration will be 
5 about 3:1 to about 1:3. Still more preferably, the ratio will be about 1:1. 

U.S. Patent No. 5,676,954 (which is herein incorporated by reference) reports on the 
injection of genetic material, complexed with cationic liposomes carriers, into mice. U.S. Patent 
Nos. 4,897,355, 4,946,787, 5,049,386, 5,459,127, 5,589,466, 5,693,622, 5,580,859, 5,703,055, 
and international publication no. WO 94/9469 (which are herein incorporated by reference) 

10 provide cationic lipids for use in transfecting DNA into cells and mammals. U.S. Patent Nos. 
5,589,466, 5,693,622, 5,580,859, 5,703,055, and international publication no. WO 94/9469 
provide methods for delivering DNA-cationic lipid complexes to mammals. 

In certain embodiments, cells are engineered, ex vivo or in vivo, using a retroviral particle 
containing RNA which comprises a sequence encoding a polypeptide of the present invention. 

15 Retroviruses from which the retroviral plasmid vectors may be derived include, but are not limited 
to, Moloney Murine Leukemia Virus, spleen necrosis virus, Rous sarcoma Virus, Harvey Sarcoma 
Virus, avian leukosis virus, gibbon ape leukemia virus, human immunodeficiency virus, 
Myeloproliferative Sarcoma Virus, and mammary tumor virus. 

The retroviral plasmid vector is employed to transduce packaging cell lines to form 

20 producer cell lines. Examples of packaging cells which may be transfected include, but are not 
limited to, the PE501, PA317, R-2, R-AM, PA12, T19-14X, VT-19-17-H2, RCRE, RCRIP, 
GP+E-86, GP+envAml2, and DAN cell lines as described in Miller, Human Gene Therapy 1:5-14 
(1990), which is incorporated herein by reference in its entirety. The vector may transduce the 
packaging cells through any means known in the art. Such means include, but are not limited to, 

25 electroporation, the use of liposomes, and CaPO* precipitation. In one alternative, the retroviral 
plasmid vector may be encapsulated into a liposome, or coupled to a lipid, and then administered 
to a host. 

The producer cell line generates infectious retroviral vector particles which include 
polynucleotide encoding a polypeptide of the present invention. Such retroviral vector particles 

30 then may be employed, to transduce eukaryotic ceDs, either in vitro or in vivo. The transduced 
eukaryotic ceDs will express a polypeptide of the present invention. 

In certain other embodiments, cells are engineered, ex vivo or in vivo, with polynucleotide 
contained in an adenovirus vector. Adenovirus can be manipulated such that it encodes and 
expresses a polypeptide of the present invention, and at the same time is inactivated in terms of its 

35 ability to replicate in a normal lytic viral life cycle. Adenovirus expression is achieved without 
integration of the viral DNA into the host cell chromosome, thereby alleviating concerns about 
insertional mutagenesis. Furthermore, adenoviruses have been used as live enteric vaccines for 

608 



» 

WO 02/076488 PCT/US02/08276 

many years with an excellent safety profile (Schwartz et al. Am. Rev. Respir. Dis. 109:233-238 
(1974)). Finally, adenovirus mediated gene transfer has been demonstrated in a number of 
instances including transfer of alpha- 1 -antitrypsin and CFTR to the lungs of cotton rats 
(Rosenfeld, M. A. et al. (1991) Science 252:431^34; Rosenfeld et al., (1992) Cell 68:143-155). 
5 Furthermore, extensive studies to attempt to establish adenovirus as a causative agent in human 
cancer were uniformly negative (Green, M. et al. (1979) Proc. Natl. Acad. Sci. USA 76:6606). 

Suitable adenoviral vectors useful in the present invention are described, for example, in 
Kozarsky and Wilson, Curr. Opin. Genet Devel. 3:499-503 (1993); Rosenfeld et al., Cell 68; 143- 
155 (1992); Engelhardt et al., Human Genet. Ther. 4:759-769 (1993); Yang et al., Nature Genet 

10 7:362-369 (1994); Wilson etal, Nature 365:691-692 (1993); and ' U.S. Patent No. 5,652,224, 
which are herein incorporated by reference. For example, the adenovirus vector Ad2 is useful and 
can be grown in human 293 cells. These cells contain the El region of adenovirus and 
constitutively express Ela and Elb, which complement the defective adenoviruses by providing the 
products of the genes deleted from the vector. In addition to Ad2, other varieties of adenovirus 

15 (e.g., Ad3, Ad5, and Ad7) are also useful in the present invention. 

Preferably, the adenoviruses used in the present invention are replication deficient. 
Replication deficient adenoviruses require the aid of a helper virus and/or packaging cell line to 
form infectious particles. The resulting virus is capable of infecting cells and can express a 
polynucleotide of interest which is operably linked to a promoter, but cannot replicate in most 

20 cells. Replication deficient adenoviruses may be deleted in one or more of all or a portion of the 
following genes: Ela, Elb, E3, E4, E2a, or LI through L5. 

In certain other embodiments, the cells are engineered, ex vivo or in vivo, using an adeno- 
associated virus (AAV). AAVs are naturally occurring defective viruses that require helper 
viruses to produce infectious particles (Muzyczka, K, Curr. Topics in Microbiol. Immunol. 158:97 

25 (1992)). It is also one of the few viruses that may integrate its DNA into non-dividing cells. 
Vectors containing as little as 300 base pairs of AAV can be packaged and can integrate, but space 
for exogenous DNA is limited to about 4.5 kb. Methods for producing and using such AAVs are 
known in the art. See, for example, U.S. Patent Nos. 5,139,941, 5,173,414, 5,354,678, 5,436,146, 
5,474,935, 5,478,745, and 5,589,377. 

30 For example, an appropriate AAV vector for use in the present invention will include all 

the sequences necessary for DNA replication, encapsidation, and host-cell integration. The 
polynucleotide construct is inserted into the AAV vector using standard cloning methods, such as 
those found in Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Press (1989). The recombinant AAV vector is then transfected into packaging cells which are 

35 infected with a helper virus, using any standard technique, including lipofection, elecrroporation, 
calcium phosphate precipitation, etc. Appropriate helper viruses include adenoviruses, 
cytomegaloviruses, vaccinia viruses, or herpes viruses. Once the packaging cells are transfected 
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and infected, they will produce infectious AAV viral particles which contain the polynucleotide 
construct. These viral particles are then used to transduce eukaryotic cells, either ex vivo or in 
vivo. The transduced cells will contain the polynucleotide construct integrated into its genome, 
and will express a polypeptide of the invention. 
5 Another method of gene therapy involves operably associating heterologous control 

regions and endogenous polynucleotide sequences (e.g. encoding a polypeptide of the present 
invention) via homologous recombination (see, e.g., U.S. Patent No. 5,641,670, issued June 24, 
1997; International Publication No. WO 96/29411, published September 26, 1996; International 
Publication No. WO 94/12650, published August 4, 1994; Roller et al., Proc. Natl. Acad. Sci. 
10 USA 86:8932-8935 (1989); and Zijlstra et al., Nature 342:435-438 (1989), which are herein 
encorporated by reference. This method involves the activation of a gene which is present in the 
target cells, but which is not normally expressed in the cells, or is expressed at a lower level than 
desired. 

Polynucleotide constructs are made, using standard techniques known hi the art, which 

15 contain the promoter with targeting sequences flanking the promoter. Suitable promoters are 
described herein. The targeting sequence is sufficiendy complementary to an endogenous 
sequence to permit homologous recombination of the promoter-targeting sequence with the 
endogenous sequence. The targeting sequence will be sufficiently near the 5' end of the desired 
endogenous polynucleotide sequence so the promoter will be operably linked to the endogenous 

20 sequence upon homologous recombination. 

The promoter and the targeting sequences can be amplified using PCR. Preferably, the 
amplified promoter contains distinct restriction enzyme sites on the 5' and 3' ends. Preferably, the 
3 r end of the first targeting sequence contains the same restriction enzyme site as the 5' end of the 
amplified promoter and the 5' end of the second targeting sequence contains the same restriction 

25 site as the 3' end of the amplified promoter. The amplified promoter and targeting sequences are 
digested and ligated together. 

The promoteMargeting sequence construct is delivered to the cells, either as naked 
polynucleotide, or in conjunction with transfection-facilitating agents, such as liposomes, viral 
sequences, viral particles, whole viruses, lipofection, precipitating agents, etc., described in more 

30 detail above. The P promoter-targeting sequence can be delivered by any method, included direct 
needle injection, intravenous injection, topical administration, catheter infusion, particle 
accelerators, etc. The methods are described in more detail below. 

The promoter-targeting sequence construct is taken up by cells. Homologous 
recombination between the construct and the endogenous sequence takes place, such that an 

35 endogenous sequence is placed under the control of the promoter. The promoter then drives the 
expression of the endogenous sequence. 
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The polynucleotide encoding a polypeptide of the present invention may contain a 
secretory signal sequence that facilitates secretion of the protein. Typically , the signal sequence is 
positioned in the coding region of the polynucleotide to be expressed towards or at the 5* end of 
the coding region. The signal sequence may be homologous or heterologous to the polynucleotide 

5 of interest and may be homologous or heterologous to the cells to be transfected. Additionally, the 

* 

signal sequence may be chemically synthesized using methods known in the art 

Any mode of administration of any of the above-described polynucleotides constructs can 
be used so long as the mode results in the expression of one or more molecules in an amount 
sufficient to provide a therapeutic effect This includes direct needle injection, systemic injection, 

10 catheter infusion, biolistic injectors, particle accelerators (i.e., "gene guns"), gelfoam sponge 
depots, other commercially available depot materials, osmotic pumps (e.g., Alza minipumps), oral 
or suppositorial solid (tablet or pill) pharmaceutical formulations, and decanting or topical 
applications during surgery. For example, direct injection of naked calcium 
phosphate-precipitated plasmid into rat liver and rat spleen or a protein-coated plasmid into the 

15 portal vein has resulted in gene expression of the foreign gene in the rat livers (Kaneda et at, 
Science 243:375 (1989)). 

A preferred method of local administration is by direct injection. Preferably, a 
recombinant molecule of the present invention complexed with a delivery vehicle is administered 
by direct injection into or locally within the area of arteries. Administration of a composition 

20 locally within the area of arteries refers to injecting the composition centimeters and preferably, 
millimeters within arteries. 

Another method of local administration is to contact a polynucleotide construct of the 
present invention in or around a surgical wound. For example, a patient can undergo surgery and 
the polynucleotide construct can be coated on the surface of tissue inside the wound or the 

25 construct can be injected into areas of tissue inside the wound. 

Therapeutic compositions useful in systemic administration, include recombinant 
molecules of the present invention complexed to a targeted delivery vehicle of the present 
invention. Suitable delivery vehicles for use with systemic administration comprise liposomes 
comprising ligands for targeting the vehicle to a particular site. In specific embodiments, suitable 

30 delivery vehicles for use with systemic administration comprise liposomes comprising 
polypeptides of the invention for targeting the vehicle to a particular site. 

Preferred methods of systemic administration, include intravenous injection, aerosol, oral 
and percutaneous (topical) delivery. Intravenous injections can be performed using methods 
standard in the art. Aerosol delivery can also be performed using methods standard in the art (see, 

35 for example, Stribling et at, Proc. Natl. Acad. Sci. USA 189:11277-11281, 1992, which is 
incorporated herein by reference). Oral delivery can be performed by complexing a polynucleotide 
construct of the present invention to a carrier capable of withstanding degradation by digestive 
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enzymes in the gut of an animal. Examples of such carriers, include plastic capsules or tablets, 
such as those known in the art. Topical delivery can be performed by mixing a polynucleotide 
construct of the present invention with a lipophilic reagent (e.g., DMSO) that is capable of passing 
into the skin. 

5 Deteirnining an effective amount of substance to be delivered can depend upon a number 

of factors including, for example, the chemical structure and biological activity of the substance, 
the age and weight of the animal, the precise condition requiring treatment and its severity, and the 
route of administration. The frequency of treatments depends upon a number of factors, such as 
the amount of polynucleotide constructs adrninistered per dose, as well as the health and history of 
10 the subject The precise amount, number of doses, and timing of doses will be determined by the 
attending physician or veterinarian. 

Therapeutic compositions of the present invention can be administered to any animal, 
preferably to rnammals and birds. Preferred mammals include humans, dogs, cats, mice, rats, 
rabbits sheep, cattle, horses and pigs, with humans being particularly preferred. 

15 

Biological Activities 

Polynucleotides or polypeptides, or agonists or antagonists of the present invention, can be 
used in assays to test for one or more biological activities. If these polynucleotides or 
polypeptides, or agonists or antagonists of the present invention, do exhibit activity in a particular 

20 assay, it is likely that these molecules may be involved in the diseases associated with the 
biological activity. Thus, the polynucleotides and polypeptides, and agonists or antagonists could 
be used to treat the associated disease. 

Members of the secreted family of proteins are believed to be involved in biological 
activities associated with, for example, cellular signaling. Accordingly, compositions of the 

25 invention (including polynucleotides, polypeptides and antibodies of the invention, and fragments 
and variants thereof) may be used in diagnosis, prognosis, prevention and/or treatment of diseases 
and/or disorders associated with aberrant activity of secreted polypeptides. 

In preferred embodiments, compositions of the invention (including polynucleotides, 
polypeptides and antibodies of the invention, and fragments and variants thereof) may be used in 

30 the diagnosis, prognosis, prevention, treatment, and/or amelioration of diseases and/or disorders 
relating to the gastrointestinal system (e.g., Crohn's disease, pancreatitis, gallstones, antibiotic- 
associated colitis, duodenitis, gastrointestinal neoplasms, and as described in the "Gastrointestinal 
Disorders" section below). In certain embodiments, a polypeptide of the invention, or 
polynucleotides, antibodies, agonists, or antagonists corresponding to that polypeptide, may be 

35 used to diagnose and/or prognosticate diseases and/or disorders associated with the tissue(s) in 
which the polypeptide of the invention is expressed including one, two, three, four, five, or more 
tissues disclosed in Table IB. 2, column 5 (Tissue Distribution Library Code). 
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Thus, polynucleotides, translation products and antibodies of the invention are useful in 
the diagnosis, detection, prevention, prognistication, and/or treatment of diseases and/or disorders 
associated with activities that include, but are not limited to, prohormone activation, 
neurotransmitter activity, cellular signaling, cellular proliferation, cellular differentiation, and cell 
5 migration. 

More generally, polynucleotides, translation products and antibodies corresponding to this 
gene may be useful for the diagnosis, prognosis, prevention, treatment and/or amelioration of 
diseases and/or disorders associated with the following system or systems. 

10 Immune Activity 

Polynucleotides, polypeptides, antibodies, and/or agonists or antagonists of the present 

invention may be useful in preventing, diagnosing, prognosticating, treating, and/or ameliorating 

diseases, disorders, and/or conditions of the immune system, by, for example, activating or 

inhibiting the proliferation, differentiation, or mobilization (cbemotaxis) of immune cells. 

15 Immune cells develop through a process called hematopoiesis, producing myeloid (platelets, red 
blood ceUs, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) cells from 
pluripotent stem cells. The etiology of these immune diseases, disorders, and/or conditions may 
be genetic, somatic, such as cancer and some autoimmune diseases, acquired (e.g., by 
chemotherapy or toxins), or infectious. Moreover, polynucleotides, polypeptides, antibodies, 

20 and/or agonists or antagonists of the present invention can be used as a marker or detector of a 
particular immune system disease or disorder. 

Wound Healing and Epithelial Cell Proliferation 

In accordance with yet a further aspect of the present invention, there is provided a process 
25 for utilizing polynucleotides or polypeptides, as well as agonists or antagonists of the present 
invention, for therapeutic purposes, for example, to stimulate epithelial cell proliferation and basal 
keratinocytes for the purpose of wound healing, and to stimulate hair follicle production and 
healing of dermal wounds. Polynucleotides or polypeptides, as well as agonists or antagonists of 
the present invention, may be clinically useful in stimulating wound healing including surgical 
30 wounds, excisional wounds, deep wounds involving damage of the dermis and epidermis, eye 
tissue wounds, dental tissue wounds, oral cavity wounds, diabetic ulcers, dermal ulcers, cubitus 
ulcers, arterial ulcers, venous stasis ulcers, burns resulting from heat exposure or chemicals, and 
other abnormal wound healing conditions such as uremia, malnutrition, vitamin deficiencies and 
complications associated with systemic treatment with steroids, radiation therapy and 
35 antineoplastic drugs and antimetabolites. Polynucleotides or polypeptides, as well as agonists or 
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antagonists of the present invention, could be used to promote dermal reestablishment subsequent 
to derma] loss 

Polynucleotides or polypeptides, as well as agonists or antagonists of the present 
invention, could be used to increase the adherence of skin grafts to a wound bed and to stimulate 
5 re-epithelialization from the wound bed. The following are types of grafts that polynucleotides or 
polypeptides, agonists or antagonists of the present invention, could be used to increase adherence 
to a wound bed: autografts, artificial skin, allografts, autodermic graft, autoepdermic grafts, 
avacular grafts, Blair-Brown grafts, bone graft, brephoplastic grafts, cutis graft, delayed graft, 
dermic graft, epidermic graft, fascia graft, full thickness graft, heterologous graft, xenograft, 

10 homologous graft, hyperplastic graft, lamellar graft, mesh graft, mucosal graft, Ollier-Thiersch 
graft, omenpal graft, patch graft, pedicle graft, penetrating graft, split skin graft, thick split graft. 
Polynucleotides or polypeptides, as well as agonists or antagonists of the present invention, can be 
used to promote skin strength and to improve the appearance of aged skin. 

It is believed that polynucleotides or polypeptides, as well as agonists or antagonists of the 

15 present invention, will also produce changes in hepatocyte proliferation, and epithelial cell 
proliferation in the lung, breast, pancreas, stomach, small intestine, and large intestine. 
Polynucleotides or polypeptides, as well as agonists or antagonists of the present invention, could 
promote proliferation of epithelial cells such as sebocytes, hair follicles, hepatocytes, type II 
pneumocytes, mucin-producing goblet cells, and other epithelial cells and their progenitors 

20 contained within the skin, lung, liver, and gastrointestinal tract. Polynucleotides or polypeptides, 
agonists or antagonists of the present invention, may promote proliferation of endothelial cells, 
keratinocytes, and basal keratinocytes. 

Polynucleotides or polypeptides, as well as agonists or antagonists of the present 
invention, could also be used to reduce the side effects of gut toxicity that result from radiation, 

25 chemotherapy treatments or viral infections. Polynucleotides or polypeptides, as well as agonists 
or antagonists of the present invention, may have a cytoprotective effect on the small intestine 
mucosa. Polynucleotides or polypeptides, as well as agonists or antagonists of the present 
invention, may also stimulate healing of mucositis (mouth ulcers) that result from chemotherapy 
and viral infections. 

30 Polynucleotides or polypeptides, as well as agonists or antagonists of the present 

invention, could further be used in full regeneration of skin in full and partial thickness skin 
defects, including burns, (i.e., repopulation of hair follicles, sweat glands, and sebaceous glands), 
treatment of other skin defects such as psoriasis. Polynucleotides or polypeptides, as well as 
agonists or antagonists of the present invention, could be used to treat epidermolysis bullosa, a 

35 defect in adherence of the epidermis to the underlying dermis which results in frequent, open and 
painful blisters by accelerating reepithelialization of these lesions. Polynucleotides or 
polypeptides, as well as agonists or antagonists of the present invention, could also be used to treat 
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gastric and doudenal ulcers and help heal by scar formation of the mucosal lining and regeneration 
of glandular mucosa and duodenal mucosa] lining more rapidly. Inflammatory bowel diseases, 
such as Crohn's disease and ulcerative colitis, are diseases which result in destruction of the 
mucosal surface of the small or large intestine, respectively. Thus, polynucleotides or 
5 polypeptides, as well as agonists or antagonists of the present invention, could be used to promote 
the resurfacing of the mucosal surface to aid more rapid healing and to prevent progression of 
inflammatory bowel disease. Treatment with polynucleotides or polypeptides, agonists or 
antagonists of the present invention, is expected to have a significant effect on the production of 
mucus throughout the gastrointestinal tract and could be used to protect the intestinal mucosa from 

10 injurious substances that are ingested or foDowing surgery. Polynucleotides or polypeptides, as 
well as agonists or antagonists of the present invention, could be used to treat diseases associate 
with the under expression. 

Moreover, polynucleotides or polypeptides, as well as agonists or antagonists of the 
present invention, could be used to prevent and heal damage to the lungs due to various 

15 pathological states. Polynucleotides or polypeptides, as well as agonists or antagonists of the 
present invention, which could stimulate proliferation and differentiation and promote the repair of 
alveoli and brochiolar epithelium to prevent or treat acute or chronic lung damage. For example, 
emphysema, which results in the progressive loss of aveoli, and inhalation injuries, i.e., resulting 
from smoke inhalation and burns, that cause necrosis of the bronchiolar epithelium and alveoli 

20 could be effectively treated using polynucleotides or polypeptides, agonists or antagonists of the 
present invention. Also, polynucleotides or polypeptides, as well as agonists or antagonists of the 
present invention, could be used to stimulate the proliferation of and differentiation of type H 
pneumocytes, which may help treat or prevent disease such as hyaline membrane diseases, such as 
infant respiratory distress syndrome and bronchopulmonary displasia, in premature infants. 

25 Polynucleotides or polypeptides, as well as agonists or antagonists of the present 

invention, could stimulate the proliferation and differentiation of hepatocytes and, thus, could be 
used to alleviate or treat liver diseases and pathologies such as fulminant liver failure caused by 
cirrhosis, liver damage caused by viral hepatitis and toxic substances (i.e., acetaminophen, carbon 
tetraholoride and other hepatotoxins known in the art). 

30 In addition, polynucleotides or polypeptides, as well as agonists or antagonists of the 

present invention, could be used treat or prevent the onset of diabetes mellitus. In patients with 
newly diagnosed Types I and II diabetes, where some islet cell function remains, polynucleotides 
or polypeptides, as well as agonists or antagonists of the present invention, could be used to 
maintain the islet function so as to alleviate, delay or prevent permanent manifestation of the 

35 disease. Also, polynucleotides or polypeptides, as well as agonists or antagonists of the present 
invention, could be used as an auxiliary in islet cell transplantation to improve or promote islet cell 
function. 
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Gastrointestinal Disorders 

Polynucleotides or polypeptides, or agonists or antagonists of the present invention, may 
be used to detect, prevent, diagnose, prognosticate, treat, and/or ameliorate gastrointestinal 
5 diseases and disorders, including inflammatory diseases and/or conditions, infections, cancers 
(e.g., intestinal neoplasms (carcinoid tumor of the small intestine, non-Hodgkin's lymphoma of the 
small intestine, small bowl lymphoma)), and ulcers, such as peptic ulcers. 

Gastrointestinal disorders include dysphagia, odynophagia, inflammation of the 
esophagus, peptic esophagitis, gastric reflux, submucosal fibrosis and stricturing, Mallory-Weiss 

10 lesions, leiomyomas, lipomas, epidermal cancers, adenocarcinomas, gastric retention disorders, 
gastroenteritis, gastric atrophy, gastric/stomach cancers, polyps of the stomach, autoimmune 
disorders such as pernicious anemia, pyloric stenosis, gastritis (bacterial, viral, eosinophilic, stress- 
induced, chronic erosive, atrophic, plasma cell, and Menetrier's), and peritoneal diseases (e.g., 
chyloperioneum, hemoperitoneum, mesenteric cyst, mesenteric lymphadenitis, mesenteric vascular 

15 occlusion, panniculitis, neoplasms, peritonitis, pneumoperitoneum, bubphrenic abscess,). 

Gastrointestinal disorders also include disorders associated with the small intestine, such 
as malabsorption syndromes, distension, irritable bowel syndrome, sugar intolerance, celiac 
disease, duodenal ulcers, duodenitis, tropical sprue, Whipple's disease, intestinal 
lymphangiectasia, Crohn's disease, appendicitis, obstructions of the ileum, Meckel's diverticulum, 

20 multiple diverticula, failure of complete rotation of the small and large intestine, lymphoma, and 
bacterial and parasitic diseases (such as Traveler's diarrhea, typhoid and paratyphoid, cholera, 
infection by Roundworms (Ascariasis lumbricoides), Hookworms (Ancylostoma duodenale), 
Threadworms (Enterobius vertnicularis), Tapeworms (Taenia saginata, Echinococcus granulosus, 
Diphyllobothrium spp. t and 7. solium). 

25 Liver diseases and/or disorders include intrahepatic cholestasis (alagille syndrome, biliary 

liver cirrhosis), fatty liver (alcoholic fatty liver, reye syndrome), hepatic vein thrombosis, 
hepatolenticular degeneration, hepatomegaly, hepatopulmonary syndrome, hepatorenal syndrome, 
portal hypertension (esophageal and gastric varices), liver abscess (amebic liver abscess), liver 
cirrhosis (alcoholic, biliary and experimental), alcoholic liver diseases (fatty liver, hepatitis, 

30 cirrhosis), parasitic (hepatic echinococcosis, fascioliasis, amebic liver abscess), jaundice 
(hemolytic, hepatocellular, and cholestatic), cholestasis, portal hypertension, liver enlargement, 
ascites, hepatitis (alcoholic hepatitis, animal hepatitis, chronic hepatitis (autoimmune, hepatitis B, 
hepatitis C, hepatitis D, drug induced), toxic hepatitis, viral human hepatitis (hepatitis A, hepatitis 
B, hepatitis C, hepatitis D, hepatitis E), Wilson's disease, granulomatous hepatitis, secondary 

35 biliary cirrhosis, hepatic encephalopathy, portal hypertension, varices, hepatic encephalopathy, 
primary biliary cirrhosis, primary sclerosing cholangitis, hepatocellular adenoma, hemangiomas, 
bile stones, liver failure (hepatic encephalopathy, acute liver failure), and liver neoplasms 
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(angiomyolipoma, calcified liver metastases, cystic liver metastases, epithelial tumors, 
fibrolamellar hepatocarcinoma, focal nodular hyperplasia, hepatic adenoma, hepatobiliary 
cystadenoma, hepatoblastoma, hepatocellular carcinoma, hepatoma, liver cancer, liver 
hemangioendothelioma, mesenchymal hamartoma, mesenchymal tumors of liver, nodular 
5 regenerative hyperplasia, benign liver tumors (Hepatic cysts [Simple cysts, Polycystic liver 
disease, Hepatobiliary cystadenoma, Choledochal cyst], Mesenchymal tumors [Mesenchymal 
hamartoma, Infantile hemangioendothelioma, Hemangioma, Peliosis hepatis, Lipomas, 
Inflammatory pseudotumor, Miscellaneous], Epithelial tumors [Bile duct epithelium (Bile duct 
hamartoma, Bile duct adenoma), Hepatocyte (Adenoma, Focal nodular hyperplasia, Nodular 
10 regenerative hyperplasia)], malignant liver tumors [hepatocellular, hepatoblastoma, hepatocellular 
carcinoma, cholangiocellular, cholangiocarcinoma, cystadenocarcinoma, tumors of blood vessels, 
angiosarcoma, Karposi's sarcoma, hemangioendothelioma, other tumors, embryonal sarcoma, 
fibrosarcoma, leiomyosarcoma, rhabdomyosarcoma, carcinosarcoma, teratoma, carcinoid, 
squamous carcinoma, primary lymphoma]), peliosis hepatis, erythrohepatic porphyria, hepatic 
15 porphyria (acute intermittent porphyria, porphyria cutanea tarda), Zellweger syndrome). 

Pancreatic diseases and/or disorders include acute pancreatitis, chronic pancreatitis (acute 
necrotizing pancreatitis, alcoholic pancreatitis), neoplasms , (adenocarcinoma of the pancreas, 
cystadenocarcinoma, insulinoma, gastrinoma, and glucagonoma, cystic neoplasms, islet-cell 
tumors, pancreoblastorna), and other pancreatic diseases (e.g., cystic fibrosis, cyst (pancreatic 
20 pseudocyst, pancreatic fistula, insufficiency)). 

Gallbladder diseases include gallstones (cholelithiasis and choledocholithiasis), 
postcholecystectomy syndrome, diverticulosis of the gallbladder, acute cholecystitis, chronic 
cholecystitis, bile duct tumors, and mucocele. 

Diseases and/or disorders of the large intestine include antibiotic-associated colitis, 
25 diverticulitis, ulcerative colitis, acquired megacolon, abscesses, fungal and bacterial infections, 
anorectal disorders (e.g., fissures, hemorrhoids), colonic diseases (colitis, colonic neoplasms 
[colon cancer, adenomatous colon polyps (e.g., villous adenoma), colon carcinoma, colorectal 
cancer], colonic diverticulitis, colonic diverticulosis, megacolon [Hirschsprung disease, toxic 
megacolon]; sigmoid diseases [proctocolitis, sigmoin neoplasms]), constipation, Crohn's disease, 
30 diarrhea (infantile diarrhea, dysentery), duodenal diseases (duodenal neoplasms, duodenal 
obstruction, duodenal ulcer, duodenitis), enteritis (enterocolitis), HFV enteropathy, ileal diseases 
(ileal neoplasms, ileitis), immunoproliferative small intestinal disease, inflammatory bowel disease 
(ulcerative colitis, Crohn's disease), intestinal atresia, parasitic diseases (anisakiasis, balantidiasis, 
blastocystis infections, cryptosporidiosis, dientamoebiasis, amebic dysentery, giardiasis), intestinal 
35 fistula (rectal fistula), intestinal neoplasms (cecal neoplasms, colonic neoplasms, duodenal 
neoplasms, ileal neoplasms, intestinal polyps, jejunal neoplasms, rectal neoplasms), intestinal 
obstruction (afferent loop syndrome, duodenal obstruction, impacted feces, intestinal pseudo- 
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obstruction [cecal volvulus], intussusception), intestinal perforation, intestinal polyps (colonic 
polyps, gardner syndrome, peutz-jeghers syndrome), jejunal diseases (jejunal neoplasms), 
malabsorption syndromes (blind loop syndrome, celiac disease, lactose intolerance, short bowl 
syndrome, tropical sprue, Whipple's disease), mesenteric vascular occlusion, pneumatosis 

5 cystoides intestinalis, protein-losing enteropathies (intestinal lymphagiectasis), rectal diseases 
(anus diseases, fecal incontinence, hemorrhoids, proctitis, rectal fistula, rectal prolapse, rectocele), 
peptic ulcer (duodenal ulcer, peptic esophagitis, hemorrhage, perforation, stomach ulcer, 
Zollinger-Ellison syndrome), postgastrectomy syndromes (dumping syndrome), stomach diseases 
(e.g., achlorhydria, duodenogastric reflux (bile reflux), gastric antral vascular ectasia, gastric 

10 fistula, gastric outlet obstruction, gastritis (atrophic or hypertrophic), gastroparesis, stomach 
dilatation, stomach diverticulum, stomach neoplasms (gastric cancer, gastric polyps, gastric 
adenocarcinoma, hyperplastic gastric polyp), stomach rupture, stomach ulcer, stomach volvulus), 
tuberculosis, visceroptosis, vomiting (e.g., hematemesis, hyperemesis gravidarum, postoperative 
nausea and vomiting) and hemorrhagic colitis. 

15 Further diseases and/or disorders of the gastrointestinal system include biliary tract 

diseases, such as, gastroschisis, fistula (e.g., biliary fistula, esophageal fistula, gastric fistula, 
intestinal fistula, pancreatic fistula), neoplasms (e.g., biliary tract neoplasms, esophageal 
neoplasms, such as adenocarcinoma of the esophagus, esophageal squamous cell carcinoma, 
gastrointestinal neoplasms, pancreatic neoplasms, such as adenocarcinoma of the pancreas, 

20 mucinous cystic neoplasm of the pancreas, pancreatic cystic neoplasms, pancreatoblastoma, and 
peritoneal neoplasms), esophageal disease (e.g., bullous diseases, candidiasis, glycogenic 
acanthosis, ulceration, barrett esophagus varices, atresia, cyst, diverticulum (e.g., Zenker's 
diverticulum), fistula (e.g., tracheoesophageal fistula), motility disorders (e.g., CREST syndrome, 
deglutition disorders, achalasia, spasm, gastroesophageal reflux), neoplasms, perforation (e.g., 

25 Boerhaave syndrome, Mallory-Weiss syndrome), stenosis, esophagitis, diaphragmatic hernia (e.g., 
hiatal hernia); gastrointestinal diseases, such as, gastroenteritis (e.g., cholera morbus, norwalk 
virus infection), hemorrhage (e.g., hematemesis, melena, peptic ulcer hemorrhage), stomach 
neoplasms (gastric cancer, gastric polyps, gastric adenocarcinoma, stomach cancer)), hernia (e.g., 
congenital diaphragmatic hernia, femoral hernia, inguinal hernia, obturator hernia, umbilical 

30 hernia, ventral hernia), and intestinal diseases (e.g., cecal diseases (appendicitis, cecal 
neoplasms)). 

Chemotaxis 

35 Polynucleotides or polypeptides, as well as agonists or antagonists of the present invention 

may have chemotaxis activity. A chemotaxic molecule attracts or mobilizes cells (e.g., 
monocytes, fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial 
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cells) to a particular site in the body, such as inflammation, infection, or site of hyperproliferation. 
The mobilized cells can then fight off and/or heal the particular trauma or abnormality. 

Polynucleotides or polypeptides, as well as agonists or antagonists of the present invention 
may increase chemotaxic activity of particular cells. These chemotactic molecules can then be 
5 used to treat iriflammation, infection, hyperproliferative disorders, or any immune system disorder 
by increasing the number of cells targeted to a particular location in the body. For example, 
chemotaxic molecules can be used to treat wounds and other trauma to tissues by attracting 
immune cells to the injured location. Chemotactic molecules of the present invention can also 
attract fibroblasts, which can be used to treat wounds. 
10 It is also contemplated that polynucleotides or polypeptides, as well as agonists or 

antagonists of the present invention may inhibit chemotactic activity. These molecules could also 
be used to treat disorders. Thus, polynucleotides or polypeptides, as well as agonists or 
antagonists of the present invention could be used as an inhibitor of chemotaxis. 

15 Binding Activity 

A polypeptide of the present invention may be used to screen for molecules that bind to 
the polypeptide or for molecules to which the polypeptide binds. The binding of the polypeptide 
and the molecule may activate (agonist), increase, inhibit (antagonist), or decrease activity of the 
polypeptide or the molecule bound. Examples of such molecules include antibodies, 
20 oligonucleotides, proteins (e.g., receptors),or small molecules. 

Preferably, the molecule is closely related to the natural ligand of the polypeptide, e.g., a 
fragment of the ligand, or a natural substrate, a ligand, a structural or functional mimetic. (See, 
Coligan et al., Current Protocols in Immunology l(2):Chapter 5 (1991)). Similarly, the molecule 
can be closely related to the natural receptor to which the polypeptide binds, or at least, a fragment 
25 of the receptor capable of being bound by the polypeptide (e.g., active site). In either case, the 
molecule can be rationally designed using known techniques. 

Preferably, the screening for these molecules involves producing appropriate cells which 
express the polypeptide. Preferred cells include cells from mammals, yeast, Drosophila, or E. colt 
Cells expressing the polypeptide (or cell membrane containing the expressed polypeptide) are then 
30 preferably contacted with a test compound potentially containing the molecule to observe binding, 
stimulation, or inhibition of activity of either the polypeptide or the molecule. 

The assay may simply test binding of a candidate compound to the polypeptide, wherein 
binding is detected by a label, or in an assay involving competition with a labeled competitor. 
Further, the assay may test whether the candidate compound results in a signal generated by 
35 binding to the polypeptide. 

Alternatively, the assay can be carried out using cell-free preparations, 
polypeptide/molecule affixed to a solid support, chemical libraries, or natural product mixtures. 
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The assay may also simply comprise the steps of mixing a candidate compound with a solution 
containing a polypeptide, measuring polypeptide/molecule activity or binding, and comparing the 
polypeptide/molecule activity or binding to a standard. 

Preferably, an ELISA assay can measure polypeptide level or activity in a sample (e.g., 
5 biological sample) using a monoclonal or polyclonal antibody. The antibody can measure 
polypeptide level or activity by either binding, directly or indirecdy, to the polypeptide or by 
competing with the polypeptide for a substrate. 

Additionally, the receptor to which the polypeptide of the present invention binds can be 
identified by numerous methods known to those of skill in the art, for example, ligand panning and 

10 FACS sorting (Coligan, et al., Current Protocols in Irnmun., 1(2), Chapter 5, (1991)). For 
example, expression cloning is employed wherein polyadenylated RNA is prepared from a cell 
responsive to the polypeptides, for example, NIH3T3 cells which are known to contain multiple 
receptors for the FGF family proteins, and SC-3 cells, and a cDNA library created from this RNA 
is divided into pools and used to transfect COS cells or other cells that are not responsive to the 

15 polypeptides. Transfected cells which are grown on glass slides are exposed to the polypeptide of 
the present invention, after they have been labeled. The polypeptides can be labeled by a variety 
of means including iodination or inclusion of a recognition site for a site-specific protein kinase. 

Following fixation and incubation, the slides are subjected to auto-radiographic analysis. 
Positive pools are identified and sub-pools are prepared and re-transfected using an iterative sub- 

20 pooling and re-screening process, eventually yielding a single clones that encodes the putative 
receptor. 

As an alternative approach for receptor identification, the labeled polypeptides can be 
photoaffinity linked with cell membrane or extract preparations that express the receptor molecule. 
Cross-linked material is resolved by PAGE analysis and exposed to X-ray film. The labeled 

25 complex containing the receptors of the polypeptides can be excised, resolved into peptide 
fragments, and subjected to protein microsequencing. The amino acid sequence obtained from 
microsequencing would be used to design a set of degenerate oligonucleotide probes to screen a 
cDNA library to identify the genes encoding the putative receptors. 

Moreover, the techniques of gene-shuffling, motif-shuffling, exon-shufHing, and/or 

30 codon-shuffling (collectively referred to as "DNA shuffling") may be employed to modulate the 
activities of the polypeptide of the present invention thereby effectively generating agonists and 
antagonists of the polypeptide of the present invention. See generally, U.S. Patent Nos. 5,605,793, 
5,811,238, 5,830,721, 5,834,252, and 5,837,458, and Patten, P. A., et al, Curr. Opinion 
Biotechnol. 8:724-33 (1997); Harayama, S. Trends Biotechnol 16(2):76-82 (1998); Hansson, L. 

35 O., et al, J. Mol Biol. 287:265-76 (1999); and Lorenzo, M. M. and Blasco, R. Biotechniques 
24(2):308-13 (1998); each of these patents and publications are hereby incorporated by reference). 
In one embodiment, alteration of polynucleotides and corresponding polypeptides may be 
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achieved by DNA shuffling. DNA shuffling involves the assembly of two or more DNA segments 
into a desired molecule by homologous, or site-specific, recombination. In another embodiment, 
polynucleotides and corresponding polypeptides may be altered by being subjected to random 
mutagenesis by error-prone PCR, random nucleotide insertion or other methods prior to 
5 recombination. In another embodiment, one or more components, motifs, sections, parts, domains, 
fragments, etc., of the polypeptide of the present invention may be recombined with one or more 
components, motifs, sections, parts, domains, fragments, etc. of one or more heterologous 
molecules. In preferred embodiments, the heterologous molecules are family members. In further 
preferred embodiments, the heterologous molecule is a growth factor such as, for example, 

10 platelet-derived growth factor (PDGF), insulin-like growth factor (IGF-I), transforming growth 
factor (TGF)-alpha, epidermal growth factor (EGF), fibroblast growth factor (FGF), TGF-beta, 
bone morphogenetic protein (BMP>2, BMP-4, BMP-5, BMP-6, BMP-7, activins A and B, 
decapentaplegic(dpp), 60A, OP-2, dbrsalin, growth differentiation factors (GDFs), nodal, MIS, 
inhibin-alpha, TGF-betal, TGF-beta2, TGF-beta3, TGF-beta5, and glial-derived neurotrophic 

15 factor (GDNF). 

Other preferred fragments are biologically active fragments of the polypeptide of the 
present invention. Biologically active fragments are those exhibiting activity similar, but not 
necessarily identical, to an activity of the polypeptide of the present invention. The biological 
activity of the fragments may include an improved desired activity, or a decreased undesirable 
20 activity. 

Additionally, this invention provides a method of screening compounds to identify those 
which modulate the action of the polypeptide of the present invention. An example of such an 
assay comprises combining a mammalian fibroblast cell, a the polypeptide of the present 

invention, the compound to be screened and 3[H] thymidine under cell culture conditions where 
25 the fibroblast cell would normally proliferate. A control assay may be performed in the absence of 
the compound to be screened and compared to the amount of fibroblast proliferation in the 
presence of the compound to determine if the compound stimulates proliferation by determining 

the uptake of -*[H] thymidine in each case. The amount of fibroblast cell proliferation is measured 

by liquid scintillation chromatography which measures the incorporation of 3 [H] thymidine. Both 

30 agonist and antagonist compounds may be identified by this procedure. 

In another method, a mammalian cell or membrane preparation expressing a receptor for a 
polypeptide of the present invention is incubated with a labeled polypeptide of the present 
invention in the presence of the compound. The ability of the compound to enhance or block this 
interaction could then be measured. Alternatively, the response of a known second messenger 

35 system following interaction of a compound to be screened and the receptor is measured and the 
ability of the compound to bind to the receptor and elicit a second messenger response is measured 
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to determine if the compound is a potential agonist or antagonist Such second messenger systems 
include but are not limited to, cAMP guanylate cyclase, ion channels or phosphoinositide 
hydrolysis. 

All of these above assays can be used as diagnostic or prognostic markers. The molecules 
5 discovered using these assays can be used to treat disease or to bring about a particular result in a 
patient (e.g., blood vessel growth) by activating or inhibiting the polypeptide/molecule. 
Moreover, the assays can discover agents which may inhibit or enhance the production of the 
polypeptides of the invention from suitably manipulated cells or tissues. 

Therefore, the invention includes a method of identifying compounds which bind to a 
10 polypeptide of the invention comprising the steps of: (a) incubating a candidate binding 
compound with a polypeptide of the present invention; and (b) determining if binding has 
occurred. Moreover, the invention includes a method of identifying agonists/antagonists 
comprising the steps of: (a) incubating a candidate compound with a polypeptide of the present 
invention, (b) assaying a biological activity, and (b) detenriiriing if a biological activity of the 
1 5 polypeptide has been altered. 

Targeted Delivery 

In another embodiment, the invention provides a method of delivering compositions to 
targeted cells expressing a receptor for a polypeptide of the invention, or cells expressing a cell 

20 bound form of a polypeptide of the invention. 

As discussed herein, polypeptides or antibodies of the invention may be associated with 
heterologous polypeptides, heterologous nucleic acids, toxins, or prodrugs via hydrophobic, 
hydrophilic, ionic and/or covalent interactions. In one embodiment, the invention provides a 
method for the specific delivery of compositions of the invention to cells by administering 

25 polypeptides of the invention (including antibodies) that are associated with heterologous 
polypeptides or nucleic acids. In one example, the invention provides a method for delivering a 
therapeutic protein into the targeted cell. In another example, the invention provides a method for 
delivering a single stranded nucleic acid (e.g., antisense or ribozymes) or double stranded nucleic 
acid (e.g., DNA that can integrate into the cell's genome or replicate episomally and that can be 

30 transcribed) into the targeted cell. 

In another embodiment, the invention provides a method for the specific destruction of 
cells (e.g., the destruction of tumor cells) by administering polypeptides of the invention (e.g., 
polypeptides of the invention or antibodies of the invention) in association with toxins or cytotoxic 
prodrugs. 

35 By "toxin" is meant compounds that bind and activate endogenous cytotoxic effector 

systems, radioisotopes, bolotoxins, modified toxins, catalytic subunits of toxins, or any molecules 
or enzymes not normally present in or on the surface of a cell that under defined conditions cause 
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the cell's death. Toxins that may be used according to the methods of the invention include, but 
are not limited to, radioisotopes known in the art, compounds such as, for example, antibodies (or 
complement fixing containing portions thereof) that bind an inherent or induced endogenous 
cytotoxic effector system, thymidine kinase, endonuclease, RNAse, alpha toxin, ricin, abrin, 
5 Pseudomonas exotoxin A, diphtheria toxin, saporin, momordin, gelonin, pokeweed antiviral 
protein, alpha-sarcin and cholera toxin. By "cytotoxic prodrug" is meant a non-toxic compound 
that is converted by an enzyme, normally present in the cell, into a cytotoxic compound. 
Cytotoxic prodrugs that may be used according to the methods of the invention include, but are not 
limited to, glutamyl derivatives of benzoic acid mustard alkylating agent, phosphate derivatives of 
10 etoposide or mitomycin C, cytosine arabinoside, daunorubisin, and phenoxyacetamide derivatives 
of doxorubicin. 

Drug Screening 

15 Further contemplated is the use of the polypeptides of the present invention, or the 

polynucleotides encoding these polypeptides, to screen for molecules which modify the activities 
of the polypeptides of the present invention. Such a method would include contacting the 
polypeptide of the present invention with a selected compound(s) suspected of having antagonist 
or agonist activity, and assaying the activity of these polypeptides following binding. 

20 This invention is particularly useful for screening therapeutic compounds by using the 

polypeptides of the present invention, or binding fragments thereof, in any of a variety of drug 
screening techniques. The polypeptide or fragment employed in such a test may be affixed to a 
solid support, expressed on a cell surface, free in solution, or located intracellularly. One method 
of drug screening utilizes eukaryotic or prokaryotic host cells which are stably transformed with 

25 recombinant nucleic acids expressing the polypeptide or fragment. Drugs are screened against 
such transformed cells in competitive binding assays. One may measure, for example, the 
formulation of complexes between the agent being tested and a polypeptide of the present 
invention. 

Thus, the present invention provides methods of screening for drugs or any other agents 
30 which affect activities mediated by the polypeptides of the present invention. These methods 
comprise contacting such an agent with a polypeptide of the present invention or a fragment 
thereof and assaying for the presence of a complex between the agent and the polypeptide or a 
fragment thereof, by methods well known in the art. In such a competitive binding assay, the 
agents to screen are typically labeled. Following incubation, free agent is separated from that 
35 present in bound form, and the amount of free or uncomplexed label is a measure of the ability of a 
particular agent to bind to the polypeptides of the present invention. 
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Another technique for drug screening provides high throughput screening for compounds 
having suitable binding affinity to the polypeptides of the present invention, and is described in 
great detail in European Patent Application 84/03564, published on September 13, 1984, which is 
incorporated herein by reference herein. Briefly stated, large numbers of different small peptide 
5 test compounds are synthesized on a solid substrate, such as plastic pins or some other surface. 
The peptide test compounds are reacted with polypeptides of the present invention and washed. 
Bound polypeptides are then detected by methods well known in the art. Purified polypeptides are 
coated directly onto plates for use in the aforementioned drug screening techniques. In addition, 
non-neutralizing antibodies may be used to capture the peptide and immobilize it on the solid 
10 support. 

This invention also contemplates the use of competitive drug screening assays in which 
neutralizing antibodies capable of binding polypeptides of the present invention specifically 
compete with a test compound for binding to the polypeptides or fragments thereof. In this 
manner, the antibodies are used to detect the presence of any peptide which shares one or more 
15 antigenic epitopes with a polypeptide of the invention. 

Antisense And Ribozvme (Antagonists) 

hi specific embodiments, antagonists according to the present invention are nucleic acids 
corresponding to the sequences contained in SEQ ID NO:X, or the complementary strand thereof, 

20 and/or to cDNA sequences contained in cDNA ATCC Deposit No:Z identified for example, in 
Table 1A and/or IB. In one embodiment, antisense sequence is generated internally, by the 
organism, in another embodiment, the antisense sequence is separately administered (see, for 
example, O'Connor, J., Neurochem. 56:560 (1991). Oligodeoxynucleotides as Antisense 
Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988). Antisense technology can be 

25 used to control gene expression through antisense DNA or RNA, or through triple-helix formation. 
Antisense techniques are discussed for example, in Okano, J., Neurochem 56:560 (1991); 
Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL 
(1988). Triple helix formation is discussed in, for instance, Lee et al., Nucleic Acids Research 
6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et al., Science 251:1300 

30 (1991). The methods are based on binding of a polynucleotide to a complementary DNA or RNA. 

For example, the use of c-myc and c-myb antisense RNA constructs to inhibit the growth 
of the non-lymphocytic leukemia cell line HL-60 and other cell lines was previously described. 
(Wickstrom et al. (1988); Anfossi et al. (1989)). These experiments were performed in vitro by 
incubating cells with the oligoribonucleotide. A similar procedure for in vivo use is described in 

35 WO 91/15580. Briefly, a pair of oligonucleotides for a given antisense RNA is produced as 
follows: A sequence complimentary to the first 15 bases of the open reading frame is flanked by 
an EcoRl site on the 5 end and a Hindin site on the 3 end. Next, the pair of oligonucleotides is 
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heated at 90°C for one minute and then annealed in 2X ligation buffer (20mM TRIS HC1 pH 7.5, 
lOmM MgC12, 10MM dithiothreitol (DTT) and 0.2 mM ATP) and then ligated to the EcoRl/Hind 
EI site of the retroviral vector PMV7 (WO 91/15580). 

For example, the 5' coding portion of a polynucleotide that encodes the polypeptide of the 
5 present invention may be used to design an antisense RNA oligonucleotide of from about 10 to 40 
base pairs in length. A DNA oligonucleotide is designed to be complementary to a region of the 
gene involved in transcription thereby preventing transcription and the production of the receptor. 
The antisense RNA oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the 
mRNA molecule into receptor polypeptide. 

10 In one embodiment, the antisense nucleic acid of the invention is produced intracellular^ 

by transcription from an exogenous sequence. For example, a vector or a portion thereof, is 
transcribed, producing an antisense nucleic acid (RNA) of the invention. Such a vector would 
contain a sequence encoding the antisense nucleic acid. Such a vector can remain episomal or 
become chromosomally integrated, as long as it can be transcribed to produce the desired antisense 

15 RNA. Such vectors can be constructed by recombinant DNA technology methods standard in the 
art Vectors can be plasmid, viral, or others known in the art, used for replication and expression 
in vertebrate cells. Expression of the sequence encoding the polypeptide of the present invention 
or fragments thereof, can be by any promoter known in the art to act in vertebrate, preferably 
human cells. Such promoters can be inducible or constitutive. Such promoters include, but are 

20 not limited to, the SV40 early promoter region (Bernoist and Chambon, Nature 29:304-310 (1981), 
the promoter contained in the 3' long terminal repeat of Rous sarcoma virus (Yamamoto et al., Cell 
22:787-797 (1980), the herpes thymidine promoter (Wagner et al., Proc. Natl. Acad. Sci. U.S.A. 
78:1441-1445 (1981), the regulatory sequences of the metallothionein gene (Brinster, et al., Nature 
296:39-42(1982)), etc. 

25 The antisense nucleic acids of the invention comprise a sequence complementary to at 

least a portion of an RNA transcript of a gene of the present invention. However, absolute 
complementarity, although preferred, is not required. A sequence "complementary to at least a 
portion of an RNA," referred to herein, means a sequence having sufficient complementarity to be 
able to hybridize with the RNA, forming a stable duplex; in the case of double stranded antisense 

30 nucleic acids, a single strand of the duplex DNA may thus be tested, or triplex formation may be 
assayed. The ability to hybridize will depend on both the degree of complementarity and the 
length of the antisense nucleic acid. Generally, the larger the hybridizing nucleic acid, the more 
base mismatches with a RNA it may contain and still form a stable duplex (or triplex as the case 
may be). One skilled in the art can ascertain a tolerable degree of mismatch by use of standard 

35 procedures to determine the melting point of the hybridized complex. 

Oligonucleotides that are complementary to the 5' end of the message, e.g., the 5* 
untranslated sequence up to and including the AUG initiation codon, should work most efficiently 
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at inhibiting translation. However, sequences complementary to the 3' untranslated sequences of 
mRNAs have been shown to be effective at inhibiting translation of mRNAs as well. See 
generally, Wagner, R., 1994, Nature 372:333-335. Thus, oligonucleotides complementary to 
either the 5'- or 3 - non- translated, non-coding regions of polynucleotide sequences described 
5 herein could be used in an antisense approach to inhibit translation of endogenous mRNA. 
Oligonucleotides complementary to the 5' untranslated region of the mRNA should include the 
complement of the AUG start codon. Antisense oligonucleotides complementary to mRNA 
coding regions are less efficient inhibitors of translation but could be used in accordance with the 
invention. Whether designed to hybridize to the 5'-, 3'- or coding region of mRNA of the present 
10 invention, antisense nucleic acids should be at least six nucleotides in length, and are preferably 
oligonucleotides ranging from 6 to about 50 nucleotides in length. In specific aspects the 
oligonucleotide is at least 10 nucleotides, at least 17 nucleotides, at least 25 nucleotides or at least 
50 nucleotides. 

The polynucleotides of the invention can be DNA or RNA or chimeric mixtures or 
15 derivatives or modified versions thereof, single-stranded or double-stranded. The oligonucleotide 
can be modified at the base moiety, sugar moiety, or phosphate backbone, for example, to improve 
stability of the molecule, hybridization, etc. The oligonucleotide may include other appended 
groups such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et aL, 1989, Proc. Natl. Acad. Sci. U.S.A. 
20 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. 
WO88/09810, published December 15, 1988) or the blood-brain barrier (see, e.g., PCT Publication 
No. WO89/10134, published April 25, 1988), hybridization-triggered cleavage agents. (See, e.g., 
Krol et al., 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., Zon, 1988, Pharnx 
Res. 5:539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a 
25 peptide, hybridization triggered cross-linking, agent, transport agent, hybridization-triggered 
cleavage agent, etc. 

The antisense oligonucleotide may comprise at least one modified base moiety which is 
selected from the group including, but not limited to, 5-fluorouracil, 5-bromouracii, 5-chlorouracil, 
5-iodouracil, hypoxanthioe, xantine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 

30 5-carboxymethylammomemyl-2-thiouridine, 5-carboxymeihylammomethyluracil, dihydrouracil, 
beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 
2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, 
N6-adenine, 7-methylguanine, 5-memylaminomethyluracil, 5-methoxyammomemyl-2-thiouracil, 
beta-D-mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 

35 isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 
2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxy acetic 
acid methylester, uracil-5-oxyacetic acid (v), 5 -methyl -2-thi ouraci 1, 3-(3-amino-3-N-2- 
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carboxypropyl) uracil, (acp3)w, and 2,6^aniinopurine. 

The antisense oligonucleotide may also comprise at least one modified sugar moiety 
selected from the group including, but not limited to, arabinose, 2-fluoroarabinose, xylulose, and 
hexose. 

5 In yet another embodiment, the antisense oligonucleotide comprises at least one modified 

phosphate backbone selected from the group including, but not limited to, a phosphorothioate, a 
phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a 
methylphosphonate, an alkyl phosphotriester, and a formacetal or analog thereof. 

In yet another embodiment, the antisense oligonucleotide is an a-anomeric 

10 oligonucleotide. An a-anomeric oligonucleotide forms specific double-stranded hybrids with 
complementary RNA in which, contrary to the usual b-units, the strands run parallel to each other 
(Gautier et al., 1987, Nucl. Acids Res. 15:6625-6641). The oligonucleotide is a T-O- 
methylribonucleotide (Inoue et al., 1987, Nucl. Acids Res. 15:6131-6148), or a chimeric RNA- 
DNA analogue (Inoue et al., 1987, FEBS Lett. 215:327-330). 

15 * Polynucleotides of the invention may be synthesized by standard methods known in the 

art, e.g. by use of an automated DNA synthesizer (such as are commercially available from 
Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate oligonucleotides may be 
synthesized by the method of Stein et al. (1988, Nucl. Acids Res. 16:3209), methylphosphonate 
oligonucleotides can be prepared by use of controlled pore glass polymer supports (Sarin et al., 

20 1988, Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451), etc. 

While antisense nucleotides complementary to the coding region sequence could be used, 
those complementary to the transcribed untranslated region are most preferred. 

Potential antagonists according to the invention also include catalytic RNA, or a ribozyme 
(See, e.g., PCT International Publication WO 90/11364, published October 4, 1990; Sarver et al, 

25 Science 247:1222-1225 (1990). While ribozymes that cleave mRNA at site specific recognition 
sequences can be used to destroy mRNAs, the use of hammerhead ribozymes is preferred. 
Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form 
complementary base pairs with the target mRNA. The sole requirement is that the target mRNA 
have the following sequence of two bases: 5'-UG-3\ The construction and production of 

30 hammerhead ribozymes is well known in the art and is described more fully in Haseloff and 
Gerlach, Nature 334:585-591 (1988). There are numerous potential hammerhead ribozyme 
cleavage sites within the nucleotide sequence of SEQ ID NO:X. Preferably, the ribozyme is 
engineered so that the cleavage recognition site is located near the 5' end of the mRNA; i.e., to 
increase efficiency and minimize the intracellular accumulation of non-functional mRNA 

35 transcripts. 

As in the antisense approach, the ribozymes of the invention can be composed of modified 
oligonucleotides (e.g., for improved stability, targeting, etc.) and should be delivered to cells 
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which express in vivo. DNA constructs encoding the ribozyme may be introduced into the cell in 
the same manner as described above for the introduction of antisense encoding DNA. A preferred 
method of delivery involves using a DNA construct "encoding" the ribozyme under the control of 
a strong constitutive promoter, such as, for example, pol HI or poi H promoter, so that transfected 
5 cells will produce sufficient quantities of the ribozyme to destroy endogenous messages and 
inhibit translation. Since ribozymes unlike antisense molecules, are catalytic, a lower intracellular 
concentration is required for efficiency. 

Antagonist/agonist compounds may be employed to inhibit the cell growth and 
proliferation effects of the polypeptides of the present invention on neoplastic cells and tissues, i.e. 
10 stimulation of angiogenesis of tumors, and, therefore, retard or prevent abnormal cellular growth 
and proliferation, for example, in tumor formation or growth. 

The antagonist/agonist may also be employed to prevent hyper-vascular diseases, and 
prevent the proliferation of epithelial lens cells after extracapsular cataract surgery. Prevention of 
the mitogenic activity of the polypeptides of the present invention may also be desirous in cases 
15 such as restenosis after balloon angioplasty. 

The antagonist/agonist may also be employed to prevent the growth of scar tissue during 
wound healing. 

The antagonist/agonist may also be employed to treat the diseases described herein. 

Thus, the invention provides a method of treating disorders or diseases, including but not 
20 limited to the disorders or diseases listed throughout this application, associated with 
overexpression of a polynucleotide of the present invention by administering to a patient (a) an 
antisense molecule directed to the polynucleotide of the present invention, and/or (b) a ribozyme 
directed to the polynucleotide of the present invention. 

25 Binding Peptides and Other Molecules 

The invention also encompasses screening methods for identifying polypeptides and 
nonpolypeptides that bind polypeptides of the invention, and the binding molecules identified 
thereby. These binding molecules are useful, for example, as agonists and antagonists of the 
polypeptides of the invention. Such agonists and antagonists can be used, in accordance with the 
30 invention, in the therapeutic embodiments described in detail, below. 

This method comprises the steps oft 
contacting polypeptides of the invention with a plurality of molecules; and 
identifying a molecule that binds the polypeptides of the invention. 
The step of contacting the polypeptides of the invention with the plurality of molecules 
35 may be effected in a number of ways. For example, one may contemplate immobilizing the 
polypeptides on a solid support and bringing a solution of the plurality of molecules in contact 
with the immobilized polypeptides. Such a procedure would be akin to an affinity 
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chromatographic process, with the affinity matrix being comprised of the immobilized 
polypeptides of the invention. The molecules having a selective affinity for the polypeptides can 
then be purified by affinity selection. The nature of the solid support, process for attachment of the 
polypeptides to the solid support, solvent, and conditions of the affinity isolation or selection are 
5 largely conventional and well known to those of ordinary skill in the art 

Alternatively, one may also separate a plurality of polypeptides into substantially separate 
fractions comprising a subset of or individual polypeptides. For instance, one can separate the 
plurality of polypeptides by gel electrophoresis, column chromatography, or like method known to 
those of ordinary skill for the separation of polypeptides. The individual polypeptides can also be 

10 produced by a transformed host cell in such a way as to be expressed on or about its outer surface 
(e.g., a recombinant phage). Individual isolates can then be "probed" by the polypeptides of the 
invention, optionally in the presence of an inducer should one be required for expression, to 
determine if any selective affinity interaction takes place between the polypeptides and the 
individual clone. Prior to contacting the polypeptides with each fraction comprising individual 

15 polypeptides, the polypeptides could first be transferred to a solid support for additional 
convenience. Such a solid support may simply be a piece of filter membrane, such as one made of 
nitrocellulose or nylon. In this manner, positive clones could be identified from a collection of 
transformed host cells of an expression library, which harbor a DNA construct encoding a 
polypeptide having a selective affinity for polypeptides of the invention. Furthermore, the amino 

20 acid sequence of the polypeptide having a selective affinity for the polypeptides of the invention 
can be determined directly by conventional means or the coding sequence of the DNA encoding 
the polypeptide can frequently be determined more conveniently. The primary sequence can then 
be deduced from the corresponding DNA sequence. If the amino acid sequence is to be determined 
from the polypeptide itself, one may use microsequencing techniques. The sequencing technique 

25 may include mass spectroscopy. 

In certain situations, it may be desirable to wash away any unbound polypeptides from a 
mixture of the polypeptides of the invention and the plurality of polypeptides prior to attempting to 
determine or to detect the presence of a selective affinity interaction. Such a wash step may be 
particularly desirable when the polypeptides of the invention or the plurality of polypeptides are 

30 bound to a solid support 

The plurality of molecules provided according to this method may be provided by way of 
diversity libraries, such as random or combinatorial peptide or nonpeptide libraries which can be 
screened for molecules that specifically bind polypeptides of the invention. Many libraries are 
known in the art that can be used, e.g., chemically synthesized libraries, recombinant (e.g., phage 

35 display libraries), and in vitro translation-based libraries. Examples of chemically synthesized 
libraries are described in Fodor et a!., 1991, Science 251:767-773; Houghten et al., 1991, Nature 
354:84-86; Lam et al., 1991, Nature 354:82-84; Medynski, 1994, Bio/Technology 12:709- 
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710;Gallop et al., 1994, J. Medicinal Chemistry 37(9): 1233-1251; Ohlmeyer et al., 1993, Proc. 
Natl. Acad. Sci. USA 90:10922-10926; Erb et aL, 1994, Proc. Natl. Acad. Sci. USA 91:11422- 
11426; Houghten et aL, 1992, Biotechniques 13:412; Jayawickreme et al., 1994, Proc. Natl. AcacL 
Sci. USA 91:1614-1618; Salmon et al., 1993, Proc. Natl. Acad. Sci. USA 90:11708-11712; PCT 
5 Publication No. WO 93/20242; and Brenner and Lerner, 1992, Proc. Natl. Acad. Sci. USA 
89:5381-5383. 

Examples of phage display libraries are described in Scott and Smith, 1990, Science 
249:386-390; Devlin et al., 1990, Science, 249:404-406; Christian, R. B., et al., 1992, J. MoL Biol. 
227:711-718); Lenstra, 1992, J. Immunol. Meth. 152:149-157; Kay et al., 1993, Gene 128:59-65; 

10 and PCT Publication No. WO 94/183 18 dated Aug. 18, 1994. 

hi vitro translation-based libraries include but are not limited to those described in PCT 
Publication No. WO 91/05058 dated Apr. 18, 1991; and Mattheakis et al., 1994, Proc. Natl Acad. 
Sci. USA 91:9022-9026. 

By way of examples of nonpeptide libraries, a benzodiazepine library (see e.g., Bunin et 

15 al., 1994, Proc. Natl. Acad. Sci. USA 91:4708-4712) can be adapted for use. Peptoid libraries 
(Simon et aL, 1992, Proc. Natl. Acad. Sci. USA 89:9367-9371) can also be used. Another example 
of a library that can be used, in which the amide functionalities in peptides have been 
permethylated to generate a chemically transformed combinatorial library, is described by Ostresh 
et al. (1994, Proc. Natl. Acad. Sci. USA 91:11138-11142). 

20 The variety of non-peptide libraries that are useful in the present invention is great. For 

example, Ecker and Crooke, 1995, Bio/Technology 13:351-360 list benzodiazepines, hydantoirts, 
piperazinediones, biphenyls, sugar analogs, beta-mercaptoketones, arylacetic acids, 
acylpiperidines, benzopyrans, cubanes, xanthines, aminimides, and oxazolones as among the 
chemical species that form the basis of various libraries. 

25 Non-peptide libraries can be classified broadly into two types: decorated monomers and 

oligomers. Decorated monomer libraries employ a relatively simple scaffold structure upon which 
a variety functional groups is added. Often the scaffold will be a molecule with a known useful 
pharmacological activity. For example, the scaffold might be the benzodiazepine structure. 

Non-peptide oligomer libraries utilize a large number of monomers that are assembled 

30 together in ways that create new shapes that depend on the order of the monomers. Among the 
monomer units that have been used are carbamates, pyrrolinones, and morpholinos. Peptoids, 
peptide-like oligomers in which the side chain is attached to the alpha amino group rather than the 
alpha carbon, form the basis of another version of non-peptide oligomer libraries. The first non- 
peptide oligomer libraries utilized a single type of monomer and thus contained a repeating 

35 backbone. Recent libraries have utilized more than one monomer, giving the libraries added 
flexibility. 

Screening the libraries can be accomplished by any of a variety of commonly known 
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methods. See, e.g., the following references, which disclose screening of peptide libraries: Parmley 
and Smith, 1989, Adv. Exp. Med. Biol. 251:215-218; Scott and Smith, 1990, Science 249:386- 
390; Fowlkes et al., 1992; BioTechniques 13:422-427; Oldenburg et aL, 1992, Proc. Natl. Acad. 
Sci. USA 89:5393-5397; Yu et al., 1994, Cell 76:933-945; Staudt et al., 1988, Science 241:577- 
5 580; Bock et al., 1992, Nature 355:564-566; Tuerk et al., 1992, Proc. Natl. Acad. Sci. USA 
89:6988-6992; Ellington et al., 1992, Nature 355:850-852; U.S. Pat No. 5,096,815, U.S. Pat. No. 
5,223,409, and U.S. Pat. No. 5,198,346, all to Ladner et al.; Rebar and Pabo, 1993, Science 
263:671-673; and CT Publication No. WO 94718318. 

In a specific embodiment, screening to identify a molecule that binds polypeptides of the 

10 invention can be carried out by contacting the library members with polypeptides of the invention 
immobilized on a solid phase and harvesting those library members that bind to the polypeptides 
of the invention. Examples of such screening methods, termed "parining" techniques are described 
by way of example in Parmley and Smith, 1988, Gene 73:305-318; Fowlkes et al., 1992, 
BioTechniques 13:422^127; PCT Publication No. WO 94/18318; and in references cited herein. 

15 In another embodiment, the two-hybrid system for selecting interacting proteins in yeast 

(Fields and Song, 1989, Nature 340:245-246; Chien et al., 1991, Proc. Natl. Acad. Sci. USA 
88:9578-9582) can be used to identify molecules that specifically bind to polypeptides of the 
invention. 

Where the binding molecule is a polypeptide, the polypeptide can be conveniently selected 
20 from any peptide library, including random peptide libraries, combinatorial peptide libraries, or 
biased peptide libraries. The term "biased" is used herein to mean that the method of generating 
the library is manipulated so as to restrict one or more parameters that govern the diversity of the 
resulting collection of molecules, in this case peptides. 

Thus, a truly random peptide library would generate a collection of peptides in which the 
25 probability of finding a particular amino acid at a given position of the peptide is the same for all 
20 amino acids. A bias can be introduced into the library, however, by specifying, for example, 
that a lysine occur every fifth amino acid or that positions 4, 8, and 9 of a decapeptide library be 
fixed to include only arginine. Clearly, many types of biases can be contemplated, and the present 
invention is not restricted to any particular bias. Furthermore, the present invention contemplates 
30 specific types of peptide libraries, such as phage displayed peptide libraries and those that utilize a 
DNA construct comprising a lambda phage vector with a DNA insert. 

As mentioned above, in the case of a binding molecule that is a polypeptide, the 
polypeptide may have about 6 to less than about 60 amino acid residues, preferably about 6 to 
about 10 amino acid residues, and most preferably, about 6 to about 22 amino acids. In another 
35 embodiment, a binding polypeptide has in the range of 15-100 amino acids, or 20-50 amino acids. 

The selected binding polypeptide can be obtained by chemical synthesis or recombinant 
expression. 
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Other Activities 

A polypeptide, polynucleotide, agonist, or antagonist of the present invention, as a result 
of the ability to stimulate vascular endothelial cell growth, may be employed in treatment for 
5 stimulating re-vascularization of ischemic tissues due to various disease conditions such as 
thrombosis, arteriosclerosis, and other cardiovascular conditions. The polypeptide, 
polynucleotide, agonist, or antagonist of the present invention may also be employed to stimulate 
angiogeriesis and limb regeneration, as discussed above. 

A polypeptide, polynucleotide, agonist, or antagonist of the present invention may also be 
10 employed for treating wounds due to injuries, burns, post-operative tissue repair, and ulcers since 
they are mitogenic to various cells of different origins, such as fibroblast cells and skeletal muscle 
cells, and therefore, facilitate the repair or replacement of damaged or diseased tissue. 

A polypeptide, polynucleotide, agonist, or antagonist of the present invention may also be 
employed stimulate neuronal growth and to treat and prevent neuronal damage which occurs in 
15 certain neuronal disorders or neurodegenerative conditions such as Alzheimer's disease, 
Parkinson's disease, and AIDS-related complex. A polypeptide, polynucleotide, agonist, or 
antagonist of the present invention may have the ability to stimulate chondrocyte growth, 
therefore, they may be employed to enhance bone and periodontal regeneration and aid in tissue 
transplants or bone grafts. 

20 A polypeptide, polynucleotide, agonist, or antagonist of the present invention may be also 

be employed to prevent skin aging due to sunburn by stimulating keratinocyte growth. 

A polypeptide, polynucleotide, agonist, or antagonist of the present invention may also be 
employed for preventing hair loss, since FGF family members activate hair-forming cells and 
promotes melanocyte growth. Along the same lines, a polypeptide, polynucleotide, agonist, or 
25 antagonist of the present invention may be employed to stimulate growth and differentiation of 
hematopoietic cells and bone marrow cells when used in combination with other cytokines. 

A polypeptide, polynucleotide, agonist, or antagonist of the present invention may also be 
employed to maintain organs before transplantation or for supporting cell culture of primary 
tissues. A polypeptide, polynucleotide, agonist, or antagonist of the present invention may also be 
30 employed for inducing tissue of mesodermal origin to differentiate in early embryos. 

A polypeptide, polynucleotide, agonist, or antagonist of the present invention may also 
increase or decrease the differentiation or proliferation of embryonic stem cells, besides, as 
discussed above, hematopoietic lineage. 

A polypeptide, polynucleotide, agonist, or antagonist of the present invention may also be 
35 used to modulate mammalian characteristics, such as body height, weight, hair color, eye color, 
skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic surgery). 
Similarly, a polypeptide, polynucleotide, agonist, or antagonist of the present invention may be 

632 



WO 02/076488 PCT/US02/08276 

used to modulate mammalian metabolism affecting catabolism, anabolism, processing, utilization, 
and storage of energy. 

A polypeptide, polynucleotide, agonist, or antagonist of the present invention may be used 
to change a mammal's mental state or physical state by influencing biorhythms, caricadic rhythms, 
5 depression (including depressive disorders), tendency for violence, tolerance for pain, reproductive 
capabilities (preferably by Activin or Inhibin-like activity), hormonal or endocrine levels, appetite, 
libido, memory, stress, or other cognitive qualities. 

A polypeptide, polynucleotide, agonist, or antagonist of the present invention may also be 
used as a food additive or preservative, such as to increase or decrease storage capabilities, fat 
10 content, lipid, protein, carbohydrate, vitamins, minerals, cofactors or other nutritional components. 

The above-recited applications have uses in a wide variety of hosts. Such hosts include, 
but are not limited to, human, murine, rabbit, goat, guinea pig, camel, horse, mouse, rat, hamster, 
pig, micro-pig, chicken, goat, cow, sheep, dog, cat, non-human primate, and human. In specific 
embodiments, the host is a mouse, rabbit, goat, guinea pig, chicken, rat, hamster, pig, sheep, dog 
15 or cat. la preferred embodiments, the host is a mammal. In most preferred embodiments, the host 
is a human. 

Other Preferred Embodiments 

Other preferred embodiments of the claimed invention include an isolated nucleic acid 

20 molecule comprising a nucleotide sequence which is at least 95% identical to a sequence of at least 
about 50 contiguous nucleotides in the nucleotide sequence of SEQ ID NO:X or the 
complementary strand thereto, the nucleotide sequence as defined in Table IB or columns 8 and 9 
of Table 2 or the complementary strand thereto, and/or cDNA contained in ATCC Deposit No:Z. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous nucleotides 

25 is included in the nucleotide sequence of the portion of SEQ ID NO:X as defined in column 5, 
"ORF (From-To)", in Table IB.l. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous nucleotides 
is included in the nucleotide sequence of the portion of SEQ ID NO:X as defined in columns 8 and 
9, "NT From" and "NT To" respectively, in Table 2. 

30 Also preferred is an isolated nucleic acid molecule comprising a nucleotide sequence 

which is at least 95% identical to a sequence of at least about 150 contiguous nucleotides in the 
nucleotide sequence of SEQ ID NO:X or the complementary strand thereto, the nucleotide 
sequence as defined in Table IB or columns 8 and 9 of Table 2 or the complementary strand 
thereto, and/or cDNA contained in ATCC Deposit No:Z. 

35 Further preferred is an isolated nucleic acid molecule comprising a nucleotide sequence 

which is at least 95% identical to a sequence of at least about 500 contiguous nucleotides in the 
nucleotide sequence of SEQ ID NO:X or the complementary strand thereto, the nucleotide 
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sequence as defined in Table IB or columns 8 and 9 of Table 2 or the complementary strand 
thereto, and/or cDNA contained in ATCC Deposit No:Z. 

A further preferred embodiment is a nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to the nucleotide sequence of the portion of SEQ ID 
5 NO:X defined in column 5, "ORF (From-To)'\ in Table 1B.1. 

A further preferred embodiment is a nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to the nucleotide sequence of the portion of SEQ ID 
NO:X defined in columns 8 and 9, "NT From" and "NT To", respectively, in Table 2. 

A further preferred embodiment is an isolated nucleic acid molecule comprising a 
10 nucleotide sequence which is at least 95% identical to the complete nucleotide sequence of SEQ 
ID NO:X or the complementary strand thereto, the nucleotide sequence as defined in column 5 of 
Table 1B.1 or columns 8 and 9 of Table 2 or the complementary strand thereto, and/or cDNA 
contained in ATCC Deposit No:Z. 

Also preferred is an isolated nucleic acid molecule which hybridizes under stringent 
15 hybridization conditions to a nucleic acid molecule comprising a nucleotide sequence of SEQ ID 
NO:X or the complementary strand thereto, the nucleotide sequence as defined in column 5 of 
Table 1B.1 or columns 8 and 9 of Table 2 or the complementary strand thereto, and/or cDNA 
contained in ATCC Deposit No:Z, wherein said nucleic acid molecule which hybridizes does not 
hybridize under stringent hybridization conditions to a nucleic acid molecule having a nucleotide 
20 sequence consisting of only A residues or of only T residues. 

Also preferred is a composition of matter comprising a DNA molecule which comprises 
the cDNA contained in ATCC Deposit No:Z. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide sequence 
which is at least 95% identical to a sequence of at least 50 contiguous nucleotides of the cDNA 
25 sequence contained in ATCC Deposit No:Z. 

Also preferred is an isolated nucleic acid molecule, wherein said sequence of at least 50 
contiguous nucleotides is included in the nucleotide sequence of an open reading frame sequence 
encoded by cDNA contained in ATCC Deposit No:Z. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide sequence 
30 which is at least 95% identical to sequence of at least 150 contiguous nucleotides in the nucleotide 
sequence encoded by cDNA contained in ATCC Deposit No:Z. 

A further preferred embodiment is an isolated nucleic acid molecule comprising a 
nucleotide sequence which is at least 95% identical to sequence of at least 500 contiguous 
nucleotides in the nucleotide sequence encoded by cDNA contained in ATCC Deposit No:Z 
35 A further preferred embodiment is an isolated nucleic acid molecule comprising a 

nucleotide sequence which is at least 95% identical to the complete nucleotide sequence encoded 
by cDNA contained in ATCC Deposit No:Z. 
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A further preferred embodiment is a method for detecting in a biological sample a nucleic 
acid molecule comprising a nucleotide sequence which is at least 95% identical to a sequence of at 
least 50 contiguous nucleotides in a sequence selected from the group consisting of: a nucleotide 
sequence of SEQ ID NO:X or the complementary strand thereto; the nucleotide sequence as 
5 defined in column 5 of Table 1B.1 or columns 8 and 9 of Table 2 or the complementary strand 
thereto; and a nucleotide sequence encoded by cDNA contained in ATCC Deposit No:Z; which 
method comprises a step of comparing a nucleotide sequence of at least one nucleic acid molecule 
in said sample with a sequence selected from said group and determining whether the sequence of 
said nucleic acid molecule in said sample is at least 95% identical to said selected sequence. 

10 Also preferred is the above method wherein said step of comparing sequences comprises 

determining the extent of nucleic acid hybridization between nucleic acid molecules in said sample 
and a nucleic acid molecule comprising said sequence selected from said group. Similarly, also 
preferred is the above method wherein said step of comparing sequences is performed by 
comparing the nucleotide sequence determined from a nucleic acid molecule in said sample with 

15 said sequence selected from said group. The nucleic acid molecules can comprise DNA molecules 
or RNA molecules. 

A further preferred embodiment is a method for identifying the species, tissue or cell type 
of a biological sample which method comprises a step of detecting nucleic acid molecules in said 
sample, if any, comprising a nucleotide sequence that is at least 95% identical to a sequence of at 

20 least 50 contiguous nucleotides in a sequence selected from the group consisting of: a nucleotide 
sequence of SEQ ID NO:X or the complementary strand thereto; the nucleotide sequence as 
defined in column 5 of Table 1B.1 or columns 8 and 9 of Table 2 or the complementary strand 
thereto; and a nucleotide sequence of the cDNA contained in ATCC Deposit No:Z. 

The method for identifying the species, tissue or cell type of a biological sample can 

25 comprise a step of detecting nucleic acid molecules comprising a nucleotide sequence in a panel of 
at least two nucleotide sequences, wherein at least one sequence in said panel is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from said 
group. 

Also preferred is a method for diagnosing in a subject a pathological condition associated 
30 with abnormal structure or expression of a nucleotide sequence of SEQ ID NO:X or the 
complementary strand thereto; the nucleotide sequence as defined in column 5 of Table 1B.1 or 
columns 8 and 9 of Table 2 or the complementary strand thereto; or the cDNA contained in ATCC 
Deposit No:Z which encodes a protein, wherein the method comprises a step of detecting in a 
biological sample obtained from said subject nucleic acid molecules, if any, comprising a 
35 nucleotide sequence that is at least 95% identical to a sequence of at least 50 contiguous 
nucleotides in a sequence selected from the group consisting of: a nucleotide sequence of SEQ ID 
NO:X or the complementary strand thereto; the nucleotide sequence as defined in column 5 of 
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Table 1B.1 or columns 8 and 9 of Table 2 or the complementary strand thereto; and a nucleotide 

sequence of cDNA contained in ATCC Deposit No:Z. 

The method for diagnosing a pathological condition can comprise a step of detecting 

nucleic acid molecules comprising a nucleotide sequence in a panel of at least two nucleotide 
5 sequences, wherein at least one sequence in said panel is at least 95% identical to a sequence of at 

least 50 contiguous nucleotides in a sequence selected from said group. 

Also preferred is a composition of matter comprising isolated nucleic acid molecules 

wherein the nucleotide sequences of said nucleic acid molecules comprise a panel of at least two 

nucleotide sequences, wherein at least one sequence in said panel is at least 95% identical to a 
10 sequence of at least 50 contiguous nucleotides in a sequence selected from the group consisting of: 

a nucleotide sequence of SEQ ID NO:X or the complementary strand thereto; the nucleotide 

sequence as defined in column 5 of Table 1B.1 or columns 8 and 9 of Table 2 or the 

complementary strand thereto; and a nucleotide sequence encoded by cDNA contained in ATCC 

Deposit No:Z. The nucleic acid molecules can comprise DNA molecules or RNA molecules. 
15 Also preferred is a composition of matter comprising isolated nucleic acid molecules 

wherein the nucleotide sequences of said nucleic acid molecules comprise a DNA microarray or 

"chip" of at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 40, 50, 100, 150, 200, 250, 300, 500, 

1000, 2000, 3000, or 4000 nucleotide sequences, wherein at least one sequence in said DNA 

microarray or "chip" is at least 95% identical to a sequence of at least 50 contiguous nucleotides in 
20 a sequence selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein 

X is any integer as defined in Table 1A and/or IB; and a nucleotide sequence encoded by a human 

cDNA clone identified by a cDNA "Clone ID" in Table 1A and/or IB. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at least 90% 

identical to a sequence of at least about 10 contiguous amino acids in the polypeptide sequence of 
25 SEQ ID NO: Y; a polypeptide encoded by SEQ ID NO:X or the complementary strand thereto; the 

polypeptide encoded by the nucleotide sequence as defined in columns 8 and 9 of Table 2; and/or 

a polypeptide encoded by cDNA contained in ATCC Deposit No:Z. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at least 95% 

identical to a sequence of at least about 30 contiguous amino acids in the amino acid sequence of 
30 SEQ ID NO:Y; a polypeptide encoded by SEQ ID NO:X or the complementary strand thereto; the 

polypeptide encoded by the nucleotide sequence as defined in columns 8 and 9 of Table 2; and/or 

a polypeptide encoded by cDNA contained in ATCC Deposit No:Z. 

Further preferred is an isolated polypeptide comprising an amino acid sequence at least 

95% identical to a sequence of at least about 100 contiguous amino acids in the amino acid 
35 sequence of SEQ ID NO:Y; a polypeptide encoded by SEQ ID NO:X or the complementary strand 

thereto; the polypeptide encoded by the nucleotide sequence as defined in columns 8 and 9 of 

Table 2; and/or a polypeptide encoded by cDNA contained in ATCC Deposit No:Z. 
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Further preferred is an isolated polypeptide comprising an amino acid sequence at least 
95% identical to the complete amino acid sequence of SEQ ID NO:Y; a polypeptide encoded by 
SEQ ID NO:X or the complementary strand thereto; the polypeptide encoded by the nucleotide 
sequence as defined in columns 8 and 9 of Table 2; and/or a polypeptide encoded by cDNA 
5 contained in ATCC Deposit No:Z. 

Further preferred is an isolated polypeptide comprising an amino acid sequence at least 
90% identical to a sequence of at least about 10 contiguous amino acids in the complete amino 
acid sequence of a polypeptide encoded by contained in ATCC Deposit No:Z 

Also preferred is a polypeptide wherein said sequence of contiguous amino acids is 
10 included in the amino acid sequence of a portion of said polypeptide encoded by cDNA contained 
in ATCC Deposit No:Z; a polypeptide encoded by SEQ ID NO:X or the complementary strand 
thereto; the polypeptide encoded by the nucleotide sequence as defined in columns 8, and 9 of 
Table 2; and/or the polypeptide sequence of SEQ ID NO: Y. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at least 95% 
15 identical to a sequence of at least about 30 contiguous amino acids in the amino acid sequence of a 
polypeptide encoded by the cDNA contained in ATCC Deposit No:Z. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at least 95% 
identical to a sequence of at least about 100 contiguous amino acids in the amino acid sequence of 
a polypeptide encoded by cDNA contained in ATCC Deposit No:Z. 
20 Also preferred is an isolated polypeptide comprising an amino acid sequence at least 95% 

identical to the amino acid sequence of a polypeptide encoded by the cDNA contained in ATCC 
Deposit No:Z. 

Further preferred is an isolated antibody which binds specifically to a polypeptide 
comprising an amino acid sequence that is at least 90% identical to a sequence of at least 10 

25 contiguous amino acids in a sequence selected from the group consisting of: a polypeptide 
sequence of SEQ ID NO:Y; a polypeptide encoded by SEQ ID NO:X or the complementary strand 
thereto; the polypeptide encoded by the nucleotide sequence as defined in columns 8 and 9 of 
Table 2; and a polypeptide encoded by the cDNA contained in ATCC Deposit No:Z. 

Further preferred is a method for detecting in a biological sample a polypeptide 

30 comprising an amino acid sequence which is at least 90% identical to a sequence of at least 10 
contiguous amino acids in a sequence selected from the group consisting of: a polypeptide 
sequence of SEQ ID NO:Y; a polypeptide encoded by SEQ ID NO:X or the complementary strand 
thereto; the polypeptide encoded by the nucleotide sequence as defined in columns 8 and 9 of 
Table 2; and a polypeptide encoded by the cDNA contained in ATCC Deposit No:Z; which 

35 method comprises a step of comparing an amino acid sequence of at least one polypeptide 
molecule in said sample with a sequence selected from said group and determining whether the 
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sequence of said polypeptide molecule in said sample is at least 90% identical to said sequence of 
at least 10 contiguous amino acids. 

Also preferred is the above method wherein said step of comparing an amino acid 
sequence of at least one polypeptide molecule in said sample with a sequence selected from said 
5 group comprises determining the extent of specific binding of polypeptides in said sample to an 
antibody which binds specifically to a polypeptide comprising an amino acid sequence that is at 
least 90% identical to a sequence of at least 10 contiguous amino acids in a sequence selected from 
the group consisting of: a polypeptide sequence of SEQ ID NO: Y; a polypeptide encoded by SEQ 
ID NO:X or the complementary strand thereto; the polypeptide encoded by the nucleotide 

10 sequence as defined in columns 8 and 9 of Table 2; and a polypeptide encoded by the cDNA 
contained in ATCC Deposit No:Z. 

Also preferred is the above method wherein said step of comparing sequences is 
performed by comparing the amino acid sequence determined from a polypeptide molecule in said 
sample with said sequence selected from said group. 

15 Also preferred is a method for identifying the species, tissue or cell type of a biological 

sample which method comprises a step of detecting polypeptide molecules in said sample, if any, 
comprising an amino acid sequence that is at least 90% identical to a sequence of at least 10 
contiguous amino acids in a sequence selected from the group consisting of: polypeptide sequence 
of SEQ ID NO: Y; a polypeptide encoded by SEQ ID NO:X or the complementary strand thereto; 

20 the polypeptide encoded by the nucleotide sequence as defined in columns 8 and 9 of Table 2; and 
a polypeptide encoded by the cDNA contained in ATCC Deposit No:Z. 

Also preferred is the above method for identifying the species, tissue or cell type of a 
biological sample, which method comprises a step of detecting polypeptide molecules comprising 
an amino acid sequence in a panel of at least two amino acid sequences, wherein at least one 

25 sequence in said panel is at least 90% identical to a sequence of at least 10 contiguous amino acids 
in a sequence selected from the above group. 

Also preferred is a method for diagnosing in a subject a pathological condition associated 
with abnormal structure or expression of a nucleic acid sequence identified in Table 1A, IB or 
Table 2 encoding a polypeptide, which method comprises a step of detecting in a biological 

30 sample obtained from said subject polypeptide molecules comprising an amino acid sequence in a 
panel of at least two amino acid sequences, wherein at least one sequence in said panel is at least 
90% identical to a sequence of at least 10 contiguous amino acids in a sequence selected from the 
group consisting of: polypeptide sequence of SEQ ID NO:Y; a polypeptide encoded by SEQ ID 
NO:X or the complementary strand thereto; the polypeptide encoded by the nucleotide sequence as 

35 defined in columns 8 and 9 of Table 2; and a polypeptide encoded by the cDNA contained in 
ATCC Deposit No:Z. 
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In any of these methods, the step of detecting said polypeptide molecules includes using 
an antibody. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide sequence 
which is at least 95% identical to a nucleotide sequence encoding a polypeptide wherein said 
5 polypeptide comprises an amino acid sequence that is at least 90% identical to a sequence of at 
least 10 contiguous amino acids in a sequence selected from the group consisting of: polypeptide 
sequence of SEQ ID NO: Y; a polypeptide encoded by SEQ ID NO:X or the complementary strand 
thereto; the polypeptide encoded by the nucleotide sequence as defined in columns 8 and 9 of 
Table 2; and a polypeptide encoded by the cDNA contained in ATCC Deposit No:Z. 
10 • Also preferred is an isolated nucleic acid molecule, wherein said nucleotide sequence 

encoding a polypeptide has been optimized for expression of said polypeptide in a prokaryotic 
host. 

Also preferred is a polypeptide molecule, wherein said polypeptide comprises an amino 

acid sequence selected from the group consisting of: polypeptide sequence of SEQ ID NO:Y; a 
15 polypeptide encoded by SEQ ID NO:X or the complementary strand thereto; the polypeptide 

encoded by the nucleotide sequence as defined in columns 8 and 9 of Table 2; and a polypeptide 

encoded by the cDNA contained in ATCC Deposit No:Z. 

Further preferred is a method of making a recombinant vector comprising inserting any of 

the above isolated nucleic acid molecule into a vector. Also preferred is the recombinant vector 
20 produced by this method. Also preferred is a method of making a recombinant host cell 

comprising introducing the vector into a host cell, as well as the recombinant host cell produced by 

this method. 

Also preferred is a method of making an isolated polypeptide comprising culturing this 
recombinant host cell under conditions such that said polypeptide is expressed and recovering said 

25 polypeptide. Also preferred is this method of making an isolated polypeptide, wherein said 
recombinant host cell is a eukaryotic cell and said polypeptide is a human protein comprising an 
amino acid sequence selected from the group consisting of: polypeptide sequence of SEQ ID 
NO:Y; a polypeptide encoded by SEQ ID NO:X or the complementary strand thereto; the 
polypeptide encoded by the nucleotide sequence as defined in columns 8 and 9 of Table 2; and a 

30 polypeptide encoded by the cDNA contained in ATCC Deposit No:Z. The isolated polypeptide 
produced by this method is also preferred. 

Also preferred is a method of treatment of an individual in need of an increased level of a 
protein activity, which method comprises administering to such an individual a Therapeutic 
comprising an amount of an isolated polypeptide, polynucleotide, immunogenic fragment or 

35 analogue thereof, binding agent, antibody, or antigen binding fragment of the claimed invention 
effective to increase the level of said protein activity in said individual. 



639 



WO 02/076488 PCT/US02/08276 

Also preferred is a method of treatment of an individual in need of a decreased level of a 
protein activity, which method comprised administering to such an individual a Therapeutic 
comprising an amount of an isolated polypeptide, polynucleotide, immunogenic fragment or 
analogue thereof, binding agent, antibody, or antigen binding fragment of the claimed invention 
5 effective to'decrease the level of said protein activity in said individual. 

Also preferred is a method of treatment of an individual in need of a specific delivery of 
toxic compositions to diseased cells (e.g., tumors, leukemias or lymphomas), which method 
comprises administering to such an individual a Therapeutic comprising an amount of an isolated 
polypeptide of the invention, including, but not limited to a binding agent, or antibody of the 
10 claimed invention that are associated with toxin or cytotoxic prodrugs. 

Having generally described the invention, the same will be more readily understood by 
reference to the following examples, which are provided by way of illustration and are not 
intended as limiting. 

15 Description of Table 6 

Table 6 summarizes some of the ATCC Deposits, Deposit dates, and ATCC designation 
numbers of deposits made with the ATCC in connection with the present application. These 
deposits were made in addition to those described in the Table 1 A. 

20 Table 6 



ATCC Deposits 


Deposit Date 


ATCC Designation Number 


LP01, LP02, LP03, LP04, 
LP05, LP06, LP07, LP08, 
LP09,LP10,LP11, 


May-20-97 


209059, 209060, 209061, 209062, 209063, 
209064, 209065, 209066, 209067, 209068, 
209069 


LP 12 


Jan-12-98 


209579 


LP13 


Jan-12-98 


209578 


LP14 


Jul-16-98 


203067 


LP15 


Jul-16-98 


203068 


LP16 


Feb-1-99 


203609 


LP17 


Feb-1-99 


203610 


LP20 


Nov-17-98 


203485 


LP21 


Jun-18-99 


PTA-252 


LP22 


Jun-18-99 


PTA-253 


LP23 


Dec-22-99 


PTA-1081 
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Examples 

Example 1: Isolation of a Selected cDNA Clone From the Deposited Sample 

5 Each ATCC Deposit No:Z is contained in a plasmid vector. Table 7 identifies the vectors 

used to construct the cDNA library from which each clone was isolated. In many cases, the vector 
used to construct the library is a phage vector from which a plasmid has been excised. The 
following correlates the related plasmid for each phage vector used in constructing the cDNA 
library. For example, where a particular clone is identified in Table 7 as being isolated in the 
10 vector "Lambda Zap," the corresponding deposited clone is in "pBluescript" 

Vector Used to Construct Library Corresponding Deposited Plasmid 
Lambda Zap pBluescript (pBS) 

Uni-Zap XR pB luescript (pBS) 

Zap Express pBK 
1 5 lafmid B A plafmid B A 

pSportl pSportl 
pCMVSport 2.0 pCMVSport 2.0 

pCMVSport 3.0 pCMVSport 3.0 

pCR®2.1 pCR®2.1 
20 Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uhi-Zap XR (U.S. 

Patent Nos. 5,128, 256 and 5,286,636), Zap Express (U.S. Patent Nos. 5,128,256 and 5,286,636), 
pBluescript (pBS) (Short, J. M. et al., Nucleic Acids Res. 16:7583-7600 (1988); Alting-Mees, M. 
A. and Short, J. M., Nucleic Acids Res. 17:9494 (1989)) and pBK (Alting-Mees, M. A. et al., 
Strategies 5:58-61 (1992)) are commercially available from Stratagene Cloning Systems, Inc., 
25 1 101 1 N. Torrey Pines Road, La Jolla, CA, 92037. pBS contains an ampicillin resistance gene and 
pBK contains a neomycin resistance gene. Both can be transformed into E. coli strain XL-1 Blue, 
also available from Stratagene. pBS comes in 4 forms SK+, SK-, KS+ and KS. The S and K 
refers to the orientation of the polylinker to the T7 and T3 primer sequences which flank the 
polylinker region ("S" is for Sad and "K" is for Kpnl which are the first sites on each respective 
30 end of the linker). "+ M or "-" refer to the orientation of the fl origin of replication ("ori"), such that 
in one orientation, single stranded rescue initiated from the f 1 ori generates sense strand DNA and 
in the other, antisense. 

Vectors pSportl, pCMVSport 2.0 and pCMVSport 3.0, were obtained from Life 
Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors contain an 
35 ampicillin resistance gene and may be transformed into R coli strain DH10B, also available from 
Life Technologies. (See, few instance, Gruber, C. E., et al., Focus 15:59 (1993)). Vector lafmid 
BA (Bento Soares, Columbia University, NY) contains an ampicillin resistance gene and can be 
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transformed into E. coli strain XL-1 Blue. Vector pCR 2.1, which is available from Invitrogen, 
1600 Faraday Avenue, Carlsbad, CA 92008, contains an ampicillin resistance gene and may be 
transformed into E. coli strain DH10B, available from Life Technologies. (See, for instance, 
Clark, J. M., Nuc. Acids Res. 16:9677-9686 (1988) and Mead, D. et al., Bio/Technology 9: 
(1991)). Preferably, a polynucleotide of the present invention does not comprise the phage vector 
sequences identified for the particular clone in Table 7, as well as the corresponding plasmid 
vector sequences designated above. 

The deposited material in the sample assigned the ATCC Deposit Number cited by 
reference to Table 1 A, Table 2, Table 6 and Table 7 for any given cDNA clone also may contain 
one or more additional plasmids, each comprising a cDNA clone different from that given clone. 
Thus, deposits sharing the same ATCC Deposit Number contain at least a plasmid for each ATCC 
Deposit No:Z. 

TABLE 7 



Libraries owned by Catalog 


Catalog Description 


Vector 


ATCC 
Deposit 


HUKAHUKB HUKCHUKD 


Human Utenne Cancer 


Lambda ZAP II 


LP01 


TTT TT^T"? TTT Tl/T? TTT n/ri 

HUKE HUKF I1UKG 








HCNA HCNB 


Human Colon 


Lambda Zap II 


LP01 


HFFA 


Human Fetal Brain, random 
pnmed 


Lambda Zap II 


LP01 


HTWA 


Resting T-Cell 


Lambda ZAP H 


LP01 


HBQA 


Early Stage Human Brain, 
random primed 


Lambda ZAP II 


LP01 


HLMB HLMF IILMG HLMH 


breast lymph node CDNA 


Lambda ZAP H 


LP01 


HLMI HLMJ HLMM HLMN 


library 






HCQA HCQB 


human colon cancer 


LamdaZAPII 


LP01 


HMEA HMEC HMED HMEE 


Human Microvascular 


Lambda ZAP II 


LP01 


HMEF HMEG HMEI HMEJ 


Endothelial Cells, tract A 






HMEKHMEL 








HUSA HUSC 


Human Umbilical Vein 
Endothelial Cells, tract A 


Lambda ZAP 0 


LP01 


HLQA HLQB 


Hepatocellular Tumor 


Lambda ZAP II 


LP01 


HHGA HHGB HHGC HHGD 


Hemangiopericytoma 


Lambda ZAP H 


LP01 


HSDM 


Human Striatum Depression, re- 
rescue 


Lambda ZAP H 


LP01 


HUSH 


H Umbilical Vein Endothelial 
Cells, frac A, re-excision 


TambdaZAPII 


LP01 




HSGS 


Salivary gland, subtracted 


Lambda ZAP II 


LP01 


HFXA HFXB HFXC HFXD 
HFXE HFXF HFXG HFXH 


Brain frontal cortex 


TambdaZAPn 


LP01 




HPQA HPQB HPQC 


PERMTF274 


Lambda ZAP n 


LP01 


HFXJHFXK 


Brain Frontal Cortex, re-excision 


Lambda ZAP H 


LP01 


HCWA HCWB HCWC HCWD 


CD34 positive cells (Cord 


ZAP Express 


LP02 


HCWE HCWF HCWG HCWH 


Blood) 






HCWI HCWJ HCWK 








HCUA HCUB HCUC 


CD34 depleted Buffy Coat 


ZAP Express 


LP02 
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Libraries owned bv Catalog 


Catalog Descriotion 


Vector 


Denosit 




(Cord Blood) 






HRSM 


A- 14 cell line 


ZAP ExTvre^<i 

* M AA AmVI/AvOu 


T PO?. 


HRSA 


A 1 -CELL LINE 


ZAP Exrjres*; 


T PO? 


HCUD HCUE HCUF HCUG 
HCUHHCUI 


CD34 depleted Buffy Coat 
fCord Blood j. re-excision 


ZAP Express 


LP02 


HBXE HBXF HBXG 


H. Whole Brain #2, re-excision 


ZAP Exnress 


LP02 


HRLM 


L8 cell line 


ZAP ExDress 




HBXA HBXB HBXC HBXD 


Human Whole Brain #2 - Oligo 
dT > 1 5Kb 


ZAP Express 


LP02 


HUD A HUDB HUDC 


Testes 


ZAP Express 


LP02 


UUTU tTHTNT HWTn 


xi- nypoujaiainus, rrac A>re- 
excision 


nxpress 


LrUZ i 




xi. nypoinaianius, irac /v 


zj\r nxpress 




HASA HASD 


Human Adult Spleen 


Uni-ZAP XR 


LP03 


HFKC HFKD HFKE HFKF 
HrJsXj 


Human Fetal Kidney 


Uni-ZAP XR 


LP03 


HE8A HE8B HE8C HE8D 

TJTCOT7 TTCOt? TTT70X X TTROXT 

HE8E HE8F HE8M HE8N 


Human 8 Week Whole Embryo 


Uni-ZAP XR 


LP03 


HlxBA HCjxjD HCjBE HCjJBF 

up.Rn TTHRW unRT 
UXJDKJ XlOJOXl XlUDl 


Human Gall Bladder 


Uni-ZAP XR 


LP03 


HLHA HLHB HLHC HLHD 

HLHE HLHF HLHG HLHH 
HT met 


Human Fetal Lung IE 


Uni-ZAP XR 


LP03 


HPMA HPMB HPMC HPMD 

WPMT7 T-TPAyTP TTPX/Tf^ UP\4TT 


Human Placenta 


Uni-ZAP XR 


LP03 


HPRA HPRB HPRC HPRD 


Human Prostate 


Uni-ZAP XR 


LP03 


ITOTA Uptp ITOTpv TJCTC 

HolA HolC Ho ID Holb 


Human Adult Small Intestine 


T T * A TT X/Tl 

Uni-ZAP XR. 


LP03 


HlhA HIEB HI EC Hi ED 
HTEE HTEF HTEG HTEH 
HTE1 HTEJ HTEK 


TTT r i v a 

Human Testes 


Uni-ZAP XR 


LP03 


H1PA HTPB HTPC HTPD 
HTPE 


TT T% m 

Human Pancreas Tumor 


Um-ZAPXR 


LP03 


HTTA H'l l'b JHTI C W1TJJ 
HTTE H1TJK 


TT r-r-» . rT' 

Human Testes Tumor 


Um-ZAPXR 


LP03 


TTi HA TTA TVPfc T T A TW1 TT A TV* M 

HAP A HAPB HAPC HAPM 


Human Adult Pulmonary 


Um-ZAPXR 


LP03 


HETA HETB HEl'C HbTD 
HETE HETF HETG HHl'H 

TTTJTT 

HETI 


Human Endometrial Tumor 


Uni-ZAP XR 


LP03 


HHFB HHFCHHFD HHFE 

T'l 1 1 1 'I > T IT 1 I" V~* T TT II 'IT TTT 1 ■ »1 

HHFF HHFG HHFH HHFI 


Human Fetal Heart 


Uni-ZAP XR 


LP03 


TTTTTJT) TTTTTV/^ TTTTDTN TTT TTYT? 

HHP B HHPC HHPD HHPE 
HHPFHHPGHHPH 


T T TT? . 

Human Hippocampus 


T T * T A 1% VT> 

Um-ZAPXR 


LP03 


upFl xjppo XJrTR^ U/TA 
xiv-JCri rxK^EZ, nLDj xiv-Jjrf 

HCE5 HCEB HCEC HCED 
HCEE HCEF HCEG 


xiuman L^ereoenum 


T T • 7AD YP 
UHl-ZAr AJ\. 




HUVB HUVC HUVD HUVE 


Human Umbilical Vein, Endo. 
remake 


Uni-ZAP XR 


LP03 


HSTA HSTB HSTC HSTD 


Human Skin Tumor 


Uni-ZAP XR 


LP03 


HTAA HTAB HTAC HTAD 
HTAE 


Human Activated T-Cells 


Uni-ZAP XR 


LP03 
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libraries owned by Catalog 


Catalog Description 


Vector 


ATCC 


HWKA HFfiB HHHC I 

X XX JJfl 111 J >J * JL XX JL^V^ 


Human Fetal Epithelium fSkin^ 


Uni-ZAP XR 


LP03 


HJPA HJPB HJPC HJPD 


HUMAN JURK AT 
MFMBR ANE BOUND 

yXX-j-iyXXJ X\JrxX 'IJL-/ XJ \J Ui 1 Xs 

POLYSOMES 


Uni-ZAP XR 


LP03 


HESA 


Human epithelioid sarcoma 


Uni-Zap XR 


LP03 


HLTA HLTB HLTC HLTD 

XXX~tX jTV IJLLj A JL* A ,1 i < X Vx XXX-i X XJ 

HLTEHLTF 


TTiimflTi T-C!p11 I -vmnhoma 

AAUUlAli 1 V_^/1J J XliXJ KXyJXlXiX 


Uni-ZAP XR 

\J 111 ZjTU <t VXV 


LP03 

1— »1 


All 1 A 111 A XJ AAA A AAA X xJ 


Wnman Fptal Dnra A/Tatpr 

AAlAlAuUl A \slt\X XJ Hid lyAdLt^l 


Uni-ZAP XR 

\J All £uTxA YV1 v 


LP03 

AwA V*/ 


HRDA HRDB FTRDO HRDD 

AAAVA-/JT. 11TVL/A-/ AAAVL-'Vx AAXVLAL/ 

HRDEHRDF 


Human RT^lvlnTTivncarrrvrna 

AALtillu.il AvllClLAlL/llljf LJCxll \^\J11L(X 


Uni-ZAP XR 

yj in z^ru /vl\ 


LP03 

l—rl \J -J 


HCAA HCAB HCAC 


Cem cells cyclohexamide treated 


Uni-ZAP XR 


LP03 


tti» n A TJTD fiD XXD HP TXD 

ilKurA ItKIjJd JtlKLiv^ rlKijrU 


Kaji ceus, cycionexamiGe 
treated 


T Tn i 7AP YD 




XJCTTA XJCTTD XJCTT/^' UCT TAA 

JtioUA jloUrv Jrioul^ rloUiVl 


oirpt ceils, cycionexamiae 

trt>c*tf>r\ 
UCaLCU 


tt,,: 7ADYP 


T XXYX 


HT4A HT4C HT4D 

111 "A 111 tV^ All tl/ 


jtxl» 11 v aiLU x v-oiio, i £t ill o. 


Uni-ZAP XR 

Will t-ixXX. jTVAV 


LP03 


HE9A HE9B HE9C HE9D 

HFQF TTFQF HFOH HFQH 
xxx-*yx-> xxxZiyx x \i jZ/kj a JUJ..7A a 

HE9M HE9N 

JL ULlylT JL JL JJ-i ^ JL * 


Nine Week Old Early Stage 

AALlillaLI 


Uni-ZAP XR 

• 


LP03 


HATA HATB HATC HATD 

A. A_4 X JL A JL 1 Ai JL JL JL^ JL JU. A JL JL A.J. JL A—*' 

HATE 


Human Adrenal Gland Tumor 


Uni-ZAP XR 


LP03 


HT5A 


Activated T-Cells, 24 hrs. 


Uni-ZAP XR 


LP03 


HFGA HFGM 


Human Fetal Brain 


Uni-ZAP XR 


LP03 


WNTFA TTNTP'R HNFP HNPH 

fllNAJ//\ IAINAjO AA±N.CfV^ AAiNlJrJLJ' 

HNEE 


jnUillaU INCULIUpAAli , 


vj ill / -»rvr yvix. 


T PO^ 

Li VJ 


AAA>VJJD XlXj \JxJ 


HiimQTi Pmrmri; Drpact i QTif*pr 
XTUlIIali rrilllaiy DlCdoL v>illlL/CI 


TTni-7AP XR 


T PO^ 


HBNAHBNB 


Human Normal Breast 


Uni-ZAP XR 


LP03 


OP A O 


Cem Cells, cyclohexamide 
treated, subtra 


T T_ : 7AD VD 

Um-ZAr AK 


LrUJ 


Y TT TOO 

HHro 


Human Hippocampus, 
subtracted 


pb6 


T DAQ 

LrUJ 


HKCS HKCU 


Human Colon Cancer, 
subtracted 


pBc> 


LF03 


HRGS 


Raji cells, cycionexamiae 
ueatea, suDrractea 


pBS 


LrPUJ 


HSUT 


Supt cells, cyclohexamide 
ireaiea, airrerenoaiiy expressea 


pBS 


LP03 


HT4S 


Activated T-Cells, 12 hrs, 

r* 11 Kirn r+t r± r\ 

suDrraciea 


Uni-ZAP XR 


LP03 


hpda wrTYR umr Hrrvn 

xl\^xJr\ Xx\^xJxj flLL/L. lIS^UU 

HCDE 


nUIIlaJl v^ilUilLU UoaiL'UJlia 


TTni-7AP ICR 
LJlll / *r\x AI\ 


T P03 

I -«I L/ J 


TJfi A A WO A R UHAP 
JtlVJ/vrt. rx\JJ\I> xi\J/\k_s 


AiUiiuin vysieusdrcunid 


T Tni-7 AP XR 


T P03 


HTLA HTLB HTLC HTLD 

A. -A. -A. JL-rf A JL X JL JL 1 JM F JL JL JL 1 J V JL JL JL l.JIV 

HTLEHTLF 


Hurnan adult testis, laree inserts 


Uni-ZAP XR 


LP03 


HLMA HLMC HLMD 


Breast Lymph node cDNA 
library 


Uni-ZAP XR 


LP03 


H6E A H6EB H6EC 


HL-60, PMA 4H 


Uni-ZAP XR 


LP03 


HTXA HTXB HTXC HTXD 
HTXE HTXF HTXG HTXH 


Activated T-Cell 
(12hs)/Thiouridine labelledEco 


Uni-ZAP XR 


LP03 


HNFAHNFB HNFC HNFD 


Human Neutrophil, Activated 


Uni-ZAP XR 


LP03 
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Libranes owned by Catalog 


Catalog Description 


Vector 


ATCC 
Deposit 


HNFE HNEF HNFG HNFH 
HNFJ 








HTOB HTOC 


HUMAN TONSILS, 
i^RACTlUN Z 


Uni-ZAP XR 


T TWO 

LP03 


iUVLux> 


luman UJt> jviooj coniroi 
Taction I 


TTni 7AP YD 
UIU-ZAr AJv 




HOPB 


Human OB HOS control fraction 

T 
1 


Uni-ZAP XR 


LP03 


nUKD 


U nmon f\Ti Tlf\Q trp»a+£»rl M ft n"N/f 

riunian \JD nuo ueaieu mvi 

•fT'Qf'tlAfl T 


TTni TAP YT? 


t pn^ 


HSVA HSVB HSVC 


Human Chronic Synovitis 


Uni-ZAP XR 


LP03 


TTT) f% A 

riKUA 


ttt txa a M QTOA/T A C*VX 
rlUMAIN o 1 L/JVlAv^rl 


TTr^i 7AP YT? 


T Pft^ 


HBJA HBJB HBJC HBJD HBJE 
HBJF HBJG HBJH HBJI HBJJ 

ITnTV 

HBJK 


HUMAN B CELL 
LYMPHOMA 


Uni-ZAP XR 


LP03 


TT/~nT) A TT/~ , r>'D OPT) 

riCKA tLCK±> HCKU 


numan corpus coiosum 






HUD A HvJUB HUDC HUDD 


numan ovanan cancer 


tt-; 7 An "YT? 
Uni-ZAJr AK 


T PHI 


rJLDoA 


JL/ennatoii Drosarcoma 

T lULUlJClaJJL-C 


tt„- 7AP YP 


T T>(\1 
JUrUJ 


UMWA HMWR HMWf 

xXlYI VV /Tk. XXLV1 VY -L> I JU.VJL YV 

HMWD HMWEHMWF 
HMWG HMWH HMWI HMWJ 


(RS4:11) 


Uni-ZAP XR 


LP03 


HSOA 


stomach cancer (human) 


Uni-ZAP XR 


LP03 


HERA 


SKIN 


Uni-ZAP XR 


LP03 


HMDA 


Brain-medul loblas toma 


Uni-ZAP XR 


LP03 


xlOJ_./\ rIKjLiD rlXJLjU 


OJ1UDJ aMUIIia 


TTni-7AP YR 


t pen 


HEAA 


H. Atrophic Endometrium 


Uni-ZAP XR 


LP03 


HBCA HBCB 


H. Lymph node breast Cancer 


Uni-ZAr AK 


LrUi 


HPWT 


Human Prostate BPH, re- 

* * 

excision 


T T_- 7 AT) VD 

Um-ZAP XR 


LP03 


HFVGHFVHHFVI 


Fetal Liver, subtraction 11 


pBS 


LP03 


HNFI 


Human Neutrophils, Activated, 

* * 

re-excision 


pBS 


T DAI 

JLPU3 


JrXDjViD JTJDiVil^ XlJtJIVJJJ 


xiumau Done ividirow, re- 

V/AV i O lull 


pJ5C> 


T PO^ 


HKMLHKMMHKMN 

X 1 tVl T.I.I j X 1 IX IT-i-tTA X JUViVJLi T 


H Kidnev Medulla re-excision 


t)BS 


LP03 


HKKHKIY 


H. Kidney Cortex, subtracted 


pBS 


LP03 


HADT 


H. Amygdala Depression, 

i>UUUd.CLCU 


pBS 


LP03 


H6AS 


Hl-60, untreated, subtracted 


Uni-ZAP XR 


LP03 


H6ES 


HL-60, PMA 4H, subtracted 


Uni-ZAP XR 


LP03 


H6BS 


HL-60, RA 4n, Subtracted 


Uni-ZAP XK 




JnLCwJo 


XXL.-DU, JrlVlA 1Q, SUiXIaCiea 


TTni 7AP YR 




HTXJHTXK 


Activated T- 

cell( 12h)/lTuouridme-re- 

excision 


Uni-ZAP XR 


LP03 


HMS A HMSB HMSC HMSD 
HMSE HMSF HMSG HMSH 
HMSI HMS J HMSK 


Monocyte activated 


Uni-ZAP XR 


LP03 


HAGA HAGB HAGC HAGD 


Human Amygdala 


Uni-ZAP XR 


LP03 
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Catalog Description 


Vector 


ATCC 

iJprvooif" 
XVOpUML 


HAGE HAGF 








noivrv xioivD rioJtvJ-» 


QTPfYN/f AT 

OSTEOCLASTOMA 




T VCYX 


TT<lPTi TTCPT7 UQPn UCDU 
rcoivL/ ixoivr jiorvo lxoivxi 


xxULIuua WolCCX-laSlOlIla OUUlilal 

Cells - unamplified 


TT n ; 7 A P YD 


T PfY* 


ucnA H^OTl UQDP "H^nn 
noyA nov^JD xiov^v^ ±io\^l/ 

HSOE HSOF HSOG 


OUOIIaoX ecu ir^/t 




T PO^ 


HSKA HSKB HSKC HSKD 
HSKE HSKF HSKZ 


Smooth muscle, serum treated 


Uni-ZAPXR 


LP03 


HSLA HSLB HSLC HSLD 

XXOXjXj A AOXjA XaOXjVJ 


Smooth muscle,control 


Uni-ZAPXR 


LP03 


HSDA HSDD HSDF HSDF 

llOJL/n A 101-/iv IlOlVij XXOX-/L 

HSDG HSDH 


SniiiAl corH 

\JIJXXICIX VV/LlA 


TJni-7AP XR 

w All Z jlxJi 1 VAV 




XXI VT O 


X lUoLdlG XJX x 1. oUULiutlvu XL 


px>o 




HSKW HSKX HSKY 


Smooth Muscle- HASTE 

■n /"\i*mn 1 1 ?&f\ 
liUI iktillkijCAA 


pBS 


LP03 


HFPB HFPC HFPD 


H. Frontal cortex,epileptic;re- 


Uni-ZAPXR 


LP03 


HSDI HSDJ HSDK 

J. J. v~JJ — ' J. 11UA/J AAiJA-'Ak. 


Sninal ford rp-P"XfMQioTi 


I In i -ZAP XR 

WAIA # iill /Vl\ 


LP03 


HSKN HSKO 

A Av_> AVA 1 JL J1\J Ivw 


Smooth Mii^cle Sprum Treated 

Norm 


dBS 

poo 


LP03 

J— 'A KJ—f 


HSKG RSKH HSKI 

JL1U1V1J. A AiJAVA 


Smooth muscle spnim 

induced,re-exc 


nBS 


LP03 


HFCA HFCB HFCC HFCD 

A XX ' — . A_< A AA V-^JL 


Human Fetal Brain 


Uni-ZAP XR 


LP04 


HPTAHPTB HPTD 


Human Pituitary 


Uni-ZAP XR 


LP04 


TTTTTO 1^X1'^ ITTTTTi 
ixixxD Ixl xx^ LxxxxlJ 


xiuman inyrnus 


Uni-ZiAT Ax\. 


T PfU 


ttt:<x> TXTJ^r^ TXPAFk TTP^Th T4Th^T7 
xxiiOx> xlxiDv^ I1JC.01J JtlcOxS IxxlOr' 

HE6G HE6S 


xiuman wnoie oix weeK \jiq. 
Embryo 


Uni-ZAr AK 


i-xU4 


tree a xjccr> uccp urccr* 
xlooA xloorf xxoov^ riooi 1 

HSSE HSSF HSSG HSSH HSSI 

xxooJ xxooxv 


xiuman synovial aarcoma 


T T«i 7AD YD 




HE7T 


7 Week Old Early Stage Human, 

bUULTaCLCU 


Uni-ZAP XR 


LP04 


HEPA HEPB HEPC 


Human Epididymus 


Uni-ZAPXR 


LP04 


xloiNA xloINiJ xloINC xxoiNM 
USNN 

XXOl^li. ^1 


riuman oynovium 


T T_: TAD VD 


JLx\J4 


HPFB HPFC HPFD HPFK 

XXI X XV XXL X XXL X IS XXL X X—t 


TTnman T^rn^tatp (^anrj*r SJtapf* C* 

JLaUXIuIXI X lVJOUUt V_^U.HVA_/Jl 9 OLagV V- ' 

fraction 


Uni-ZAP XR 


LP04 


HE2A HE2D HK2F HE2H HR7T 
HE2M HE2N HE20 


1 2 Week Old Earlv Stape 
Human 


Uni-ZAP XR 


LP04 


HE2B HE2C HE2F HE2G HE2P 
HE2Q 


12 Week Old Early Stage 
Human, JJ 


Uni-ZAP XR 


LP04 


HPTSHPTTHPTU 


Human Pituitary, subtracted 


Uni-ZAP XR 


LP04 


HAUA HAUB HAUC 


Amniotic Cells - TNF induced 


Uni-ZAP XR 


LP04 


HAQA HAQB HAQC HAQD 


Amniotic Cells - Primary 
Culture 


Uni-ZAPXR 


LP04 


HWTA HWTB HWTC 


wilm's tumor 


Uni-ZAP XR 


LP04 


HBSD. 


Bone Cancer, re-excision 


Uni-ZAPXR 


LP04 


HSGB 


Salivary gland, re-excision 


Uni-ZAPXR 


LP04 



646 



WO 02/076488 PCT/US02/08276 



Libraries owned by Catalog 


Catalog Description 


Vector 


ATCC 
Deoosit 


HSJA HSJB HSJC 


Smooth muscle-ILb induced 


Uni-ZAP XR 


LP04 


HSXA HSXB HSXC HSXD 


Human ^siiHcfantiJi NiPTJl 


Uni-ZAP XR 




HSHA HSHB HSHC 


Smooth muscle, ILlb induced 


Uni-ZAP XR 


LP04 


jIUUA riuuJD riL/ULx rlL/UJL/ 
HOT IF 


Adipocytes 


llni 7AP YP 


r t>ri/t 
JUrU4 


HPWA HPWB HPWC HPWD 
HPWF 

XXL V»X-r 


Prostate BPH 


Uni-ZAP XR 


LP04 


HPT A HELB HELC HELD 
HF,T ,F. HRT ,F HRT jG HRT ,H 


Endothelial cells-control 


Uni-ZAP XR 


LP04 


JtlDlVIA IxxiiVJxj IxxiJViv^ xxC/JVlJL' 

HEME HEMF HEMG HEMH 


xinQOineiiai-inaucea 

* 


TTni 7AP YT> 


LrU4 


HBIA HBIB HBIC 


Human Brain, Striatum 


Uni-ZAP XR 


LP04 


TTTTO A T TTTOTi TTTTPD TTTTOTN 

HHS A HHSB HHSC HHSD 
truce 


Human 

nypo tnaimus, ocnizopnrenia 


Uni-ZAP XR 


LP04 


TTIMTiA TTVnP TTMnr* T-TMf^Fl 
JnuNvjA rilNLrxJ xixNvj^ JnLlNVjLJ 

HNGE HNGF HNGG HNGH 

XXL 1 Ul JLJUL 1 KJJ 


neuuopniis coniroi 


TTni 7APYP 


T PO/1 
JLxU^f 


HNHAHNHB HNHCHNHD 

HNHR HNTTF TTNTTfi 1TNTTT T 

XX1NXXL-/ XXliXXJL XXI iXAvJ XXI N 111 1 

HNfflHNHJ 


Neutrophils IL-1 and LPS 

iilUUCCU 


Uni-ZAP XR 


LP04 




O X XVLfA. 1 \J 1V1 IVXjX XVX-vO o JLW1> 


TTni-7AP XR 


T PfM 


HHPT 


Hypothalamus 


Uni-ZAP XR 


LP04 


XJC A T UC ATT T_TC A \T UO A X\T 

xxoAl rioAU noAV xioAW 
AV TJC A Y W5A7 


Anergic 1-celi 


Uni-ZAr AK 


LrU4 


HBMS HBMT HBMU HBMV 

XXDlVl W JXDiVxA. 


Bone marrow 


Uni-ZAP XR 


LP04 


HOEA HOEB HOEC HOED 

xikJLZlj JoXJLjr XivJxU 


Osteoblasts 


Uni-ZAP XR 


LP04 


WATA TTATR TTATP HATTi MATT? 

XXrvXrV XX/tJLX> XXrYll^ XX/tJU./ XXrvXC 

HAIF 


FnithAliol-TTsTP'i nnH TTSTP 
X^piUlCllaJ. 1 liXtl dilu XLNX 

lTldl1Cf*H 


TTni 7AP 


T POA 


HTGA HTGB HTGC HTGD 

AAA Vi A 4 Jl A V-J J— AAA V-F AAA \JA^ 


ADODtotic T-cell 


Uni-ZAP XR 

W AAA M -** AA X VAX 


LP04 

Jul vy r 


HMCA HMCB HMCC HMCD 
HMCE 


Macrophage-oxLDL 


Uni-ZAP XR 


LP04 


HMAA HMAB I1MAC HMAD 

AJUTXi Ai X AAl'Ai AA/ AAif Ai X AAl'Ai LA/ 

HMAEHMAF HMAG 


Macronhaee (GM-CSF treated"} 


Uni-ZAP XR 

V> AAA A-41 AA « AAV 


LP04 

xja vy r 


HPHA 

A A-A A AX A 


Normal Prostate 

■A. 1 \Jx 1 1 Iff 1 1 A X V/Oiulv 


Uni-ZAP XR 

\-V AJIA #i 41 AA AAV 


LP04 

A-iA vy ^ 


HPIA HPBB HPIC 


LNCAP prostate cell line 


Uni-ZAP XR 


LP04 


xipTA TTPTR IIP TP 
n±Jr\. xxx J x> xxJr Jv^ 


rv^j x rusiaic ecu ixiic 




T POd 


TTO^F HOSF TTO^n 
nuoij xivor iivjovj 


XXUlXlaXl wolCL/wlaolUllla, 1 C 

excision 


TTni-7AP XR 

LVJXI Z_«r\X ^VXV 


T P04. 


xx i vjxj. xi i ur 


XxpUpiUUC 1 -CCL1, IC-CAvlolUIl 


UIXl— CjJrix j<\J\. 


T P04. 

J-<X Vrr 


XXLYXrVJ XXlVXrVJV 


XX lYladUpiiagC ^^jlVl-^Or 

treated), re-excision 


TTni-7AP VP 


T P04 


HACB HACC HACD 


Human Adipose Tissue, re- 
excision 


Uni-ZAP XR 


LP04 


HFPA 


H. Frontal Cortex, Epileptic 


Uni-ZAP XR 


LP04 


HFAA HFAB HFAC HFAD 
HFAE 


Alzheimer's, spongy change 


Uni-ZAP XR 


LP04 


HFAM 


Frontal Lobe, Dementia 


Uni-ZAP XR 


LP04 


HMIA HMIB HMIC 


Human Manic Depression 


Uni-ZAP XR 


LP04 
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Liurarics ownea oy uatalog 


Catalog uescnpuon 


vector 


A1CC 
ueposit 




Ticciif* 






HTSA HTSE HTSF HTSG ' 
HTSH 


T-Tiim^n TTyvttiii<! 

XxlXXXlALl X llj lUUO 


pr>^ 




HPBA HPBB HPBC HPBD 
HPBE 


Human Pineal Gland 


pBS 


LP05 


HSAA HSAB HSAC 


HSA 172 Cells 


pBS 


LP05 


HSBA HSBB HSBC HSBM 


HSC172 cells 


pBS 


LP05 


HJAA HJAB HJAC HJ AD 


Jurkat T-cell Gl phase 


pBS 

• 


LP05 


HJB A HJBB HJBC HJBD 


Jurkat T-Cell, S phase 


pBS 


LP05 


HAFAHAFB 


Aorta endothelial cells + TNF-a 


pBS 


LP05 


HAW A HAWB HAWC 


Human White Adipose 


pBS 


LP05 


HTNAHTNB 


Human Thyroid 


pBS 


LP05 


HONA 


Normal Ovary, Premenopausal 


pBS 


LP05 


HARAHARB 


Human Adult Retina 


pBS 


LP05 


HLJAHLJB 


Human Lung 


pCMVSport 1 


LP06 


TTOFM T-TOPTM TTOFO 
xjxjvyyfx LiXJxvi xixjryj 


Ovarian Tnmnr TT fiV < \9'*7 


r\P\A\7 Qnnrt 0 O 
pwivi V opt>ii £,,\J 


T PH7 

JLJrU / 


HOGA HOGB HOGC 


OV 10-3-95 


pCMVSport 2.0 


LP07 


HCijrJL 


ClJ34+cells, 11 


pCMVi>port 2.U 


T "DAT 


T"TP\T A 

HDJLA 


HoagKin s Lymphoma 1 


pCMVSport 2.0 


LP07 


T 1 1 \T* A T 1 1 VI'U Til \T > /~ y Tj i Vl'l \ 

JHLul A JHULJIB HJL)1C JHJL)1 JL) 

T-TT^TTR 
xLL/iH 


HoagKin s Lymphoma 11 


pCMVSport 2.0 


LP07 


TWA A TTKAR HTC AP TTK" AF> 

HKAE HKAF HKAG HKAH 


PTfttinfvpvtp 

jvci a LJjj yic 


pv_^ivi v opuruc.u 


T PH7 


HCIM 


CAPFINDER. Crohn's Disease, 
lib 2 


nCMVSnort 2 0 


LP07 


HKAL 


Keratinocyte, lib 2 


pCMVSport2.0 


LP07 


HKAT 


Keratinocyte, lib 3 


pCMVSport2.0 


LP07 


HNDA 


Nasal polyps 


pCMVSport2.0 


DEW 


HDRA 


H. Primary Dendritic Cellsjib 3 


pCMVSport2.0 


LP07 


HOT-TA TTOTfR HOttP 


J71Lliila.il KJci ICUUldoLo 11 


rsPIVTV Qr»rvr+9 fi 
pv^ivi v oporiz..u 


T PH7 


nL*JL/if\ ITLLcU'X) XXLJL'V-/ 


uver, jxepdLuiiiii 


pv^ivi v opoiu>.u 


T PAR 


XJT T^AJ TTT T\i~V TTT T\T> 

Xli-iX/lN llJL-JU'vy xiX->±Jx 


liiiiiiaii iviver, noiiijai 


Pv^ivi v oporuJ.u 


T PAR 


TTMTA 
JXiVl I rv 


t\T^\AP ctiTrmlafpfl xxil r\rA\) T/P 

pjoiviv^ sumuidxcu w/ poiy i/v^ 


pv^ivi v opoiw.u 


T PAR 


TTNJTA 


TCTPP A 9 rrvntrnl 


pv_4Vl V OpOlLJ.v 


T PAR 


irnpA TTnpPt tjTjPP WTYPT) 

HDPF HDPG HDPH HDPI 
HDPJ HDPK 


Xllllliiiy ISOUUilLll, V_A" llo. 11 U 1 


pv^ivi v opuiuj.vy 


f POR 


HDPM HDPN HDPO HDPP 


Primary Dendritic cells,fxac 2 


pCMVSport3.0 


LP08 


ITMTTA TJMTTR KMTTP 


iviyoiuiu irrogcLuit)! v^cii i_»ilic 


r\ PA/TV ClrvM-K^ fl 
p v^ivi v opui U>.U 


r par 


pnurp A T-nTPTR TJTTFP T-TTTPFi 

xxXXD/\ JxIxClD JnulJX XxXxJJrJL^ 


i v^cii iicipcr i 


p\^ivi v opuru>.u 


T PAR 


TTHTTX/T TTPTRXT TTRFfi T-TTTEP 
XxxaElIVI XxIXCiiN xlxlCAJ nnr.r 


1 cexi neiper 11 


Pv-ivi v oporu>.u 


T PAR 

i»ruo 


HEOA HEOB HEOC 

AU'^il IJLu^Xf IIU^V/ 


T-Tuman endometrial stromal cell^ 

xxmumi \*ixw 1 1 i xtxx ju vj i m who 


DCMVSoort3 0 


LP08 


HJMA HIMB 


Human endometrial stromal 
cells-treated with progesterone 


pCMVSport3.0 


LP08 


HSWA HS WB HSWC 


Human endometrial stromal 
cells-treated with estradiol 


pCMVSport3.0 


LP08 


HSYA HSYB HSYC 


Human Thymus Stromal Cells 


pCMVSport3.0 


LP08 


HLWA HLWB HLWC 


Human Placenta 


pCMVSport3.0 


LP08 


HRAAHRAB HRAC 


Rejected Kidney, lib 4 


pCMVSport3.0 


LP08 



648 



WO 02/076488 PCT/US02/08276 



jLaoranes ownea oy catalog 


Catalog JL/CbUiipuiJU 


v ecior 


A ICC 
Deposit 


TTN/TTTV/T 
xilVi 1 iVl 


x^Cxv, pX>iVlC J/L^ irCdlCu 






ITA/fTA - 
J0LiViJ/\ 


xl. ivieniiii ginia, ivio 


poport I 


1 P1A 


xiJ.vjjv/\ xiivjlisjj Jruvjuvv^ xxlVlJSJL' 
HMKE 

X XJLtXXVJLj 


xi. ivieniugiiiki, ivii 


popOK 1 


t Pin 


HUSG HUSI 


Human umbilical vein 


pSport 1 


LP10 


HUSX HTJSY 


Human Umbilical Vein 

liU 1 1 1<1 1 1 VlXivlllVOl V Will 

Endothelial Cells, uninduced 


nSnnrt 1 


LP10 

A-fJL XV/ 


HOFA 


Ovarian Tumor 1 OV5232 


nSJnnTt 1 


LP10 

x>i. xv/ 


HPFA HPFB HCFP HPFD 


T-Pell PHA 16 hrs 




T P10 

X>X XI/ 


HCFL HCFM HCFN HCFO 


T-Cell PHA 24 hrs 


pSport 1 


LP10 


xlAL/A xxAL>C rlAUU HA1/H 

TJATYF WAFin 
rXcxXJS/ XxrVL/vJ 


xiuman Adipose 


pbport l 


LrlO 


HOVA WYVR HO VP 

LLKJ V /"V xi\J V X> llv/ V V_^ 


XXIXJUuoli v/ vol y 




T Pin 


TTTWT* HTWr TTTWD TTTWF 

XX X VV X> XI JL VY X 1 1 V Y U XI 1 TVJU 

HTWF 


Ppcrincr T-Pp11 T i hrarv TT 




t Pin 


HMMA 


Spleen inetastic melanoma 


nSoort 1 


LP10 

* * ■ JL \J* 


HLYA HLYB HLYC HLYD 
HLYE 


Spleen, Chronic lymphocytic 
leukemia 


pSport 1 


LP10 


HCGA 


CD34+ cell, I 


□Snort 1 


LP10 

m mM JL \/ 


HEOM HEON 


Human Eosinoohils 


DSDOrt 1 


LP10 


HTDA 


Human Tonsil. Lib 3 


nSnort 1 


LP10 


HSPA 


Salivary Gland, Lib 2 


pSport 1 


LP10 


HCHA HCHB HCHC 


Breast Cancer cell line, MDA 36 


pSport 1 


LP10 


TTPHM TTPITN 

r 


T3 t»£> o ct Panrpr P^»1l linA 
XJICaol \_/OlltvA V^Cii 1XUC, 

angiogenic 


■nC-nnTt 1 


T P10 


HPTA 




t\Q r\OTf" 1 

popuxi 1 


t Pin 


HDAA HDAB HDAC 


HEL cell line 


pSport 1 


LP10 


xj Ada 
HAoA 


Human Astrocyte 


pSport 1 


T niA 

LrlO 


TTT TT? A TTT TCHO TUT TCP 

riur A HUx\t> xiUrC 


Ulcerative Colitis 


poport 1 


LrlO 


HNTM 


NTERA2 + retinoic acid, 14 
days 


pSport 1 


LP10 


HDQA 


Primary Dendritic 
cells,CapFmder2, frac 1 


pSport 1 


LP10 


HDQM 


Primary Dendritic Cells, 
CapFinder, frac 2 


pSport 1 


LP10 


HLDX 


Human Liver, normal,CapFinder 


pSport 1 


LP10 


HULA HULB HULC 


Human Dermal Endothelial 
CeUs,untreated 


pSport 1 


LP10 


HUMA 


Human Dermal Endothelial 
cells,treated 


pSportl 


LP10 


HCJA 


Human Stromal Endometrial 
fibroblasts, untreated 


pSportl 


LP10 


HCJM 


Human Stromal endometrial 
fibroblasts, treated w/ estradiol 


pSportl 


LP10 


HEDA 


Human Stromal endometrial 
fibroblasts, treated with 
progesterone 


pSportl 


LP10 


HFNA 


Human ovary tumor cell 
OV350721 


pSportl 


LP10 
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Libraries owned by Catalog 


Catalog Description 


Vector 


ATCC 
Deposit 


HKGA HKGB HKGC HKGD 


Merkel Cells 


pSportl 


LP10 


fflSA EDLSB HISC 


Pancreas Islet Cell Tumor 


pSportl 


LP10 


HLSA 


Skin, burned 


pSportl 


LP10 


HBZA 


Prostate,BPH, Lib 2 


pSport 1 

r r 


LP10 


HBZS 


Prostate BPHJLib 2, subtracted 


pSport 1 


LP10 


HFIAHFIB HFIC 

x xx lrx x xxj x xx iv 


Svnnvial Fibroblasts f control • 




LP10 


HKIHHFUHMI 


Synovial hypoxia 


pSport 1 


LP10 


WRIT 141*11 I HKI'V I 

rxrij rtriu xtjtiv 


oynoviai il-i/ hn.t sumuiaieu 


popOTL 1 


t Din 




jviessangiai ceil, rrac i 


popom 


T Pin 


rlMVA ilMVxJ riMVL, 


Jo one Marrow stromal Cell, 
untreated 


pbportl 


t ni/\ 

LrlO 


HJrlA J-LrlY rlrlz, 


oynoviai Fibroblasts (Ul/lINr), 

SUDt 


poportl 


LrlO 


T-rpfw uunv in?n7 


oynoviai nypoxia-Kor 

SUQuaCLCU 


popom 


I Dlft 

L,r 1U 


uuaa uunn TTNAOP HMnn 

niVlV^rx Xxl\x\£I> xTiVAV<lv_, xTJYIV^JL' 


Unman A r»fri AyT/~4Ti <~\r*\rf f»c 

nuiiiaii rvuiivdieu jyionouyies 


TTni "7AP yH 


T PI 1 


tjt TA TTT TO TIT TP" 


riurnan i_»iver 


pdvivoport i 


T r>mo 


uxiD a TpXL_ro"D xranRr 1 T4TTRTT 
JxnJt5/\ xjjtdJj ixItdL^ nxiDU 

HHBE 


riuman xiean 


pLJMVoport 1 


T pnio 


UDD A XTDRR 
xTLr>£>/\ X1X515J5 


jiuman Drain 


pi^ivi v opon i 


T pnn 
L»rUlZ 


HLJAHLJB HUCHLJDHLJE 


Human Lung 


pCMVSport 1 


LP012 


JrlUCjrA JtiUUrt> JtiUOU 


Uvanan lumor 


pLM Vbport z.U 


LrUlZ 


JtiXJM 


Human tonsils, Lib 2 


plJMVbpOrt 2.U 


Lrulz 


HAMFHAMG 


KMH2 


pCMVSport 3.0 


LP012 


TT ATA TT A m TT A T/"~l 

HAJAHAJB HAJC 


L428 


pCMVSport 3.0 


T TV\ 1 i"\ 

LP012 


HWBA HWBB HWBC HWBD 
HWBE 


Dendritic cells, pooled 


pCMVSport 3.0 


LP012 


HWAA HWAB HWAC HWAD 
HWAE 


Human Bone Marrow, treated 


pCMVSport 3.0 


LP012 


HYAA HYAB HYAC 


B Cell lymphoma 


pCMVSport 3.0 


LP012 


HWHGHWHHHWHI 


Healing groin wound, 6.5 hours 
post incision 


pCMVSport 3.0 


LP012 


IWHPHWHQHWHR 


Healmg groin wound; 7.5 hours 
post incision 


pCMVSport 3.0 


LP012 


HARM 


Healmg groin wound - zero hr 
post-incision (control) 


pCMVSport 3.0 


LP012 


-w- -w-w, * 

HBIM 


Olfactory epithehum; 
nasalcavity 


pCMVSport 3.0 


LP012 


TTTT nrx A 

HWDA 


TT 1 * At I J 

Healmg Abdomen wound; 
70&90 min post incision 


pCMVSport 3.0 


LP012 


HWEA 


TT 1 • A1 1 TTT J 1 if 

Healmg Abdomen Wound;l5 
days post incision 


pCMVSport 3.0 


T TVTV 1 O 

LP012 


livVJA 


neaiing ADuomen 
Wound;2l&29 days 


pV-.lVl V OpOIT J.v/ 




HNAL 


Human Tongue, frac 2 


pSportl 


LP012 


HMJA 


H. Meminginia, M6 


pSportl 


LP012 


HMKAHMKB HMKCHMKD 
HMKE 


H. Meningima, Ml 


pSportl 


LP012 


HOFA 


Ovarian Tumor I, 0 V5232 


pSportl 


LP012 


HCFA HCFB HCFC HCFD 


T-CeU PHA 16 hrs 


pSportl 


LP012 
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T>ihranft<i owned hv f^ntnlno- 

JL./1 KJX CXX XwO V/ViUwU _/ V^uUllUll 


vernal JL^ L' i>C 1 1 ]_> U Uil 


vector 


A1LC 
lveposil 


HCFL HCFM HCFN HCFO 


T-Cell PHA24hrs 




T pnio 

JurUlZ 


HMMA HMMB HMMC 


Snleen metallic mGlaTirunn 


nSnrvrt 1 

L/OIJUJLLX 


T PHI 7 


HTDA 


Human Tonsil Lib ^ 


Ti ^nnrt 1 




HDBA 


T-fiiTTvan Petal ThvmiiQ 


poponi 


T POT} 


HDUA 


Pericardium 


popuni 


JL.Jrul^ 


HBZA 


Prostate BPH Lib 2 


popOITl 




HWCA 


Larvnx tumor 


poporn 


T P019 


HWKA 


Normal lunp - 

x i vv i iinn x ixxx 




T POl? 


HSMB 


Bone marrow stroma treated 

JL^VXX^ 1 1 Ml XX KJ TT OIX V/XHB t LlwmvU 


UOUUltl 


T PH19 


HBHM 

■ ■■■ * A^ 4 ■ A 


Normal trachea 

X * V/X XX 1 CXI |_X uwi 1^>U 


pO|lvUL X 


T PH10 
AjfU 1 Z 


HLFC 


MiTmaTi T .prvn y 
x xuxxuxxx Lioi y ixA 


poporti 


t x>ni o 


HLRB 


Siebbert Po1vno<Hs; 


papom 


T P019 


HNIA 

AMI 14.1 A 


,vi " |>ii> "* i y vjioxxu 


poponi 


T PO10 


HNJB 

-A. 1 4 IV 


Palatp pflrfMTinTTia 

x cxiQXVv i^cxXV'lxlUxlJa. 


poponi 


T PHI 0 
LrU 1Z. 


HNKA 


Palate normal 


pSportl 


LP012 


HM7A 


x ilaXyxiA CaXCinOIXIa 


poportl 


T Df\1 O 


HABG 


Cheek Carcinoma 


pSportl 


LP012 




r narynx carcinoma 


pSportl 


T TVM ^ 

LP012 


HDRM 


Larynx Carcinoma 


pSportl 


LP012 


ITVA A 
JOL V /V/\ 


Pancreas normal rLA4 No 


pSportl 


LP012 


HICA 


Tongue carcinoma 


pSportl 


LP012 


T TT TV A TTTTl/'D TTT TPT' TJT Tl/TA 

HUKA HUisJ5 HUKC HUKJD 
HUKE 


Human uterine Cancer 


T ml A T^ y 

Lambda ZAP IT 


LP013 


tjxtu 1 a 
xirrA 


Human Fetal Brain, random 
primed 


Lambda ZAP II 


LP013 


HTUA 


Activated T-cell labeled with 4- 
thiolun 


Lambda ZAP II 


LP013 


HBQA 


Early Stage Human Brain, 
random primed 


Lambda ZAP II 


LP013 


HMEB 


Human microvascular 
Endothelial cells, fract B 


Lambda ZAP II 


LP013 


T TT TO TT 

HUSH 


Human Umbilical Vein 
Endothelial cells, fract. A, re- 

* * 

excision 


Lambda ZAP II 


LP013 


or r^r -1 in /^\t\ 


Hepatocellular tumor, re- 
excision 


Lambda ZAP II 


LP013 


UTTX7T U'VWTV UTT17T 

HI W J HI WK HI WL 


Restmg T-cell, re-excision 


Lambda ZAP II 


LP013 


HF6S 


Human Whole 6 week Old 

. — i_ _ /YY\ f _ j. 

Embryo (II), subt 


pBluescript 


LP013 


HHPS 


Human Hippocampus, 
subtracted 


pBluescript 

* 


LP013 


HL1S 


LNCAP, differential expression 


pBluescript 


LP013 


UT T_TC 1JT Tjnn 
xTLLXlO JjLL»H1 


nariy otage Human Lung, 
Subtracted 


pBluescnpt j 


LP013 


HSUS 


Supt cells, cyclohexamide 
treated, subtracted 


pBluescript 


LP013 


HSUT 


Supt cells, cyclohexamide 
treated, differentially expressed 


pBluescript 


LP013 


HSDS 


H. Striatum Depression, 
subtracted 


pBluescript 


LP013 
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f rhrsifipc nu/DPrl Tiv Cental r\cr 
l^ilUiall&i} UW1IGU Uj v^alcUOg 




Vf*r*tor 


/\ 1 v^v^ 

T}pnoQ it 


HPTZ 


Human Pituitarv Subtracted VII 


nB luescTTnt 




HSDX 


H Striatum Deoression. subt II 


oBluescnDt 


LP013 


HSDZ 


H Striatum Deoression. subt 


nBluescriDt 


LP013 


HPBA HPBB HPBC HPBD 
HPBE 


Human Pineal Gland 


oBluescrint SK- 


LP013 


HRTA 


Colorectal Tumor 


pBluescript SK- 


LP013 


HSBA HSBB HSBC HSBM 


HSC172 cells 


nBluescrint SK- 


LP013 


fflAA HJAB HJ AC HJ AD 


Jurkat T-cell Gl phase 


pBluescript SK- 


LP013 


HJR A HTRR HTRr HTRTj 

X U iJA ID UU 111 XJ 1 XJ D Ls 


J 111 IvuL X -V>CXI 9 v-7 X JLJXltlov/ 




T POI^ 

•Lauij 


HTNAHTNB 


Human Thyroid 


pBluescript SK- 


LP013 


trAiiA "WAT-TP 


11 Li Ilia 11 /\(xUll xlcdXl 


TTni *7AP YR 






wnoie o wccK kjih nuiDryo 


TTni 7AP YT? 


T PTkl 1 
LiiUl j 


HFCA HFCB HFCC HFCD 

xiri^xl 


Human Fetal Brain 


Uni-ZAPXR 


LP013 


HFKC HFKD HFKE HFKF 
xlrxSAj 


Human Fetal Kidney 


Uni-ZAPXR 


LP013 


"Pytr a iirrori tjtirt? tjytrt? 

xll_Ji>/\ riKjDLf XnXj&E, rxvJ&r 


xiuman uraix oiauuer 


TTni 7AP VP 


t phiq 
LrUlj 


HPRA HPRB HPRC HPRD 


Human Prostate 


Uni-ZAPXR 


LP013 


T-TTK A TJTTETi HTT^r^ T-JTTJTi 
ill J}/\ xi I JCo xl 1 Hl^ XilxVU 

HTEE 


xiuman lesies 


TTni 7AP YT? 


t phi ^ 
LrUlj 


rlll/\riilij nllL xll ID 
m jl x_* 


xiuman testes tumor 


TTni 7AP VP 


LrUlj 


XI 1 Dt\ XX I X>X> 


nmnaii relax xDvjiic 


TTni-yAP YR 

yJiXx. / tt\JL J\J\. 




HFLA 


Human Fetal Liver 


Uni-ZAP XR 


LP013 


XTTTCT) TTXJC , P , TJTOTTp\ T-TCTGTG 

xltlro xlxirl^ ilrlriJ rlrlrxi 
xixxrvr 


riuman petal xiearx 


T Tni 7AD YD 




HUVB HUVC HUVD HUVE 


Human Umbilical Vein, End. 
renioKe 


Uni-ZAPXR 


LP013 


"PTTTD l-TTTJTr^ T-TTUT^ 

xiixid xiixxi^ xix xilj 


Tinman TKirninc 

xiuman lnyxnus 


TTni 7 AT> YP 


t pni7 

LtUIj 


uqta T-TCTP W^Tf* T4QTn 
no 1 r\ xlo 1 0 Xlo 1 \^ no 1 U 


xiuman oKin i umor 


TTni 7 AT> YP 


t phi ^ 


UTA A UTAR TTT A P T-TTATi 

xii/\/\ JnLi/vr> xiiAi^ xix/\u 
HTAE 


xiuiTian Aciivatea i -ceus 


TTni 7AP YP 


T PAl ^ 
LrUlj 


XITIIA. LiVEjLJ rjJrJj/V^ 


xxurnan x*eiai jopiujeiium ^SKinj 


TTni 7AP YP 


t pm^ 


HJPA HJPB HJPC HJPD 


Human Jurkat Membrane Bound 
x oiysomes 


Uni-ZAP XR 


LP013 


HESA 


Human Epithelioid Sarcoma 


Uni-ZAPXR 


LP013 




xiuman Aauit i^iver, oUDtractea 


T Tni 7AO YD 
Uni-ZAx AK 


T T>A1 1 


XJL7I» A XJT7TO J-lWrr* UWVT\ 

xirlA xirlxJ Jtlrlv^ xlrllJ 


xiuman retai uura Mater 


T Twi 7AD VP 
Uni-ZALT AK 




HCAA HCAB HCAC 


Cem cells, cyclohexamide 
treateu 


Uni-ZAPXR 


LP013 


imn a rrn ryr> irDnp tjt> fZTs 
xiKvjA xiKljxJ XlKljrC xiKvjrlJ 


Kaji ijeiis, cycionexanuae 

treated 


T TVii 7AD YP 
Um-ZAx AK 




HE9A HE9B HE9C HE9D 
HE9E 


Nine Week Old Early Stage 
Human 


Uni-ZAPXR 


LP013 


HSFA 


Human Fibrosarcoma 


Uni-ZAPXR 


LP013 


HATA HATB HATC HATD 
HATE 


Human Adrenal Gland Tumor 


Uni-ZAPXR 


LP013 


HTRA 


Human Trachea Tumor 


Uni-ZAPXR 


LP013 


HE2A HE2D HE2E HE2H HE2I 


12 Week Old Early Stage 


Uni-ZAP XR 


LP013 
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j^ioraries owiicu. uy catalog 


Catalog Description 


Vector 


ATCC 
Deposit 




jtiuman 






KF9R VTF7C HF9F TTF9H T-TR9P 


Human, II 


Tint 7 AP YP 


T PA1 1 




HNF A HNFR TTNFP KNTPD 

JLJLL> J-^/Tk JLJU.1J *P X JUL 1 JJ«\^ n 1 N 17.1 J 

HNEE 


AlUlIJUlIl INCUUUpilli 


TTni 7AP YD 


T DAI ^ 




FlUliiaii rillllai Y OICaoL V_^aUCCI 


TTni 7AP YP 
Uni-ZjAr AJv 


T Dfll "2 


HPTSHPTTHPTU 


Human Pituitary, subtracted 


Uni-ZAPXR 


LP013 


XTA/TO A T-TA/ffYR T-fNAOT^ T-JA/ffYTi 

ihyii^a Jnu.viv^x> jiiYiv^v-. rliVii^JL/ 


riuman AcuvateQ monocytes 


T T*»; 7AD YD 


T DAI "> 


HOAA HOAB HOAC 


Human Osteosarcoma 


Uni-ZAP XR 


LP013 


Hi OA Hi UD HI Oil HTOr 
xli \Jkj 


human tonsils 


Uni-ZAP XR 


LP013 


HMGB 


Human OB MG63 control 
fraction I 


Uni-ZAP XR 


LP013 




Human OB HOS control traction 
I 


Uni-ZAP XR 


LP013 




Human OB HOb treated (1 nM 
xiz^ Traction 1 


Uni-ZAP XR 


LP013 


TTATTA T-T AT TP IT AT TP 
ITrvlJ A xlAU,D HAUL- 


Amniotic ueus - i iNr moucea 


T T - v AP YP 


T DA1 1 


xlAl^A rlA^JD JtiAl^C JtlAv^lJ 


Ammotic cells - 1 nmary 
^uiiure 


TT-; 7 ad vn 
Uni-ZAr AK 


T DAl "2 




XllJiVlAIN O x \Jvfxl\ \^XX 


TTrn* 7AP YP 


T PAT* 


HRTA T-TRTR HRTP HRTD TTRTF 

XXXJJ /Tk XXXjJXj ilDJL/ JTLJJJ Xj 


Xj X XV XX L X vl VlvTV 


TTni 7AP YP 


T PATi 


HODA HODB HODC HODD 


hrnnari nvflrian ranr^pr 


Uni-7AP XR 

UALL z-uxX j\J\. 


T POT* 


HCPA 


Corpus CaUosnm 


Uni-ZAP XR 


LP013 




siomacn cancer v.numan^ 


TTni 7AP YP 


T PA1 *2 


HERA 


SKIN 


Uni-ZAPXR 


LP013 


riiVJLUA 


rSram-meaull ODlastoma 


tt_ * 7 An VD 

Uni-ZAr XK 


LP013 


liUJLA nUJLo HUJLL> 


Lrlioolastoma 


Uni-ZAr ak 


LP013 


HWTA HWTB HWTC 


wilm's tumor 


Uni-ZAPXR 


LP013 


OC A A 

HJbAA 


fT A A l_ * T"* J a * 

H. Atrophic Endometnum 


T T * r"7 a T\ T7T* 

Um-ZAPXR 


LP013 


TT A T1X 7 TT A T>/""V TT A TVT* TT A T"J/~ V 

HAPN HAPO HAPP HAPQ 

TJADD 

XlArK 


Human Adult Pulmonary ire- 
excision 


Uni-ZAP XR 


LP013 


HLTGHLTH 


Human T-cell lymphoma;re~ 
excision 


Uni-ZAPXR 


LP013 


UATTP TJTATJTi T-TAT-TR 
loAriC nJ\xi\J JtiAxiti 


Human Adult Heart ,re-excision 


T 7 AD VD 

Uni-ZAr AK 


T DAI O 


jiAOA riAljrS JaALrC JHLALfJJ 

HAGE 


Human Amy goal a 


T T«ii 7AD VD 

Uni-ZAr AK 


T DA1 1 


JtloJA JrloJr> rioJC 


omooin muscie-iiwD lnuuceo 


T T« i 7 A P YP 

Uni-ZAr AK 


T DA 1 1 

L,rvl3 


HSHA HSHB HSHC 


Smooth muscle, ELlb induced 


Uni-ZAP XR 


LP013 


rirw A Jar Wo HrWL Hr WJL> 

nrVVE 


Prostate BPH 


Um-ZAr XR 


¥ nni t 

LP013 


"HPT A MPTR HPIP 

Xxx Lf\ XXX XD XXX 1L 




TTni 7AP YP 


T POT 1 


HPJA HPJB HPJC 


PC3 Prostate cell line 


Uni-ZAPXR 


LP013 


HBTA 


Bone Marrow Stroma, 
TNF&LPS ind 


Uni-ZAPXR 


LP013 


HMCF HMCG HMCH HMCI 
HMCJ 


Macrophage-oxLDL; re-excision 


Uni-ZAPXR 


LP013 


HAGG HAGH HAGI 


Human Amygdala^e-excision 


Uni-ZAP XR 


LP013 


HACA 


H. Adipose Tissue 


Uni-ZAPXR 


LP013 
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Libraries owned by Catalog 


Catalog Description 


Vector 


ATCC 
Deposit 


TTT/rTI 

HKFB 


K562 + PMA (36 hrsXre- 
excision 


ZAP Express 


LP013 


TT/ HI IT* TT/TTITTT TT/TT1 TX 7 

HCWT HCWU HCWV 
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Two nonlimiting examples are provided below for isolating a particular clone from the 
deposited sample of plasmid cDNAs cited for that clone in Table 7. First, a plasmid is directly 
isolated by screening the clones using a polynucleotide probe corresponding to the nucleotide 
sequence of SEQ ID NO:X. 

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized using an 
Applied Biosystems DNA synthesizer according to the sequence reported. The oligonucleotide is 
labeled, for instance, with 32 P-y-ATP using T4 polynucleotide kinase and purified according to 
routine methods. (E.g., Maniatis et al. a Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Press, Cold Spring, NY (1982)). The plasmid mixture is transformed into a suitable host, 
as indicated above (such as XL-1 Blue (Stratagene)) using techniques known to those of skill in 
the art, such as those provided by the vector supplier or in related publications or patents cited 
above. The txaiisfonnants are plated on 1.5% agar plates (containing the appropriate selection 
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agent, e.g., ampicillin) to a density of about 150 transformants (colonies) per plate. These plates 
are screened using Nylon membranes according to routine methods for bacterial colony screening 
(e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Edit., (1989), Cold Spring 
Harbor Laboratory Press, pages 1.93 to 1.104), or other techniques known to those of skill in the 
5 art. 

Alternatively, two primers of 17-20 nucleotides derived from both ends of the nucleotide 
sequence of SEQ ID NO:X are synthesized and used to amplify the desired cDNA using the 
deposited cDNA plasmid as a template. The polymerase chain reaction is carried out under 
routine conditions, for instance, in 25 /il of reaction mixture with 0.5 ug of the above cDNA 

10 template. A convenient reaction mixture is 1.5-5 mM Mgd 2 , 0.01% (w/v) gelatin, 20 /iM each of 
dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five 
cycles of PCR (denaturation at 94°C for 1 rnin; armealing at 55°C for 1 rnin; elongation at 72°C 
for 1 rnin) are performed with a Perkin-Elmer Cetus automated thermal cycler. The amplified 
product is analyzed by agarose gel electrophoresis and the DNA band with expected molecular 

15 weight is excised and purified. The PCR product is verified to be the selected sequence by 
subcloning and sequencing the DNA product. 

Several methods are available for the identification of the 5' or 3' non-coding portions of a 
gene which may not be present in the deposited clone. These methods include but are not limited 
to, filter probing, clone enrichment using specific probes, and protocols similar or identical to 5' 

20 and 3' "RACE" protocols which are well known in the art. For instance, a method similar to 5' 
RACE is available for generating the missing 5' end of a desired full-length transcript (Fromont- 
Racine et al, Nucleic Acids Res. 21(7): 1683-1684 (1993)). 

Briefly, a specific RNA oligonucleotide is ligated to the 5 f ends of a population of RNA 
presumably containing full-length gene RNA transcripts. A primer set containing a primer 

25 specific to the ligated RNA oligonucleotide and a primer specific to a known sequence of the gene 
of interest is used to PCR amplify the 5* portion of the desired full-length gene. This amplified 
product may then be sequenced and used to generate the full length gene. 

This above method starts with total RNA isolated from the desired source, although poly- 
A+ RNA can be used. The RNA preparation can then be treated with phosphatase if necessary to 

30 eliminate 5' phosphate groups on degraded or damaged RNA which may interfere with the later 
RNA ligase step. The phosphatase should then be inactivated and the RNA treated with tobacco 
acid pyrophosphatase in order to remove the cap structure present at the 5' ends of messenger 
RNAs. This reaction leaves a 5* phosphate group at the 5' end of the cap cleaved RNA which can 
then be ligated to an RNA oligonucleotide using T4 RNA ligase. 

35 This modified RNA preparation is used as a template for first strand cDNA synthesis 

using a gene specific oligonucleotide. The first strand synthesis reaction is used as a template for 
PCR amplification of the desired 5 1 end using a primer specific to the ligated RNA oligonucleotide 
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and a primer specific to the known sequence of the gene of interest. The resultant product is then 
sequenced and analyzed to confirm that the 5' end sequence belongs to the desired gene. 

Example 2: Isolation of Genomic Clones Corresponding to a Polynucleotide 

5 

A human genomic PI library (Genomic Systems, Inc.) is screened by PCR using primers 
selected for the sequence corresponding to SEQ ID NO:X according to the method described in 
Example 1. (See also, Sambrook.) 

10 Example 3: Tissue specific expression analysis 

The Human Genome Sciences, Inc. (HGS) database is derived from sequencing tissue 
and/or disease specific cDNA libraries. Libraries generated from a particular tissue are selected 
and the specific tissue expression pattern of EST groups or assembled contigs within these 
15 libraries is determined by comparison of the expression patterns of those groups or contigs within 
the entire database. ESTs and assembled contigs which show tissue specific expression are 
selected. 

The original clone from which the specific EST sequence was generated, or in the case of 
an assembled contig, the clone from which the 5* most EST sequence was generated, is obtained 

■ 

20 from the catalogued library of clones and the insert amplified by PCR using methods known in the 
art. The PCR product is denatured and then transferred in 96 or 384 well format to a nylon 
membrane (Schleicher and Scheull) generating an array filter of tissue ' specific clones. 
Housekeeping genes, maize genes, and known tissue specific genes are included on the filters. 
These targets can be used in signal normalization and to validate assay sensitivity. Additional 

25 targets are included to monitor probe length and specificity of hybridization. 

Radioactively labeled hybridization probes are generated by first strand cDNA synthesis 
per the manufacturer's instructions (Life Technologies) from mRNA/RNA samples prepared from 
the specific tissue being analyzed (e.g., prostate, prostate cancer, ovarian, ovarian cancer, etc.). 
The hybridization probes are purified by gel exclusion chromatography, quantitated, and 

30 hybridized with the array filters in hybridization bottles at 65°C overnight The filters are washed 
under stringent conditions and signals are captured using a Fuji phosphorimager. 

Data is extracted using AIS software and following background subtraction, signal 
normalization is performed. This includes a normalization of filter-wide expression levels between 
different experimental runs. Genes that are differentially expressed in the tissue of interest are 

35 identified. 
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Example 4: Chromosomal Mapping of the Polynucleotides 

An oligonucleotide primer set is designed according to the sequence at the 5* end of SEQ 
ID NO:X_ This primer preferably spans about 100 nucleotides. This primer set is then used in a 
5 polymerase chain reaction under the following set of conditions: 30 seconds, 95°C; 1 minute, 
56°C; 1 minute, 70°C. This cycle is repeated 32 times followed by one 5 minute cycle at 70°C. 
Human, mouse, and hamster DNA is used as template in addition to a somatic cell hybrid panel 
containing individual chromosomes or chromosome fragments (Bios, Lie). Hie reactions are 
analyzed on either 8% polyacrylamide gels or 3.5 % agarose gels. Chromosome mapping is 
10 determined by the presence of an approximately 100 bp PGR fragment in the particular somatic 
cell, hybrid. 

Example 5: Bacterial Expression of a Polypeptide 

15 A polynucleotide encoding a polypeptide of the present invention is amplified using PCR 

oligonucleotide primers corresponding to the 5' and 3' ends of the DNA sequence, as outlined in 
Example 1, to synthesize insertion fragments. The primers used to amplify the cDNA insert 
should preferably contain restriction sites, such as BamHI and Xbal, at the 5' end of the primers in 
order to clone the amplified product into the expression vector. For example, BamHI and Xbal 

20 correspond to the restriction enzyme sites on the bacterial expression vector pQE-9. (Qiagen, Inc., 

Chatsworth, CA). This plasmid vector encodes antibiotic resistance (Amp 1 ), a bacterial origin of 
replication (ori), an IPTG-regulatable promoter/operator (P/O), a ribosome binding site (RBS), a 
6-histidine tag (6-His), and restriction enzyme cloning sites. 

The pQE-9 vector is digested with BamHI and Xbal and the amplified fragment is ligated 
25 into the pQE-9 vector maintaining the reading frame initiated at the bacterial RBS. The ligation 
mixture is then used to transform the E. coli strain M15/rep4 (Qiagen, Inc.) which contains 
multiple copies of the plasmid pREP4, which expresses the lad repressor and also confers 

kanamycin resistance (Kan 1 ). Transformants are identified by their ability to grow on LB plates 
and ampicillin/kanamycin resistant colonies are selected. Plasmid DNA is isolated and confirmed 

30 by restriction analysis. 

Clones containing the desired constructs are grown overnight (O/N) in liquid culture in LB 
media supplemented with both Amp (100 ug/ml) and Kan (25 ug/ml). The O/N culture is used to 
inoculate a large culture at a ratio of 1:100 to 1:250. The cells are grown to an optical density 600 
(O.D. 600 ) of between 0.4 and 0.6. EPTG (Isopropyl-B-D-thiogalacto pyranoside) is then added to a 

35 final concentration of 1 mM. IPTG induces by inactivating the lad repressor, clearing the P/O 
leading to increased gene expression. 
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Cells are grown for an extra 3 to 4 hours. Cells are then harvested by centrifugation (20 
mins at 6000Xg). The cell pellet is solubilized in the chaotropic agent 6 Molar Guanidine HC1 by 
stirring for 3-4 hours at 4°C. The cell debris is removed by centrifugation, and the supernatant 
containing the polypeptide is loaded onto a nickel-nitrilo-tri-acetic acid C*Ni-NTA") affinity resin 
5 column (available from QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA 
resin with high affinity and can be purified in a simple one-step procedure (for details see: The 
QIAexpressionist (1995) QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 8. The 
column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed with 10 
10 volumes of 6 M guanidine-HCl pH 6, and finally the polypeptide is eluted with 6 M guanidine- 
HC1, pH 5. 

The purified protein is then renarured by dialyzing it against phosphate-buffered saline 
(PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCL Alternatively, the protein can be 
successfully refolded while immobilized on the Ni-NTA column. The recommended conditions 

15 are as follows: renature using a linear 6M-1M urea gradient in 500 mM NaCl, 20% glycerol, 20 
mM Tris/HCl pH 7.4, containing protease inhibitors. The renaturation should be performed over a 
period of 1.5 hours or more. After renaturation the proteins are eluted by the addition of 250 mM 
immidazole. Immidazole is removed by a final dialyzing step against PBS or 50 mM sodium 
acetate pH 6 buffer plus 200 mM NaCl. The purified protein is stored at 4°C or frozen at -80° C. 

20 In addition to the above expression vector, the present invention further includes an 

expression vector, called pHE4a (ATCC Accession Number 209645, deposited on February 25, 
1998) which contains phage operator and promoter elements operatively linked to a polynucleotide 
of the present invention, called pHE4a. (ATCC Accession Number 209645, deposited on 
February 25, 1998.) This vector contains: 1) a neomycinphosphotransferase gene as a selection 

25 marker, 2) an E. coli origin of replication, 3) a T5 phage promoter sequence, 4) two lac operator 
sequences, 5) a Shine-Delgarno sequence, and 6) the lactose operon repressor gene (laclq). The 
origin of replication (oriC) is derived from pUC19 (LTI, Gaithersburg, MD). The promoter and 
operator sequences are made synthetically. 

DNA can be inserted into the pHE4a by restricting the vector with Ndel and Xbal, 

30 BamHI, Xhol, or Asp718, running the restricted product on a gel, and isolating the larger fragment 
(the stuffer fragment should be about 310 base pairs). The DNA insert is generated according to 
the PCR protocol described in Example 1, using PCR primers having restriction sites for Ndel (5' 
primer) and Xbal, BamHI, Xhol, or Asp718 (3* primer). The PCR insert is gel purified and 
restricted with compatible enzymes. The insert and vector are ligated according to standard 

35 protocols. 

The engineered vector could easily be substituted in the above protocol to express protein 
in a bacterial system. 
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Example 6: Purification of a Polypeptide from an Inclusion Body 

The following alternative method can be used to purify a polypeptide expressed in E coli 
5 when it is present in the form of inclusion bodies. Unless otherwise specified, all of the following 
steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell culture is 

r 

cooled to 4-10°C and the cells harvested by continuous centrifugation at 15,000 rpm (Heraeus 

Sepatech). On the basis of the expected yield of protein per unit weight of cell paste and the 
10 amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended 

in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a 

homogeneous suspension using a high shear mixer. 

The cells are then lysed by passing the solution through a microfluidizer (Microfuidics, 

Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with Nad 
15 solution to a final concentration of 0.5 M NaCl, followed by centrifugation at 7000 xg for 15 min. 

The resultant pellet is washed again using 0.5M NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

The resulting washed inclusion bodies are solubilized with 1.5 M guanidine hydrochloride 

(GuHCl) for 2-4 hours. After 7000 xg centrifugation for 15 min., the pellet is discarded and the 

polypeptide containing supernatant is incubated at 4°C overnight to allow further GuHCl 
20 extraction. 

Following high speed centrifugation (30,000 xg) to remove insoluble particles, the GuHCl 
solubilized protein is refolded by quickly mixing the GuHCl extract with 20 volumes of buffer 
containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by vigorous stirring. The 
refolded diluted protein solution is kept at 4°C without mixing for 12 hours prior to further 

25 purification steps. 

To clarify the refolded polypeptide solution, a previously prepared tangential filtration unit 
equipped with 0.16 Jim membrane filter with appropriate surface area (e.g., Hltron), equilibrated 
with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample is loaded onto a cation 
exchange resin (e.g., Poros HS-50, Perseptive Biosystems). The column is washed with 40 mM 

30 sodium acetate, pH 6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the 
same buffer, in a stepwise manner. The absorbance at 280 nm. of the effluent is continuously 
monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the polypeptide are then pooled and mixed with 4 volumes of water. 
The diluted sample is then loaded onto a previously prepared set of tandem columns of strong 

35 anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, Perseptive 
Biosystems) exchange resins. The columns are equilibrated with 40 mM sodium acetate, pH 6.0. 
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Both columns are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCl. The CM-20 
column is then eluted using a 10 column volume linear gradient ranging from 0.2 M NaCl, 50 mM 
sodium acetate, pH 6.0 to L0M NaCl, 50 mM sodium acetate, pH 6.5. Fractions are collected 
under constant A280 monitoring of the effluent. Fractions containing the polypeptide (determined, 
5 for instance, by 16% SDS-PAGE) are then pooled. 

The resultant polypeptide should exhibit greater than 95% purity after the above refolding 
and purification steps. No major contaminant bands should be observed from Commassie blue 
stained 16% SDS-PAGE gel when 5 jig of purified protein is loaded. The purified protein can also 
be tested for endotoxin/LPS contamination, and typically the LPS content is less than 0.1 ng/ml 
10 according to LAL assays. 

Example 7: Cloning and Expression of a Polypeptide in a Baculovirus Expression 

System 

15 In this example, the plasmid shuttle vector pA2 is used to insert a polynucleotide into a 

baculovirus to express a polypeptide. This expression vector contains the strong polyhedrin 
promoter of the Autographa californica nuclear polyhedrosis virus (AcMNPV) followed by 
convenient restriction sites such as BamHI, Xba I and Asp718. The polyadenylation site of the 
simian virus 40 ("SV40") is used for efficient polyadenylation. For easy selection of recombinant 

20 virus, the plasmid contains the beta-galactosidase gene from E. coli under control of a weak 
Drosophila promoter in the same orientation, followed by the polyadenylation signal of the 
polyhedrin gene. The inserted genes are flanked on both sides by viral sequences for cell- 
mediated homologous recombination with wild-type viral DNA to generate a viable virus that 
express the cloned polynucleotide. 

25 Many other baculovirus vectors can be used in place of the vector above, such as pAc373, 

pVL941, and pAcMl, as one skilled in the art would readily appreciate, as long as the construct 
provides appropriately located signals for transcription, translation, secretion and the like, 
including a signal peptide and an in-frame AUG as required. Such vectors are described, for 
instance, in Luckow et al., Virology 170:31-39 (1989). 

30 Specifically, the cDNA sequence contained in the deposited clone, including the AUG 

initiation codon, is amplified using the PCR protocol described in Example 1. If a naturally 
occurring signal sequence is used to produce the polypeptide of the present invention, the pA2 
vector does not need a second signal peptide. Alternatively, the vector can be modified (pA2 GP) 
to include a baculovirus leader sequence, using the standard methods described in Summers et al., 

35 "A Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures," Texas 
Agricultural Experimental Station Bulletin No. 1555 (1987). 
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The amplified fragment is isolated from a 1% agarose gel using a commercially available 
kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested with appropriate 
restriction enzymes and again purified on a 1% agarose gel. 

The plasmid is digested with the corresponding restriction enzymes and optionally, can be 
dephosphorylated using calf intestinal phosphatase, using routine procedures known in the art. 
The DNA is then isolated from a 1% agarose gel using a commercially available kit ("Geneclean" 
BIO 101 Inc., La Jolla, Ca.). 

The fragment and the dephosphorylated plasmid are ligated together with T4 DNA ligase. 
E. coli HB101 or other suitable E. coli hosts such as XL-1 Blue (Stratagene Cloning Systems, La 
Jolla, CA) cells are transformed with the ligation mixture and spread on culture plates. Bacteria 
containing the plasmid are identified by digesting DNA from individual colonies and analyzing the 
digestion product by gel electrophoresis. The sequence of the cloned fragment is confirmed by 
DNA sequencing. 

Five \ig of a plasmid containing the polynucleotide is co-transfected with 1.0 fig of a 
commercially available linearized baculovirus DNA ("BaculoGold™ baculovirus DNA, 
Pharmingen, San Diego, CA), using the lipofection method described by Feigner et al., Proc. Natl. 
Acad. Sci. USA 84:7413-7417 (1987). One jig of BaculoGold™ virus DNA and 5 \ig of the 
plasmid are mixed in a sterile well of a microtiter plate containing 50 }i\ of serum-free Graced 
medium (Life Technologies Inc., Gaithersburg, MD). Afterwards, 10 /*! Lipofectin plus 90 /zl 
Grace's medium are added, mixed and incubated for 15 minutes at room temperature. Then the 
transfection mixture is added drop-wise to Sf9 insect cells (ATCC CRL 1711) seeded in a 35 mm 
tissue culture plate with 1 ml Grace's medium without serum. The plate is then incubated for 5 
hours at 27° C. The transfection solution is then removed from the plate and 1 ml of Grace's insect 
medium supplemented with 10% fetal calf serum is added. Cultivation is then continued at 27° C 
for four days. 

After four days the supernatant is collected and a plaque assay is performed, as described 
by Summers and Smith, supra. An agarose gel with "Blue Gal" (Life Technologies Lie, 
Gaithersburg) is used to allow easy identification and isolation of gal-expressing clones, which 
produce blue-stained plaques. (A detailed description of a "plaque assay" of this type can also be 
found in the user's guide for insect cell culture and baculovirology distributed by Life 
Technologies Inc., Gaithersburg, page 9-10.) After appropriate incubation, blue stained plaques 
are picked with the tip of a micropipettor (e.g., Eppendorf). The agar containing the recombinant 
viruses is then resuspended in a microcentrifuge tube containing 200 /il of Grace's medium and the 
suspension containing the recombinant baculovirus is used to infect Sf9 cells seeded in 35 mm 

* 

dishes. Four days later the supernatants of these culture dishes are harvested and then they are 
stored at 4° C. 
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To verify the expression of the polypeptide, Sf9 cells are grown in Grace's medium 
supplemented with 10% heat-inactivated FBS. The cells are infected with the recombinant 
baculovirus containing the polynucleotide at a multiplicity of infection ("MOI") of about 2. If 
radiolabeled proteins are desired, 6 hours later the medium is removed and is replaced with SF900 
5 II medium minus methionine and cysteine (available from Life Technologies Inc., Rockville, MD). 
After 42 hours, 5 fid of 35 S-methionine and 5 /xCi 35 S-cysteine (available from Amersham) are 
added. The cells are further incubated for 16 hours and then are harvested by centrifugation. The 
proteins in the supernatant as well as the intracellular proteins are analyzed by SDS-PAGE 
followed by autoradiography (if radiolabeled). 
10 Microsequencing of the amino acid sequence of the amino terminus of purified protein 

may be used to determine the amino terminal sequence of the produced protein. 

Example 8: Expression of a Polypeptide in Mammalian Cells 

15 The polypeptide of the present invention can be expressed in a mammalian celL A typical 

mammalian expression vector contains a promoter element, which mediates the initiation of 
transcription of mRNA, a protein coding sequence, and signals required for the termination of 
transcription and polyadenylation of the transcript. Additional elements include enhancers, Kozak 
sequences and intervening sequences flanked by donor and acceptor sites for RNA splicing. 

20 Highly efficient transcription is achieved with the early and late promoters from SV40, the long 
terminal repeats (LTRs) from Retroviruses, e.g., RSV, HTLVI, HIVI and the early promoter of the 
cytomegalovirus (CMV). However, cellular elements can also be used (e.g., the human actin 
promoter). 

Suitable expression vectors for use in practicing the present invention include, for 
25 example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), pRSVcat (ATCC 
37152), pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109), pCMVSport 2.0, and pCMVSport 
3.0. Mammalian host cells that could be used include, human Hela, 293, H9 and Jurkat cells, 
mouse NIH3T3 and C127 cells, Cos 1, Cos 7 and CV1, quail QC1-3 cells, mouse L cells and 
Chinese hamster ovary (CHO) cells. 
30 Alternatively, the polypeptide can be expressed in stable cell lines containing the 

polynucleotide integrated into a chromosome. The co-transfection with a selectable marker such 
as DHFR, gpt, neomycin, or hygromycin allows the identification and isolation of the transfected 
cells. 

The transfected gene can also be amplified to express large amounts of the encoded 
35 protein. The DHFR (dihydrofolate reductase) marker is useful in developing cell lines that carry 
several hundred or even several thousand copies of the gene of interest. (See, e.g., Alt, F. W., et 
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al., J. Biol Chem. 253:1357-1370 (1978); Hamlin, J. L. and Ma, C, Biochem. et Biophys. Acta, 
1097:107-143 (1990); Page, M. J. and Sydenham, M. A., Biotechnology 9:64-68 (1991)). Another 
useful selection marker is the enzyme glutamine synthase (GS) (Murphy et al., Biochem J. 
227:277-279 (1991); Bebbington et al., Bio/Technology 10:169-175 (1992). Using these markers, 
5 the mammalian cells are grown in selective medium and the cells with the highest resistance are 
selected. These cell lines contain the amplified gene(s) integrated into a chromosome. Chinese 
hamster ovary (CHO) and NSO cells are often used for the production of proteins. 

Derivatives of the piasmid pSV2-dhfr (ATCC Accession No. 37146), the expression 
vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession No.209647) contain the 

10 strong promoter (LTR) of the Rous Sarcoma Virus (Cullen et al., Molecular and Cellular Biology, 
438^447 (March, 1985)) plus a fragment of the CMV-enhancer (Boshart et al., Cell 41:521-530 
(1985)). Multiple cloning sites, e.g., with the restriction enzyme cleavage sites BamHI, Xbal and 
Asp718, facilitate the cloning of the gene of interest. The vectors also contain the 3' intron, the 
polyadenylation and tennination signal of the rat preproinsulin gene, and the mouse DHFR gene 

15 under control of the SV40 early promoter. 

Specifically, the piasmid pC6, for example, is digested with appropriate restriction 
enzymes and then dephosphorylated using calf intestinal phosphates by procedures known in the 
art The vector is then isolated from a 1% agarose gel. 

A polynucleotide of the present invention is amplified according to the protocol outlined 

20 in Example 1. Tf a naturally occurring signal sequence is used to produce the polypeptide of the 
present invention, the vector does not need a second signal peptide. Alternatively, if a naturally 
occurring signal sequence is not used, the vector can be modified to include a heterologous signal 
sequence. (See, e.g., International Publication No. WO 96/34891.) 

The amplified fragment is isolated from a 1% agarose gel using a commercially available 

25 kit ("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment then is digested with appropriate 
restriction enzymes and again purified on a 1% agarose gel. 

The amplified fragment is then digested with the same restriction enzyme and purified on 
a 1% agarose gel. The isolated fragment and the dephosphorylated vector are then ligated with T4 
DNA ligase. E. coli HB101 or XL-1 Blue cells are then transformed and bacteria are identified 

30 that contain the fragment inserted into piasmid pC6 using, for instance, restriction enzyme 
analysis. 

Chinese hamster ovary cells lacking an active DHFR gene is used for transfection. Five 
fig of the expression piasmid pC6 or pC4 is cotransfected with 0.5 jig of the piasmid pSVneo 
using lipofectin (Feigner et al., supra). The piasmid pSV2-neo contains a dominant selectable 
35 marker, the neo gene from Tn5 encoding an enzyme that confers resistance to a group of 
antibiotics including G418. The cells are seeded in alpha minus MEM supplemented with 1 
mg/ml G418. After 2 days, the cells are trypsinized and seeded in hybridoma cloning plates 
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(Greiner, Germany) in alpha minus MEM supplemented with 10, 25, or 50 ng/ml of methotrexate 
plus 1 mg/ml G418. After about 10-14 days single clones are trypsinized and then seeded in 6- 
well petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 100 nM, 
200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of methotrexate are then 
5 transferred to new 6-well plates containing even higher concentrations of methotrexate (1 /iM, 2 
/iM, 5 jiM, 10 mM, 20 mM). The same procedure is repeated until clones are obtained which 
grow at a concentration of 100 - 200 /zM. Expression of the desired gene product is analyzed, for 

instance, by SDS-PAGE and Western blot or by reversed phase HPLC analysis. 

♦ 

10 Example 9: Protein Fusions 

The polypeptides of the present invention are preferably fused to other proteins. These 
fusion proteins can be used for a variety of applications. For example, fusion of the present 
polypeptides to KhVtag, HA-tag, protein A, IgG domains, and maltose binding protein facilitates 

15 purification. (See Example 5; see also EPA 394,827; Traunecker, et al, Nature 331:84-86 (1988)). 
Similarly, fusion to IgG-1, IgG-3, and albumin increases the halflife time in vivo. Nuclear 
localization signals fused to the polypeptides of the present invention can target the protein to a 
specific subcellular localization, while covalent heterodimer or homodimers can increase or 
decrease the activity of a fusion protein. Fusion proteins can also create chimeric molecules 

20 having more than one function. Finally, fusion proteins can increase solubility and/or stability of 
the fused protein compared to the non-fused protein. All of the types of fusion proteins described 
above can be made by modifying the following protocol, which outlines the fusion of a 
polypeptide to an IgG molecule, or the protocol described in Example 5. 

Briefly, the human Fc portion of the IgG molecule can be PCR amplified, using primers 

25 that span the 5' and 3' ends of the sequence described below. These primers also should have 
convenient restriction enzyme sites that will facilitate cloning into an expression vector, preferably 
a mammalian expression vector. 

For example, if pC4 (ATCC Accession No. 209646) is used, the human Fc portion can be 
ligated into the BamHI cloning site. Note that the 3' BamHI site should be destroyed. Next, the 

30 vector containing the human Fc portion is re-restricted with BamHI, linearizing the vector, and a 
polynucleotide of the present invention, isolated by the PCR protocol described in Example 1, is 
ligated into this BamHI site. Note that the polynucleotide is cloned without a stop codon, 
otherwise a fusion protein will not be produced. 

If the naturally occurring signal sequence is used to produce the polypeptide of the present 

35 invention, pC4 does not need a second signal peptide. Alternatively, if the naturally occurring 
signal sequence is not used, the vector can be modified to include a heterologous signal sequence. 
(See, e.g., International Publication No. WO 96/34891.) 
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Human IgG Fc region: 

GGGATCCGGAGCCCAAATCITCTGACAAAACTCACACATGCCCACCG 
GCACCTGAATTCGAGGGTGCACCGTCAGTCTT<X 

5 CCCTCATGATCTCCCGGACTCCT^ 

AGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAAT^ 
GACAAAGCXGCGGGAGGAGCAGTACAACAGCACXjTACCGTGTGGTCAGCGTCCTCAC 
CGTCCTGCACCAGGACTGGCrGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAAC^ 
AGCCCTCCCAAC(XCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCC^ 
10 ACCACAGGTGTACACCCTGCrcCC^^ 

CCTGACCTGK^CTGGTCAAAGGCITCTATCCAAGCGACATCGCCGTTO 

CAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGG 

CTCCTTCTTCCTCTACAGCAAGCT 

CGTCITCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACT 
15 CTCTCCCTGTCTCCGGGTAAATGAGTGCGACGGCCGCGACTCTAGAGGAT (SEQ ID 
NO: 1) 

Example 10: Production of an Antibody from a Polypeptide 

20 a) Hybridoma Technology 

The antibodies of the present invention can be prepared by a variety of methods. (See, 
Current Protocols, Chapter 2.) As one example of such methods, cells expressing a polypeptide of 
the present invention are administered to an animal to induce the production of sera containing 

9 

polyclonal antibodies. La a preferred method, a preparation of a polypeptide of the present 
25 invention is prepared and purified to render it substantially free of natural contaminants. Such a 
preparation is then introduced into an animal in order to produce polyclonal antisera of greater 
specific activity. 

Monoclonal antibodies specific for a polypeptide of the present invention are prepared 
using hybridoma technology (Kohler et al., Nature 256:495 (1975); Kohler et al., Eur. J. Immunol. 

30 6:511 (1976); Kohler et al., Eur. J. Immunol. 6:292 (1976); Hammerling et al., in: Monoclonal 
Antibodies and T-Cell Hybridomas, Elsevier, N.Y., pp. 563-681 (1981)). In general, an animal 
(preferably a mouse) is immunized with a polypeptide of the present invention or, more preferably, 
with a secreted polypeptide-expressing cell. Such polypeptide-expressing cells are cultured in any 
suitable tissue culture medium, preferably in Earle's modified Eagle's medium supplemented with 

35 10% fetal bovine serum (inactivated at about 56°Q, and supplemented with about 10 g/1 of 
nonessential amino acids, about 1,000 U/ml of penicillin, and about 100 /ig/ml of streptomycin. 
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The splenocytes of such mice are extracted and fused with a suitable myeloma cell line. 
Any suitable myeloma cell line may be employed in accordance with the present invention; 
however, it is preferable to employ the parent myeloma cell line (SP20), available from the 
ATCC. After fusion, the resulting hybridoma cells are selectively maintained in HAT medium, 
and then cloned by limiting dilution as described by Wands et al. (Gastroenterology 80:225-232 
(1981)). The hybridoma cells obtained through such a selection are then assayed to identify clones 
which secrete antibodies capable of binding the polypeptide of the present invention . 

Alternatively, additional antibodies capable of binding to a polypeptide of the present 
invention can be produced in a two-step procedure using anti-idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves antigens, and therefore, it is possible to obtain 
an antibody which binds to a second antibody. In accordance with this method, protein specific 
antibodies are used to immunize an animal, preferably a mouse. The splenocytes of such an 
animal are then used to produce hybridoma cells, and the hybridoma cells are screened to identify 
clones which produce an antibody whose ability to bind to the polypeptide-specific antibody can 
be blocked by said polypeptide. Such antibodies comprise anti-idiotypic antibodies to the 
polypeptide-specific antibody and are used to immunize an animal to induce formation of further 
polypeptide-specific antibodies. 

For in vivo use of antibodies in humans, an antibody is "humanized". Such antibodies can 
be produced using genetic constructs derived from hybridoma cells producing the monoclonal 
antibodies described above. Methods for producing chimeric and humanized antibodies are known 
in the art and are discussed herein. (See, for review, Morrison, Science 229: 1202 (1985); Oi et al., 
BioTechniques 4:214 (1986); Cabilly et al., U.S. Patent No. 4,816,567; Taniguchi et al., EP 
171496; Morrison et al., EP 173494; Neuberger et al., WO 8601533; Robinson et al. 1 , International 
Publication No. WO 8702671; Boulianne et al., Nature 312:643 (1984); Neuberger et al., Nature 
314:268 (1985)). 

b) Isolation Of Antibody Fragments Directed Against a Polypeptide of the Present 
Invention From A Library Of scFvs 

Naturally occurring V-genes isolated from human PBLs are constructed into a library of 

■ 

antibody fragments which contain reactivities against a polypeptide of the present invention to 
which the donor may or may not have been exposed (see e.g., U.S. Patent 5,885,793 incorporated 
herein by reference in its entirety). 

Rescue of the Library. A library of scFvs is constructed from the RNA of human PBLs as 
described in International Publication No. WO 92/01047. To rescue phage displaying antibody 
fragments, approximately 10 9 K coli harboring the phagemid are used to inoculate 50 ml of 2xTY 
containing 1% glucose and 100 jtig/ml of ampicillin (2xTY-AMP-GLU) and grown to an O.D. of 
0.8 with shaking. Five ml of this culture is used to inoculate 50 ml of 2xTY-AMP-GLU, 2 x 108 
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TU of delta gene 3 helper (M13 delta gene m, see International Publication No. WO 92/01047) 
are added and the culture incubated at 37°C for 45 minutes without shaking and then at 37°C for 
45 minutes with shaking. The culture is centrifuged at 4000 r.p.m. for 10 min. and the pellet 
resuspended in 2 liters of 2xTY containing 100 /ig/ml ampicillin and 50 ug/ml kanamycin and 
5 grown overnight. Phage are prepared as described in International Publication No. WO 92/01047. 

M13 delta gene IE is prepared as follows: M13 delta gene DDE helper phage does not 
encode gene in protein, hence the phage(mid) displaying antibody fragments have a greater 
avidity of binding to antigen. Infectious M13 delta gene HI particles are made by growing the 
helper phage in cells harboring a pUC19 derivative supplying the wild type gene III protein during 

10 . phage morphogenesis. The culture is incubated for 1 hour at 37° C without shaking and then for a 
further hour at 37°C with shaking. Cells are spun down (IEC-Centra 8,400 r.p.m. for 10 rnin), 
resuspended in 300 ml 2xTY broth containing 100 /ig ampicillin/ml and 25 \ig kanamycin/ml 
(2xTY-AMP-KAN) and grown overnight, shaking at 37°C. Phage particles are purified and 
concentrated from the culture medium by two PEG-precipitations (Sambrook et al., 1990), 

15 resuspended in 2 ml PBS and passed through a 0.45 /xm filter (Minisart NML; Sartorius) to give a 
final concentration of approximately 10 13 transducing units/ml (ampicillin-resistant clones). 

Panning of the Library. Immunotubes (Nunc) are coated overnight in PBS with 4 ml of 
either 100 ptg/ml or 10 /zg/ml of a polypeptide of the present invention. Tubes are blocked with 
2% Marvel-PBS for 2 hours at 37°C and then washed 3 times in PBS. Approximately 10 13 TU of 

20 phage is applied to the tube and incubated for 30 minutes at room temperature tumbling on an over 
and under turntable and then left to stand for another 1.5 hours. Tubes are washed 10 times with 
PBS 0.1% Tween-20 and 10 times with PBS. Phage are eluted by adding 1 ml of 100 mM 
triethylamine and rotating 15 minutes on an under and over turntable after which the solution is 
immediately neutralized with 0.5 ml of 1.0M Tris-HCl, pH 7.4. Phage are then used to infect 10 

25 ml of mid-log E. coli TGI by incubating eluted phage with bacteria for 30 minutes at 37°C. The 
E. coli are then plated on TYE plates containing 1% glucose and 100 /ig/ml ampicillin. The 
resulting bacterial library is then rescued with delta gene 3 helper phage as described above to 
prepare phage for a subsequent round of selection. This process is then repeated for a total of 4 
rounds of affinity purification with mbe-washing increased to 20 times with PBS, 0.1% Tween-20 

30 and 20 times with PBS for rounds 3 and 4. 

Characterization of Binders. Eluted phage from the 3rd and 4th rounds of selection are 
used to infect E. coli HB 2151 and soluble scFv is produced (Marks, et al., 1991) from single 
colonies for assay. ELISAs are performed with microtitre plates coated with either 10 pg/ml of the 
polypeptide of the present invention in 50 mM bicarbonate pH 9.6. Clones positive in ELISA are 

35 further characterized by PCR fingerprinting (see, e.g., International Publication No. WO 
92/01047) and then by sequencing. These ELISA positive clones may also be further 
characterized by techniques known in the art, such as, for example, epitope mapping, binding 
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affinity, receptor signal transduction, ability to block or competitively inhibit antibody/antigen 
binding, and competitive agonistic or antagonistic activity. 

Example 11: Method of Determining Alterations in a Gene Corresponding to a 

Polynucleotide 

RNA isolated from entire families or individual patients presenting with a gastrointestinal 
disease or disorder is isolated. cDNA is then generated from these RNA samples using protocols 
known in the art. (See, Sambrook.) The cDNA is then used as a template for PCR, employing 
primers surrounding regions of interest in SEQ ID NO:X; and/or the nucleotide sequence of the 
cDNA contained in ATCC Deposit No:ZL Suggested PCR conditions consist of 35 cycles at 95 
degrees C for 30 seconds; 60-120 seconds at 52-58 degrees C; and 60-120 seconds at 70 degrees 
C, using buffer solutions described in Sidransky et aL, Science 252:706 (1991). 

PCR products are then sequenced using primers labeled at their 5' end with T4 
polynucleotide kinase, employing SequiTherm Polymerase (Epicentre Technologies). The intron- 
exon boundaries of selected exons is also determined and genomic PCR products analyzed to 
confirm the results. PCR products harboring suspected mutations are then cloned and sequenced 
to validate the results of the direct sequencing. 

PCR products are cloned into T-tailed vectors as described in Holton et al., Nucleic Acids 
Research, 19:1156 (1991) and sequenced with T7 polymerase (United States Biochemical). 
Affected individuals are identified by mutations not present in unaffected individuals. 

Genomic rearrangements are also observed as a method of detenrnning alterations in a 
gene corresponding to a polynucleotide. Genomic clones isolated according to Example 2 are 
nick-translated with digoxigenindeoxy-uridine 5 ! 4riphosphate (Boehringer Manheim), and FISH 
performed as described in Johnson et al., Methods Ceil Biol. 35:73-99 (1991). Hybridization with 
the labeled probe is carried out using a vast excess of human cot-1 DNA for specific hybridization 
to the corresponding genomic locus. 

Chromosomes are counterstained with 4,6^iamino-2~phenylidoIe and propidium iodide, 
producing a combination of C- and R-bands. Aligned images for precise mapping are obtained 
using a triple-band filter set (Chroma Technology, Brattleboro, VT) in combination with a cooled 
charge-coupled device camera (Photometries, Tucson, AZ) and variable excitation wavelength 
filters. (Johnson et al., Genet Anal. Tech. Appl., 8:75 (1991)). Image collection, analysis and 
chromosomal fractional length measurements are performed using the ISee Graphical Program 
System (Inovision Corporation, Durham, NC.) Chromosome alterations of the genomic region 
hybridized by the probe are identified as insertions, deletions, and translocations. These 
alterations are used as a diagnostic marker for an associated disease. 
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Example 12: Method of Detecting Abnormal Levels of a Polypeptide in a Biological 

Sample 

A polypeptide of the present invention can be detected in a biological sample, and if an 
5 increased or decreased level of the polypeptide is detected, this polypeptide is a marker for a 
particular phenotype. Methods of detection are numerous, and thus, it is understood that one 
skilled in the art can modify the following assay to fit their particular needs. 

For example, antibody-sandwich ELISAs are used to detect polypeptides in a sample, 
preferably a biological sample. Wells of a microtiter plate are coated with specific antibodies, at a 
10 final concentration of 0.2 to 10 ug/ml. The antibodies are either monoclonal or polyclonal and are 
produced by the method described in Example 10. The wells are blocked so that non-specific 
binding of the polypeptide to the well is reduced. 

The coated wells are then incubated for > 2 hours at RT with a sample containing the 
polypeptide. Preferably, serial dilutions of the sample should be used to validate results. The 
15 plates are then washed three times with deionized or distilled water to remove unbound 
polypeptide. 

Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a concentration of 25- 
400 ng, is added and incubated for 2 hours at room temperature. The plates are again washed three 
times with deionized or distilled water to remove unbound conjugate. 

20 Add 75 ul of 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl phosphate (NPP) 

substrate solution to each well and incubate 1 hour at room temperature. Measure the reaction by 
a microtiter plate reader. Prepare a standard curve, using serial dilutions of a control sample, and 
plot polypeptide concentration on the X-axis (log scale) and fluorescence or absorbance of the Y- 
axis (linear scale). Interpolate the concentration of the polypeptide in the sample using the 

25 standard curve. 



Example 13: Formulation 

The invention also provides methods of preventing, treating and/or ameliorating a 
30 gastrointestinal disease or disorder by administration to a subject of an effective amount of a 
Therapeutic. By therapeutic is meant polynucleotides or polypeptides of the invention (including 
fragments and variants), agonists or antagonists thereof, and/or antibodies thereto, in combination 
with a pharmaceutically acceptable carrier type (e.g., a sterile carrier). 

The Therapeutic will be formulated and dosed in a fashion consistent with good medical 
35 practice, taking into account the clinical condition of the individual patient (especially the side 
effects of treatment with the Therapeutic alone), the site of delivery, the method of administration, 
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the scheduling of administration, and other factors known to practitioners. The "effective amount" 
for purposes herein is thus determined by such considerations. 

As a general proposition, the total pharmaceutical effective amount of the Therapeutic 
administered parenterally per dose will be in the range of about lug/kg/day to 10 mg/kg/day of 
5 patient body weight, although, as noted above, this will be subject to therapeutic discretion. More 
preferably, this dose is at least 0.01 mg/kg/day, and most preferably for humans between about 
0.01 and 1 mg/kg/day for the hormone. If given continuously, the Therapeutic is typically 
administered at a dose rate of about 1 ug/kg/hour to about 50 ug/kg/hour, either by 1-4 injections 
per day or by continuous subcutaneous infusions, for example, using a rnini-pump. An 
10 intravenous bag solution may also be employed. The length of treatment needed to observe 
changes and the interval following treatment for responses to occur appears to vary depending on 
the desired effect. 

Therapeutics can be are administered orally, rectally, parenterally, intracistemally, 
intravaginally, intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal 
15 patch), bucally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier" refers to a non- 
toxic solid, semisolid or liquid filler, diluent, encapsulating material or formulation auxiliary of 
any. The term "parenteral" as used herein refers to modes of administration which include 
intravenous, intramuscular, intraperitoneal, intrasternal, subcutaneous and intraarticular injection 
and infusion. 

20 Therapeutics of the invention are also suitably administered by sustained-release systems. 

Suitable examples of sustained-release Therapeutics are administered orally, rectally, parenterally, 
intracistemally, intravaginally, intraperitoneally, topically (as by powders, ointments, gels, drops 
or transdermal patch), bucally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier" 
refers to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material or formulation 

25 auxiliary of any type. The term "parenteral" as used herein refers to modes of administration 
which include intravenous, intramuscular, intraperitoneal, intrasternal, subcutaneous and 
intraarticular injection and infusion. 

Therapeutics of the invention are also suitably administered by sustained-release systems. 
Suitable examples of sustained-release Therapeutics include suitable polymeric materials (such as, 

30 for example, semi-permeable polymer matrices in the form of shaped articles, e.g., films, or 
mirocapsules), suitable hydrophobic materials (for example as an emulsion in an acceptable oil) or 
ion exchange resins, and sparingly soluble derivatives (such as, for example, a sparingly soluble 
salt). 

Sustained-release matrices include polylactides (U.S. Pat No. 3,773,919, EP 58,481), 
35 copolymers of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman et al., Biopolymers 
22:547-556 (1983)), poly (2- hydroxyethyl metbacrylate) (Langer et al., J. Biomed. Mater. Res. 
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15:167-277 (1981), and Langer, Chem. Tech. 12:98-105 (1982)), ethylene vinyl acetate (Langer et 
al„ Id.) or poly-D- (-)-3-hydroxybutyric acid (EP 133,988). 

In a preferred embodiment, polypeptide, polynucleotide, and antibody compositions of the 
invention are formulated in a biodegradable, polymeric drug delivery system, for example as 
5 described in U.S. Patent Nos. 4,938,763; 5,278,201; 5,278,202; 5,324,519; 5,340,849; and 
5,487,897 and in International Publication Numbers WO01/35929, WO00/24374, and 
WO00/06117 which are hereby incorporated by reference in their entirety. In specific preferred 
embodiments the polypeptide, polynucleotide, and antibody compositions of the invention are 
formulated using the ATRIGEL® Biodegradable System of Atrix Laboratories, Inc. (Fort Collins, 
10 Colorado). 

Examples of biodegradable polymers which can be used in the formulation of polypeptide, 
polynucleotide, and antibody compositions, include but are not limited to, polylactides, 
polyglycolides, polycaprolactones, polyanhydrides, polyamides, polyurethanes, polyesteramides, 
polyortboesters, polydioxanones, polyacetals, polyketals, polycarbonates, polyorthocarbonates, 

15 polyphosphazenes, polyhydroxybutyrates, polyhydroxyvalerates, polyalkylene oxalates, 
polyalkylene succinates, poly(malic acid), poly(amino acids), poly(methyl vinyl ether), 
poly(maleic anhydride), polyvinylpyrrolidone, polyethylene glycol, polyhydroxycellulose, chitin, 
chitosan, and copolymers, terpolymers, or combinations or mixtures of the above materials. The 
preferred polymers are those that have a lower degree of crystallization and are more hydrophobic. 

20 These polymers and copolymers are more soluble in the biocompatible solvents than the highly 
crystalline polymers such as polyglycolide and chitin which also have a high degree of hydrogen- 
bonding. Preferred materials with the desired solubility parameters are the polylactides, 
polycaprolactones, and copolymers of these with glycolide in which there are more amorphous 
regions to enhance solubility. In specific preferred embodiments, the biodegradable polymers 

25 which can be used in the formulation of polypeptide, polynucleotide, and antibody compositions 
are poly(lactide-co-glycolides). Polymer properties such as molecular weight, hydrophobicity, and 
lactide/glycolide ratio may be modified to obtain the desired polypeptide, polynucleotide, or 
antibody release profile (See, e.g., Ravivarapu et al., Journal of Pharmaceutical Sciences 89:732- 
741 (2000), which is hereby incorporated by reference in its entirety). 

30 It is also preferred that the solvent for the biodegradable polymer be non-toxic, water 

miscible, and otherwise biocompatible. Examples of such solvents include, but are not limited to, 
N-methyl-2-pyrrolidone, 2-pyrrolidone, C2 to C6 alkanols, CI to C15 alchohols, dils, triols, and 
tetraols such as ethanol, glycerine propylene glycol, butanol; C3 to C15 alkyl ketones such as 
acetone, diethyl ketone and methyl ethyl ketone; C3 to C15 esters such as methyl acetate, ethyl 

35 acetate, ethyl lactate; alkyl ketones such as methyl ethyl ketone, CI to C15 amides such as 
dimemylformamide, dimemylacetamide and caprolactam; C3 to C20 ethers such as 
tetrahydrofuran, or solketal; tweens, triacetin, propylene carbonate, decylmethyl sulfoxide, 
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dimethyl sulfoxide, oleic acid, l-dodecylazacycloheptan-2-one, Other preferred solvents are 
benzyl alchohol, benzyl benzoate, dipropylene glycol, tributyrin, ethyl oleate, glycerin, glycofural, 
isopropyl myristate, isopropyl palmitate, oleic acid, polyethylene glycol, propylene carbonate, and 
triethyl citrate. The most preferred solvents are N-methyl-2-pyrrolidone, 2-pyrrolidone, dimethyl 
5 sulfoxide, triacetin, and propylene carbonate because of the solvating ability and their 
compatibihty. 

Additionally, formulations comprising polypeptide, polynucleotide, and antibody 
compositions and a biodegradable polymer may also include release-rate modification agents 
and/or pore-forming agents. Examples of release-rate modification agents include, but are not 

10 limited to, fatty acids, triglycerides, other like hydrophobic compounds, organic solvents, 
plasticizing compounds and hydrophilic compounds. Suitable release rate modification agents 
include, for example, esters of mono-, di-, and tricarboxylic acids, such as 2-ethoxyethyl acetate, 
methyl acetate, ethyl acetate, diethyl phthalate, dimethyl phthalate, dibutyl phthalate, dimethyl 
adipate, dimethyl succinate, dimethyl oxalate, dimethyl citrate, triethyl citrate, acetyl tributyl 

15 citrate, acetyl triethyl citrate, glycerol triacetate, di(n-butyl) sebecate, and the like; polyhydroxy 
alcohols, such as propylene glycol, polyethylene glycol, glycerin, sorbitol, and the like; fatty acids; 
triesters of glycerol, such as triglycerides, epoxidized soybean oil, and other epoxidized vegetable 
oils; sterols, such as cholesterol; alcohols, such as C.sub.6 -C.sub.12 alkanols, 2-ethoxyethanol. 
The release rate modification agent may be used singly or in combination with other such agents. 

20 Suitable combinations of release rate modification agents include, but are not limited to, 
glycerin/propylene glycol, sorbitol/giycerine, ethylene oxide/propylene oxide, butylene 
glycol/adipic acid, and the like. Preferred release rate modification agents include, but are not 
limited to, dimethyl citrate, triethyl citrate, ethyl heptanoate, glycerin, and hexanediol. Suitable 
pore-forming agents that may be used in the polymer composition include, but are not limited to, 

25 sugars such as sucrose and dextrose, salts such as sodium chloride and sodium carbonate, 
polymers such as hydroxylpropylcelluiose, carboxymethyicellulose, polyethylene glycol, and 
polyvinylpyrrolidone. Solid crystals that will provide a defined pore size, such as salt or sugar, are 
preferred. 

In specific preferred embodiments the polypeptide, polynucleotide, and antibody 
30 compositions of the invention are formulated using the BEMA™ BioErodible Mucoadhesive 
System, MCA™ Mucocutaneous Absorption System, SMP™ Solvent MicroParticle System, or 
BCP™ Biocompatible Polymer System of Atrix Laboratories, Inc. (Fort Collins, Colorado). 

Sustained-release Therapeutics also include liposomally entrapped Therapeutics of the 
invention (see generally, Langer, Science 249:1527-1533 (1990); Treat et al., in Liposomes in the 
35 Tlierapy of Infectious Disease and Cancer, Lopez-Berestein and Fidler (eds.), Liss, New York, pp. 
317 -327 and 353-365 (1989)). Liposomes containing the Therapeutic are prepared by methods 
known per se: DE 3,218,121; Epstein et al., Proa Natl. Acad. Sci. (USA) 82:3688-3692 (1985); 
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Hwang et al., Proc. Natl. Acad. Sci.(USA) 77:403(M034 (1980); EP 52,322; EP 36,676; EP 
88,046; EP 143,949; EP 142,641; Japanese Pat AppL 83-118008; U.S. Pat. Nos. 4,485,045 and 
4,544,545; and EP 102,324. Ordinarily, the liposomes are of the small (about 200-800 Angstroms) 
unilamellar type in which the lipid content is greater than about 30 mol. percent cholesterol, the 

5 selected proportion being adjusted for the optimal Therapeutic. 

In yet an additional embodiment, the Therapeutics of the invention are delivered by way 
of a pump (see Langer, supra; Sefton, CRC Crit. Ref. Biomed. Eng. 14:201 (1987); Buchwald et 
al., Surgery 88:507 (1980); Saudek et al., N. Engl. J. Med. 321:574 (1989)). 

Other controlled release systems are discussed in the review by Langer (Science 

10 249:1527-1533 (1990)). 

For parenteral administration, in one embodiment, the Therapeutic is formulated generally 
by mixing it at the desired degree of purity, in a unit dosage injectable form (solution, suspension, 
or emulsion), with a pharmaceutically acceptable carrier, i.e., one that is non-toxic to recipients at 
the dosages and concentrations employed and is compatible with other ingredients of the 

15 formulation. For example, the formulation preferably does not include oxidizing agents and other 
compounds that are known to be deleterious to the Therapeutic. 

Generally, the formulations are prepared by contacting the Therapeutic uniformly and 
mtirnately with liquid carriers or finely divided solid carriers or both. Then, if necessary, the 
product is shaped into the desired formulation. Preferably the carrier is a parenteral carrier, more 

20 preferably a solution that is isotonic with the blood of the recipient. Examples of such carrier 
vehicles include water, saline, Ringer's solution, and dextrose solution. Non-aqueous vehicles 
such as fixed oils and ethyl oleate are also useful herein, as well as liposomes. 

The carrier suitably contains minor amounts of additives such as substances that enhance 
isotonicity and chemical stability. Such materials are non-toxic to recipients at the dosages and 

25 concentrations employed, and include buffers such as phosphate, citrate, succinate, acetic acid, and 
other organic acids or their salts; antioxidants such as ascorbic acid; low molecular weight (less 
than about ten residues) polypeptides, e.g., polyarginine or tripeptides; proteins, such as serum 
albumin, gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino 
acids, such as glycine, glutamic acid, aspartic acid, or arginine; monosaccharides, disaccharides, 

30 and other carbohydrates including cellulose or its derivatives, glucose, manose, or dextrins; 
chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; counterions such as 
sodium; and/or nonionic surfactants such as poly sorbates, poloxamers, or PEG. 

The Therapeutic is typically formulated in such vehicles at a concentration of about 0.1 
mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of about 3 to 8. It will be understood that the 

35 use of certain of the foregoing excipients, carriers, or stabilizers will result in the formation of 
polypeptide salts. 
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Any pharmaceutical used for therapeutic administration can be sterile. Sterility is readily 
accomplished by filtration through sterile filtration membranes (e.g., 0.2 micron membranes). 
Therapeutics generally are placed into a container having a sterile access port, for example, an 
intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle. 

Therapeutics ordinarily will be stored in unit or multi-dose containers, for example, sealed 
ampoules or vials, as an aqueous solution or as a lyophilized formulation for reconstitution. As an 
example of a lyophilized formulation, 10-ml vials are filled with 5 ml of sterile-filtered 1% (w/v) 
aqueous Therapeutic solution, and the resulting mixture is lyophilized. The infusion solution is 
prepared by reconstituting the lyophilized Therapeutic using bacteriostatic Water-for-Injection. 

The invention also provides a pharmaceutical pack or kit comprising one or more 
containers filled with one or more ,of the ingredients of the Therapeutics of the invention. 
Associated with such container(s) can be a notice in the form prescribed by a governmental agency 
regulating the manufacture, use or sale of pharmaceuticals or biological products, which notice 
reflects approval by the agency of manufacture, use or sale for human administration. In addition, 
the Therapeutics may be employed in conjunction with other therapeutic compounds. 

The Therapeutics of the invention may be administered alone or in combination with 
adjuvants. Adjuvants that may be administered with the Therapeutics of the invention include, but 
are not limited to, alum, alum plus deoxycholate (ImmunoAg), MTP-PE (Biocine Corp.), QS21 
(Genentech, Inc.), BCG (e.g., THERACYS®), MPL and nonviable prepartions of 
Corynebacterium parvum. In a specific embodiment, Therapeutics of the invention are 
administered in combination with alum. In another specific embodiment, Therapeutics of the 
invention are administered in combination with QS-21. Further adjuvants that may be 
administered with the Therapeutics of the invention include, but are not limited to, 
Monophosphoryl lipid immunomodulator, AdjuVax 100a, QS-21, QS-18, CRL1005, Aluminum 
salts, MF-59, and Virosomal adjuvant technology. Vaccines that may be administered with the 
Therapeutics of the invention include, but are not limited to, vaccines directed toward protection 
against MMR (measles, mumps, rubella), polio, varicella, tetanus/diptheria, hepatitis A, hepatitis 
B, Haemophilus influenzae B, whooping cough, pneumonia, influenza, Lyme's Disease, rotavirus, 
cholera, yellow fever, Japanese encephalitis, poliomyelitis, rabies, typhoid fever, and pertussis. 
Combinations may be administered either concomitandy, e.g., as an admixture, separately but 
simultaneously or concurrently; or sequentially. This includes presentations in which the 
combined agents are administered together as a therapeutic mixture, and also procedures in which 
the combined agents are administered separately but simultaneously, e.g., as through separate 
intravenous lines into the same individual. Administration "in combination" further includes the 
separate administration of one of the compounds or agents given first, followed by the second. 

The Therapeutics of the invention may be administered alone or in combination with other 
therapeutic agents. Therapeutic agents that may be administered in combination with the 
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Therapeutics of the invention, include but not limited to, chemotherapeutic agents, antibiotics, 
steroidal and non-steroidal anti-inflammatories, conventional immunotherapeutic agents, and/or 
therapeutic treatments described below. Combinations may be administered either concomitantly, 
e.g., as an admixture, separately but simultaneously or concurrently; or sequentially. This includes 
5 presentations in which the combined agents are administered together as a therapeutic mixture, and 
also procedures in which the combined agents are administered separately but simultaneously, e.g., 
as through separate intravenous lines into the same individual. Administration n in combination" 
further includes the separate administration of one of the compounds or agents given first, 
followed by the second. 

10 In one embodiment, the Therapeutics of the invention are administered in combination 

with an anticoagulant. Anticoagulants that may be administered with the compositions of the 
invention include, but are not limited to, heparin, low molecular weight heparin, warfarin sodium 
(e.g., COUMADIN®), dicumarol, 4-hydroxycoumarin, anisindione (e.g., MIRADON™), 
acenocoumarol (e.g., nicoumalone, SINTHROME™), indan~l,3-dione, phenprocoumon (e.g., 

15 MARCUMAR™), ethyl biscoumacetate (e.g., TROMEXAN™), and aspirin. In a specific 
embodiment, compositions of the invention are administered in combination with heparin and/or 
warfarin. In another specific embodiment, compositions of the invention are administered in 
combination with warfarin. Id another specific embodiment, compositions of the invention are 
administered in combination with warfarin and aspirin. In another specific embodiment, 

20 compositions of the invention are administered in combination with heparin. In another specific 
embodiment, compositions of the invention are administered in combination with heparin and 
aspirin. 

In another embodiment, the Therapeutics of the invention are administered in combination 
with thrombolytic drugs. Thrombolytic drugs that may be administered with the compositions of 

25 the invention include, but are not limited to, plasminogen, iys-plasminogen, alpha2-antiplasmin, 
streptokinae (e.g., KABIKINASE™), antiresplace (e.g., EMINASE™), tissue plasminogen 
activator (t-PA, altevase, ACTIVASE™), urokinase (e.g., ABBOKINASE™), sauruplase, 
(Prourokinase, single chain urokinase), and aminocaproic acid (e.g., AMICAR™). In a specific 
embodiment, compositions of the invention are administered in combination with tissue 

30 plasminogen activator and aspirin. 

In another embodiment, the Therapeutics of the invention are administered in combination 
with antiplatelet drugs. Antiplatelet drugs that may be administered with the compositions of the 
invention include, but are not limited to, aspirin, dipyridamole (e.g., PERSANTINE™), and 
ticlopidine (e.g., TICLID™). 

35 In specific embodiments, the use of anti-coagulants, thrombolytic and/or antiplatelet drugs 

in combination with Therapeutics of the invention is contemplated for the detection, prevention, 
diagnosis, prognostication, treatment, and/or amelioration of thrombosis, arterial thrombosis, 
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venous thrombosis, thromboembolism, pulmonary embolism, atherosclerosis, myocardial 
infarction, transient ischemic attack, unstable angina. In specific embodiments, the use of 
anticoagulants, thrombolytic drugs and/or antiplatelet drugs in combination with Therapeutics of 
the invention is contemplated for the prevention of occulsion of saphenous grafts, for reducing the 
risk of periprocedural thrombosis as might accompany angioplasty procedures, for reducing the 
risk of stroke in patients with atrial fibrillation including nonrheumatic atrial fibrillation, for 
reducing the risk of embolism associated with mechanical heart valves and or mitral valves 
disease. Other uses for the therapeutics of the invention, alone or in combination with antiplatelet, 
anticoagulant, and/or thrombolytic drugs, include, but are not limited to, the prevention of 
occlusions m extracorporeal devices (e.g., intravascular canulas, vascular access shunts in 
hemodialysis patients, hemodialysis machines, and cardiopulmonary bypass machines). 

In certain embodiments, Therapeutics of the invention are administered in combination 
with antiretroviral agents, nucleoside/nucleotide reverse transcriptase inhibitors (NRTIs), non- 
nucleoside reverse transcriptase inhibitors (NNRTIs), and/or protease inhibitors (Pis). NRTIs that 
may be administered in combination with the Therapeutics of the invention, include, but are not 
limited to, RETROVIR™ (zidovudine/AZT), VIDEX™ (didanosine/ddl), HTVTD™ 
(zalcitabine/ddC), ZERTT™ (stavudine/d4T), EPIVIR™ (3amivudine/3TC), and COMBIVIR™ 
(zidovudine/lamivudine). NNRTIs that may be administered in combination with the Therapeutics 
of the invention, include, but are not limited to, VIRAMUNE™ (nevirapine), RESCRBPTOR™ 
(delavirdine), and SUSTIVA™ (efavirenz). Protease inhibitors that may be administered in 
combination with the Therapeutics of the invention, include, but are not limited to, CRIXIVAN™ 
(indinavir), NORVIR™ (ritonavir), INVIRASE™ (saquinavir), and VIRACEPT™ (nelfinavir). In 
a specific embodiment, antiretroviral agents, nucleoside reverse transcriptase inhibitors, non- 
nucleoside reverse transcriptase inhibitors, and/or protease inhibitors may be used in any 
combination with Therapeutics of the invention to treat AIDS and/or to prevent or treat HIV 
infection. 

Additional NRTIs include LODENOS1NE™ (F-ddA; an acid-stable adenosine NRTI; 
Triangle/Abbott; COVIRACIL™ (emtricitabine/FTC; structurally related to larnivudine (3TC) but 
with 3- to 10-fold greater activity in vitro; Triangle/Abbott); dOTC (BCH-10652, also structurally 
related to.lamivudine but retains activity against a substantial proportion of lamivudine-resistant 
isolates; Biochem Pharma); Adefovir (refused approval for anti-HIV therapy by FDA; Gilead 
Sciences); PREVEON® (Adefovir Dipivoxil, the active prodrug of adefovir, its active form is 
PMEA-pp); TENOFOVIR™ (bis-POC PMPA, a PMPA prodrug; Gilead); DAPD/DXG (active 
metabolite of DAPD; Triangle/Abbott); D-D4FC (related to 3TC, with activity against AZT/3TC- 
resistant virus); GW420867X (Glaxo Wellcome); ZIAGEN™ (abacavir/159U89; Glaxo Wellcome 
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Inc.); CS-87 tf'azidc^^'-dideoxyiiridine; WO 99/66936); and S-acyl-2-thioethyI (SATE)- 
bearing prodrug forms of 0-L-FD4C and fJ-L-FddC (WO 98/17281). 

Additional NNRTIs include COACTINON™ (Ernivirine/MKC-442, potent NNRTI of the 
HEPT class; Triangle/Abbott); CAPRAVIRINE™ (AG-1549/S-1153, a next generation NNRTI 
5 with activity against viruses containing the K103N mutation; Agouron); PNIM42721 (has 20- to 
50-fold greater activity than its predecessor delavirdine and is active against K103N mutants; 
Pharmacia & Upjohn); DPC-961 and DPC-963 (second-generation derivatives of efavirenz, 
designed to be active against viruses with the K103N mutation; DuPont); GW-420867X (has 25- 
fold greater activity than HBY097 and is active against K103N mutants; Glaxo Wellcome); 

10 CALANOLIDE A (naturally occurring agent from the latex tree; active against viruses containing 
either or both the Y181C and K103N mutations); and Propolis (WO 99/49830). 

Additional protease inhibitors include LOPINAVIR™ (ABT378/r; Abbott Laboratories); 
BMS-232632 (an azapeptide; Bristol-Myres Squibb); TIPRANAVIR™ (PNU-140690, a non- 
peptic dihydxopyrone; Pharmacia & Upjohn); PD-178390 (a nonpeptidic dihydropyrone; Parke- 

15 Davis); BMS 232632 (an azapeptide; Bristol-Myers Squibb); L-756,423 (an indinavir analog; 
Merck); DMP-450 (a cyclic urea compound; Avid & DuPont); AG-1776 (a rjeptidomimetic with 
in vitro activity against protease inhibitor-resistant viruses; Agouron); VX-175/GW-433908 
(phosphate prodrug of amprenavir; Vertex & Glaxo Welcome); CGP61755 (Ciba); and 
AGENERASE™ (amprenavir; Glaxo Wellcome Inc.). 

20 Additional antiretroviral agents include fusion inhibitors/gp41 binders. Fusion 

inhibitors/gp41 binders include T-20 (a peptide from residues 643-678 of the HIV gp41 
transmembrane protein ectodomain which binds to gp41 in its resting state and prevents 
transformation to the fiisogenic state; Trimeris) and T-1249 (a second-generation fusion inhibitor, 
Trimeris). 

25 Additional antiretroviral agents include fusion inhibitors/chemokine receptor antagonists. 

Fusion inhibitors/chemokine receptor antagonists include CXCR4 antagonists such as AMD 3100 
(a bicyclam), SDF-1 and its analogs, and ALX40-4C (a cationic peptide), T22 (an 18 amino acid 
peptide; Trimeris) and the T22 analogs T 134 and T140; CCR5 antagonists such as RANTES (9- 
68), AOP-RANTES, NNY-RANTES, and TAK-779; and CCR5/CXCR4 antagonists such as NSC 

30 651016 (a distamycin analog). Also included are CCR2B, CCR3, and CCR6 antagonists. 
Chemokine recpetor agonists such as RANTES, SDF-1, MlP-la, MIP-ip, etc., may also inhibit 
fusion. 

Additional antiretroviral agents include integrase inhibitors. Integrase inhibitors include 
dicaffeoylquinic (DFQA) acids; L-chicoric acid (a dicaffeoyltartaric (DCTA) acid); qumalizarin 
35 (QLQ and related anthraquinones; ZINTEVIR™ (AR 177, an oligonucleotide that probably acts at 
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cell surface rather than being a true integrase inhibitor; Arondex); and naphthols such as those 
disclosed in WO 98/50347. 

Additional antiretroviral agents include hydroxyurea-like compunds such as BCX-34 (a 
purine nucleoside phosphorylase inhibitor, Biocryst); ribonucleotide reductase inhibitors such as 
5 DIDOX™ (Molecules for Health); inosine monophosphate dehydrogenase (IMPDH) inhibitors 
sucha as VX-497 (Vertex); and mycopholic acids such as CellCept (mycophenolate mofetil; 
Roche). 

Additional antiretroviral agents include inhibitors of viral integrase, inhibitors of viral 
genome nuclear translocation such as arylene bis(methylketone) compounds; inhibitors of HIV 
10 entry such as AOP-RANTES, NNY-RANTES, RANTES-IgG fusion protein, soluble complexes 
of RANTES and glycosaminoglycans (GAG), and AMD-3100; nucleocapsid zinc finger inhibitors 
such as dithiane compounds; targets of HIV Tat and Rev; and pharmacoenhancers such as ABT- 

r 

378. 

Other antiretroviral therapies and adjunct therapies include cytokines and lymphokines 

15 such as MIP-lo, MIP-ip, SDF-la, IL-2, PROLEUKIN™ (aldesleukin/L2-7001; Chiron), IL-4, IL- 
10, DL-12, and IL-13; interferons such as IFN-<x2a; antagonists of TNFs, NFkB, GM-CSF, M-CSF, 
and IL-10; agents that modulate immune activation such as cyclosporin and prednisone; vaccines 
such as Remune™ (HIV Immunogen), APL 400-003 (Apollon), recombinant gpl20 and 
fragments, bivalent (B/E) recombinant envelope glycoprotein, rgpl20CM235, MN rgpl20, SF-2 

20 rgpl20, gpl20/soluble CD4 complex, Delta JR-FL protein, branched synthetic peptide derived 
from discontinuous gpl20 C3/C4 domain, fusion-competent immunogens, and Gag, Pol, Nef, and 
Tat vaccines; gene-based therapies such as genetic suppressor elements (GSEs; WO 98/54366), 
and mtrakines (genetically modified CC chemokines targetted to the ER to block surface 
expression of newly synthesized CCR5 (Yang et al, PNAS 94:11567-72 (1997); Chen et al y Nat. 

25 Med. 3:1110-16 (1997)); antibodies such as the anti-CXCR4 antibody 12G5, the anti-CCR5 
antibodies 2D7, 5C7, PA8, PA9, PA10, PA11, PA12, and PA14, the anti-CD4 antibodies Q4120 
and RPA-T4, the anti-CCR3 antibody 7B11, the anti-gpl20 antibodies 17b, 48d, 447-52D, 257-D, 
268-D and 50.1, anti-Tat antibodies, anti-TNF-a antibodies, and monoclonal antibody 33 A; aryl 
hydrocarbon (AH) receptor agonists and antagonists such as TCDD, 3,3\4,4\5- 

30 pentachlorobiphenyl, 3,3\4,4 , -tetrachlorobiphenyl, and a-naphthoflavone (WO 98/30213); and 
antioxidants such as y-L-glutamyl-L-cysteine ethyl ester (v-GCE; WO 99/56764). 

In a further embodiment, the Therapeutics of the invention are administered in 
combination with an antiviral agent. Antiviral agents that may be administered with the 
Therapeutics of the invention include, but are not limited to, acyclovir, ribavirin, amantadine, and 

35 remantidine. 
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In other embodiments, Therapeutics of the invention may be administered in combination 
with anti-opportunistic infection agents. Anti-opportunistic agents that may be administered in 
combination with the Therapeutics of the invention, include, but are not limited to, 
TRIMKniOPRIM-SlJLFAMETHOXAZOLE TM , DAPS ONE™, PENTAMIDINE™, 

5 ATOVAQUONE™, ISONIAZID™, RIFAMPIN™, PYRAZIN AMIDE™, ETHAMBUTOL™, 
RIFABUTIN™, CLARITHROMYCIN™, AZITHROMYCIN™, GANCICLOVIR™, 
FOSCARNET™, CIDOFOVIR™, FLUCONAZOLE™, ITRACONAZOLE™, 
KETOCONAZOLE™, ACYCLOVIR™, FAMCICOLVIR™, PYRIMETHAMINE™, 
LEUCOVORIN™, NEUPOGEN™ (filgrastim/G-CSF), and LEUKINE™ (sargramostim/GM- 

10 CSF). In a specific embodiment, Therapeutics of the invention are used in any combination with 
TRIMETHOPRIM-SULFAMETHOXAZOLE™, DAPSONE™, PENTAMIDINE™, and/or 
ATOVAQUONE™ to prophylactically treat or prevent an opportunistic Pneumocystis carinii 
pneumonia infection. In another specific embodiment, Therapeutics of the invention are used in 
any combination with ISONIAZID™, RIFAMPIN™, PYRAZINAMIDE™, and/or 

15 ETHAMBUTOL™ to prophylactically treat or prevent an opportunistic Mycobacterium avium 
complex infection. In another specific embodiment, Therapeutics of the invention are used in any 
combination with RIFABUTIN™, CLARITHROMYCIN™, and/or AZITHROMYCIN™ to 
prophylactically treat or prevent an opportunistic Mycobacterium tuberculosis infection. In 
another specific embodiment, Therapeutics of the invention are used in any combination with 

20 GANCICLOVIR™, FOSCARNET™, and/or CIDOFOVIR™ to prophylactically treat or prevent an 
opportunistic cytomegalovirus infection. In another specific embodiment, Therapeutics of the 
invention are used in any combination with FLUCONAZOLE™, ITRACONAZOLE™, and/or 
KETOCONAZOLE™ to prophylactically treat or prevent an opportunistic fungal infection. In 
another specific embodiment, Therapeutics of the invention are used in any combination with 

25 ACYCLOVIR™ and/or FAMCICOLVIR™ to prophylactically treat or prevent an opportunistic 
herpes simplex virus type I and/or type II infection. In another specific embodiment, Therapeutics 
of the invention are used in any combination with PYRIMETHAMINE™ and/or LEUCOVORIN™ 
to prophylactically treat or prevent an opportunistic Toxoplasma gondii infection. In another 
specific embodiment, Therapeutics of the invention are used in any combination with 

30 LEUCOVORIN™ and/or NEUPOGEN™ to prophylactically treat or prevent an opportunistic 
bacterial infection. 

In a further embodiment, the Therapeutics of the invention are administered in 
combination with an antibiotic agent. Antibiotic agents that may be administered with the 
Therapeutics of the invention include, but are not limited to, amoxicillin, beta-lactamases, 
35 aminoglycosides, beta-lactam (glycopeptide), beta-lactamases, Clindamycin, chloramphenicol, 
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cephalosporins, ciprofloxacin, erythromycin, fluoroquinolones, macrolides, metronidazole, 
penicillins, qiiinolones, rapamycin, rifampin, streptomycin, sulfonamide, tetracyclines, 
trimethoprim, trimethoprim-sulfamethoxazole, and vancomycin. 

In other embodiments, the Therapeutics of the invention are administered in combination 
5 with immunestimul ants . Lmmunostimulants that may be administered in combination with the 
Therapeutics of the invention include, but are not limited to, levamisole (e.g., ERGAMISOL™), 
isoprinosme (e.g. INOSIPLEX™), interferons (e.g. interferon alpha), and interleukins (e.g., IL-2). 

In other embodiments, Therapeutics of the invention are administered in combination with 
immunosuppressive agents. Immunosuppressive agents that may be administered in combination 

10 with the Therapeutics of the invention include, but are not limited to, steroids, cyclosporine, 
cyclosporine analogs, cyclophosphamide methylprednisone, prednisone, azathioprine, FK-506, 15- 
deoxyspergualin, and other immunosuppressive agents that act by suppressing the function of 
responding T cells. Other immunosuppressive agents that may be administered in combination 
with the Therapeutics of the invention include, but are not limited to, prednisolone, methotrexate, 

15 thalidomide, methoxsalen, rapamycin, leflunomide, mizoribine (BREDININ™), brequinar, 
deoxyspergualin, and azaspirane (SKF 105685), ORTHOCLONE OKT® 3 (muromonab-CD3), 
SANDIMMUNE™, NEORAL™, SANGDYA™ (cyclosporine), PROGRAF® (FK506, 
tacrolimus), CELLCEPT® (mycophenolate motefil, of which the active metabolite is 
mycophenolic acid), IMURAN™ (azathioprine), glucocorticosteroids, adrenocortical steroids such 

20 as DELTASONE™ (prednisone) and HYDELTRASOL™ (prednisolone), FOLEX™ and 
MEXATE™ (methotrxate), OXSORALEN-ULTRA™ (methoxsalen) and RAPAMUNE™ 
(sirolimus). In a specific embodiment, immunosuppressants may be used to prevent rejection of 
organ or bone marrow transplantation. 

In an additional embodiment, Therapeutics of the invention are administered alone or in 

25 combination with one or more intravenous immune globulin preparations. Intravenous immune 
globulin preparations that may be administered with the Therapeutics of the invention include, but 
not limited to, GAMMAR™, IVEEGAM™, SANDOGLOBULIN™, GAMMAGARD S/D™, 
ATGAM™ (antithymocyte glubulin), and GAMIMUNE™. La a specific embodiment, 
Therapeutics of the invention are administered in combination with intravenous immune globulin 

30 preparations in transplantation therapy (e.g., bone marrow transplant). 

In certain embodiments, the Therapeutics of the invention are administered alone or in 
combination with an anti-inflammatory agent. Anti-inflammatory agents that may be administered 
with the Therapeutics of the invention include, but are not limited to, corticosteroids (e.g. 
betamethasone, budesonide, cortisone, dexamethasone, hydrocortisone, methylprednisolone, 

35 prednisolone, prednisone, and triamcinolone), nonsteroidal anti-inflanimatory drugs (e.g., 
diclofenac, diflunisal, etodolac, fenoprofen, floctafenine, flurbiprofen, ibuprofen, indomethacin, 
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ketoprofen, meclofenamate, mefenamic acid, meloxicam, nabumetone, naproxen, oxaprozin, 
phenylbutazone, piroxicam, sulindac, tenoxicam, tiaprofenic acid, and tolmetin.), as well as 
antihistamines, aminoarylcarboxylic acid derivatives, arylacetic acid derivatives, arylbutyric acid 
derivatives, arylcarboxylic acids, arylpropionic acid derivatives, pyrazoles, pyrazolones, salicylic 
acid derivatives, thiazinecarboxamides, e-acetamidocaproic acid, S-adenosylmethionine, 3-amino- 
4-hydroxybutyric acid, amixetrine, bendazac, benzydainine, bucolome, difenpiramide, ditazol, 
emorfazone, guaiazulene, nabumetone, nimesiilide, orgotein, oxaceprol, paranyline, perisoxal, 
pifoxime, proquazone, proxazole, and tenidap. 

In an additional embodiment, the compositions of the invention are administered alone or 
in combination with an anti-angiogenic agent. Anti-angiogenic agents that may be administered 
with the compositions of the invention include, but are not limited to, Angiostatin (Entremed, 
Rockville, MD), Troponin- 1 (Boston Life Sciences, Boston, MA), anti-Invasive Factor, retinoic 
acid and derivatives thereof, paclitaxel (Taxol), Suramin, Tissue Inhibitor of Metalloproteinase-1, 
Tissue Inhibitor of Metalloproteinase-2, VEGI, Plasminogen Activator Inhibitor-1, Plasminogen 
Activator Inhibitor-2, and various forms of the lighter n d group" transition metals. 

Lighter "d group" transition metals include, for example, vanadium, molybdenum, 
tungsten, titanium, niobium, and tantalum species. Such transition metal species may form 
transition metal complexes. Suitable complexes of the above-mentioned transition metal species 
include oxo transition metal complexes. 

Representative examples of vanadium complexes include oxo vanadium complexes such 
as vanadate and vanadyl complexes. Suitable vanadate complexes include metavanadate and 
orthovanadate complexes such as, for example, ammonium metavanadate, sodium metavanadate, 
and sodium orthovanadate. Suitable vanadyl complexes include, for example, vanadyl 
acetylacetonate and vanadyl sulfate including vanadyl sulfate hydrates such as vanadyl sulfate 
mono- and trihydrates. 

Representative examples of tungsten and molybdenum complexes also include oxo 
complexes. Suitable oxo tungsten complexes include tungstate and tungsten oxide complexes. 
Suitable tungstate complexes include ammonium tungstate, calcium tungstate, sodium tungstate 
dihydrate, and tungstic acid. Suitable tungsten oxides include tungsten (IV) oxide and tungsten 
(VI) oxide. Suitable oxo molybdenum complexes include molybdate, molybdenum oxide, and 
molybdenyl complexes. Suitable molybdate complexes include ammonium molybdate and its 
hydrates, sodium molybdate and its hydrates, and potassium molybdate and its hydrates. Suitable 
molybdenum oxides include molybdenum (VI) oxide, molybdenum (VI) oxide, and molybdic acid. 
Suitable molybdenyl complexes include, for example, molybdenyl acetylacetonate. Other suitable 
tungsten and molybdenum complexes include hydroxo derivatives derived from, for example, 
glycerol, tartaric acid, and sugars. 
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A wide variety of other anti-angiogenic factors may also be utilized within the context of 
the present invention. Representative examples include, but are not limited to, platelet factor 4; 
protamine sulphate; sulphated chitin derivatives (prepared from queen crab shells), (Murata et aL, 
Cancer Res. 51:22-26, (1991)); Sulphated Polysaccharide Peptidoglycan Complex (SP- PG) (the 
5 function of this compound may be enhanced by the presence of steroids such as estrogen, and 
tamoxifen citrate); Staurosporine; modulators of matrix metabolism, including for example, 
proline analogs, cishydroxyproline, d,L-3,4-dehydroproline, Thiaproline, alpha,alpha-dipyridyl, 
aminopropionitrile fumarate; 4-propyl-5-(4-pyridinyl)-2(3H)-oxazolone; Methotrexate; 
Mitoxantrone; Heparin; Interferons; 2 Macroglobulin-serum; ChlMP-3 (Pavloff et aL, J. Bio. 

10 Chenx 267:17321-17326, (1992)); Chymostatin (Tomkinson et aL, Biochem J. 286:475-480, 
(1992)); Cyclodextrin Tetradecasulfate; Eponemycin; Camptothecin; Fumagillin (Ingber et al., 
Nature 348:555-557, (1990)); Gold Sodium Thiomalate ("GST"; Matsubara and Ziff, J. Clin. 
Invest. 79:1440-1446, (1987)); anticollagenase-serum; alpha2-antiplasmin (Holmes et al., J. Biol. 
Chem. 262(4): 1659-1664, (1987)); Bisantrene (National Cancer Institute); Lobenzarit disodium 

15 (N-(2)-carboxyphenyl-4- chloroanthronilic acid disodium or "CCA"; (Takeuchi et aL, Agents 
Actions 36:312-316, (1992)); and metalloproteinase inhibitors such as BB94. 

Additional anti-angiogenic factors that may also be utilized within the context of the 
present invention include Thalidomide, (Celgene, Warren, NJ); Angiostatic steroid; AGM-1470 
(H. Brem and J. Folkman J Pediatr. Surg. 28:445-5 1 ( 1993)); an integrin alpha v beta 3 antagonist 

20 (C. Storgard et al., J Clin. Invest. 103:47-54 (1999)); carboxynaminolmidazole; 
Carboxyamidotriazole (CAI) (National Cancer Institute, Bethesda, MD); Conbretastatin A-4 
(CA4P) (OXiGENE, Boston, MA); Squalamine (Magainin Pharmaceuticals, Plymouth Meeting, 
PA); TNP-470, (Tap Pharmaceuticals, Deerfield, IL); ZD-0101 AstraZeneca (London, UK); 
APRA (CT2584); Benefin, Byrostatin-1 (SC339555); CGP-41251 (PKC 412); CM101; 

25 Dexrazoxane (ICRF187); DMXAA; Endostatin; Ravopridiol; Genestein; GTE; ImmTher, Iressa 
(ZD1839); Octreotide (Somatostatin); Panretin; PenaciUamine; Photopoint; PI-88; Prinomastat 
(AG-3340) Purlytin; Suradista (FCE26644); Tamoxifen (Nolvadex); Tazarotene; 
Tetrathiomolybdate; Xeloda (Capecitabine); and 5-Fluorouracil. 

Anti-angiogenic agents that may be administed in combination with the compounds of the 

30 invention may work through a variety of mechanisms including, but not limited to, inhibiting 
proteolysis of the extracellular matrix, blocking the function of endothelial cell-extracellular 
matrix adhesion molecules, by antagonizing the function of angiogenesis inducers such as growth 
factors, and inhibiting integrin receptors expressed on proliferating endothelial cells. Examples of 
anti-angiogenic inhibitors that interfere with extracellular matrix proteolysis and which may be 

35 administered in combination with the compositons of the invention include, but are not linked to, 
AG-3340 (Agouron, La Jolla, CA), BAY-12-9566 (Bayer, West Haven, CT), BMS-275291 
(Bristol Myers Squibb, Princeton, NJ), CGS-27032A (Novartis, East Hanover, NJ), Marimastat 
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(British Biotech, Oxford, UK), and Metastat (Aeterna, St-Foy, Quebec). Examples of anti- 
angiogenic inhibitors that act by blocking the function of endothelial cell-extracellular matrix 
adhesion molecules and which may be administered in combination with the compositons of the 
invention include, but are not lmited to, EMD-121974 (Merck KcgaA Darmstadt, Germany) and 
5 Vitaxin (Ixsys, La Jolla, CA/Medimmune, Gaithersburg, MD). Examples of anti-angiogenic 
agents that act by directly antagonizing or inhibiting angiogenesis inducers and which may be 
administered in combination with the compositons of the invention include, but are not lmited to, 
Angiozyme (Ribozyme, Boulder, CO), Anti-VEGF antibody (Genentech, S. San Francisco, CA), 
PTK-787/ZK-225846 (Novartis, Basel, Switzerland), SU-101 (Sugen, S. San Francisco, CA), SU- 

10 5416 (Sugen/ Pharmacia Upjohn, Bridgewater, NJ), and SU-6668 (Sugen). Other anti-angiogenic 
agents act to indirectly inhibit angiogenesis. Examples of indirect inhibitors of angiogenesis 
which may be administered in combination with the compositons of the invention include, but are 
not limited to, IM-862 (Cytran, Kirkland, WA), Interferon-alpha, IL-12 (Roche, Nutley, NJ), and 
Pentosan polysulfate (Georgetown University, Washington, DC). 

15 In particular embodiments, the use of compositions of the invention in combination with 

anti-angiogenic agents is contemplated for the treatment, prevention, and/or amelioration of an 
autoimmune disease, such as for example, an autoimmune disease described herein. 

In a particular embodiment, the use of compositions of the invention in combination with 
anti-angiogenic agents is contemplated for the treatment, prevention, and/or amelioration of 

20 arthritis. In a more particular embodiment, the use of compositions of the invention in combination 
with anti-angiogenic agents is contemplated for the treatment, prevention, and/or amelioration of 
rheumatoid arthritis. 

In another embodiment, the polynucleotides encoding a polypeptide of the present 
invention are administered in combination with an angiogenic protein, or polynucleotides 

25 encoding an angiogenic protein. Examples of angiogenic proteins that may be administered with 
the compositions of the invention include, but are not limited to, acidic and basic fibroblast growth 
factors, VEGF-1, VEGF-2, VEGF-3, epidermal growth factor alpha and beta, platelet-derived 
endothelial cell growth factor, platelet-derived growth factor, tumor necrosis factor alpha, 
hepatocyte growth factor, insulin-like growth factor, colony stimulating factor, macrophage colony 

30 stimulating factor, granulocyte/macrophage colony stimulating factor, and nitric oxide synthase. 

In additional embodiments, compositions of the invention are administered in combination 
with a chemotherapeutic agent. Chemotherapeutic agents that may be administered with the 
Therapeutics of the invention include, but are not limited to alkylating agents such as nitrogen 
mustards (for example, Mechloretharrdne, cyclophosphamide, Cyclophosphamide Ifosfamide, 

35 Melphalan (L-sarcolysin), and Chlorambucil), ethylenimines and methylmelamines (for example, 
Hexamethylmelamine and Thiotepa), alkyl sulfonates (for example, Busulfan), nitrosoureas (for 
example, Carmustine (BCNU), Lomustine (CCNU), Semustine (methyl-CCNU), and Streptozocin 
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(streptozotocin)), triazenes (for example, Dacarbazine (DTIC; 

dimethyltria2»noimidazolecarboxamide)), folic acid analogs (for example, Methotrexate 
(amethopterin)), pyrimidine analogs (for example, Fluorouacil (5-fluorouracil; 5-FU), Floxuridine 
(fhiorodeoxyuridine; FudR), and Cytarabine (cytosine arabinoside)), purine analogs and related 
inhibitors (for example, Mercaptopurine (6-mercaptopurine; 6-MP), Thioguanine (6-thioguanine; 
TG), and Pentostatin (2'-de6xycofonnycin)), vinca alkaloids (for example, Vinblastine (VLB, 
vinblastine sulfate)) and Vincristine (vincristine sulfate)), epipodophyllotoxins (for example, 
Etoposide and Tenyposide), antibiotics (for example, Dactinomycin (actinomycin D), 
Daunorubicin (daunomycin; rubidomycin), Doxorubicin, Bleomycin, Plicamycin (mithramycin), 
and Mitomycin (mitomycin C), enzymes (for example, L-Asparaginase), biological response 
modifiers (for example, mterferon-alpha and interferon-alpha-2b), platinum coordination 
compounds (for example, Cisplatin (cis-DDP) and Carboplatin), anthracenedione (Mitoxantrone), 
substituted ureas (for example, Hydroxyurea), methylhydrazine derivatives (for example, 
Procarbazine (N-methylhydrazine; MIH), adrenocoiticosteroids (for example, Prednisone), 
progestins (for example, Hydroxyprogesterone caproate, Medroxyprogesterone, 
Medroxyprogesterone acetate, and Megestrol acetate), estrogens (for example, Diethylstilbestrol 
(DES), Diethylstilbestrol diphosphate, Estradiol, and Ethinyl estradiol), antiestrogens (for 
example, Tamoxifen), androgens (Testosterone proprionate, and Fluoxymesterone), antiandrogens 
(for example, Hutamide), gonadotropin-releasing horomone analogs (for example, Leuprolide), 
other hormones and hormone analogs (for example, methyltestosterone, estramustine, estramustine 
phosphate sodium, chlorotrianisene, and testolactone), and others (for example, dicarbazine, 
glutamic acid, and mitotane). 

In one embodiment, the compositions of the invention are administered in combination 
with one or more of the following drugs: infliximab (also known as Remicade™ Centocor, Inc.), 
Trocade (Roche, RO-32-3555), Leflunomide (also known as Arava™ from Hoechst Marion 
Roussel), Kineret™ (an EL-1 Receptor antagonist also known as Anakinra from Amgen, Inc.) 

In a specific embodiment, compositions of the invention are administered in combination 
with CHOP (cyclophosphamide, doxorubicin, vincristine, and prednisone) or combination of one 
or more of the components of CHOP. In one embodiment, the compositions of the invention are 
administered in combination with anti-CD20 antibodies, human monoclonal anti-CD20 antibodies. 
In another embodiment, the compositions of the invention are administered in combination with 
anti-CD20 antibodies and CHOP, or anti-CD20 antibodies and any combination of one or more of 
the components of CHOP, particularly cyclophosphamide and/or prednisone. In a specific 

r 

embodiment, compositions of the invention are administered in combination with Riraximab. In a 
further embodiment, compositions of the invention are administered with Rituximab and CHOP, 
or Rituximab and any combination of one or more of the components of CHOP, particularly 
cyclophosphamide and/or prednisone. In a specific embodiment, compositions of the invention are 
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administered in combination with tositumomab. In a further embodiment, compositions of the 
invention are administered with tositumomab and CHOP, or tositumomab and any combination of 
one or more of the components of CHOP, particularly cyclophosphamide and/or prednisone. The 
anti-CD20 antibodies may optionally be associated with radioisotopes, toxins or cytotoxic 
5 prodrugs. 

In another specific embodiment, the compositions of the invention are adrriinistered in 
combination Zevalin™. In a further embodiment, compositions of the invention are administered 
with Zevalin™ and CHOP, or Zevalin™ and any combination of one or more of the components of 
CHOP, particularly cyclophosphamide and/or prednisone. Zevalin™ may be associated with one 

10 or more radisotopes. Particularly preferred isotopes are *°Y and U1 ln. 

In an additional embodiment, the Therapeutics of the invention are administered in 
combination with cytokines. Cytokines that may be administered with the Therapeutics of the 
invention include, but are not limited to, IL2, IL3, IL4, IL5, BL6, IL7, IL10, IL12, IL13, IL15, anti- 
CD40, CD40L, IFN-gamma and TNF-alpha. In another embodiment, Therapeutics of the 

15 invention may be administered with any interleukin, including, but not limited to, IL-lalpha, BL- 
lbeta, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, EL-8, IL-9, IL-10, EL-11, IL-12, IL-13, IL-14, IL-15, EL- 
16, EL-17, EL-18, IL-19, IL-20, andIL-21. 

In one embodiment, the Therapeutics of the invention are administered in combination 
with members of the TNF family. TNF, TNF-related or TNF-like molecules that may be 

20 administered with the Therapeutics of the invention include, but are not limited to, soluble forms 
of TNF-alpha, lymphotoxin-alpha (LT-alpha, also known as TNF-beta), LT-beta (found in 
complex heterotrimer LT-alpha2-beta), OPGL, FasL, CD27L, CD30L, CD40L, 4-1BBL, DcR3, 
OX40L, TNF-gamma (International Publication No. WO 96/14328), AM-I (International 
Publication No. WO 97/33899), endokine-alpha (International Publication No. WO 98/07880), 

25 OPG, and neutrokine-alpha (International Publication No. WO 98/18921, OX40, and nerve growth 
factor (NGF), and soluble forms of Fas, CD30, CD27, CD40 and 4-IBB, TR2 (International 
Publication No. WO 96/34095), DR3 (International Publication No. WO 97/33904), DR4 
(International Publication No. WO 98/32856), TR5 (International Publication No. WO 98/30693), 
TRANK, TR9 (International Pubkcation No. WO 98/56892),TR10 (International Publication No. 

30 WO 98/54202), 312C2 (International Publication No. WO 98/06842), and TR12, and soluble 
forms CD154, CD70, and CD153. 

In an additional embodiment, the Therapeutics of the invention are administered in 
combination with angiogenic proteins. Angiogenic proteins that may be administered with the 
Therapeutics of the invention include, but are not limited to, Glioma Derived Growth Factor 

35 (GDGF), as disclosed in European Patent Number EP-399816; Platelet Derived Growth Factor-A 
(PDGF-A), as disclosed in European Patent Number EP-6821 10; Platelet Derived Growth Factor- 
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B (PDGF-B), as disclosed in European Patent Number EP-282317; Placental Growth Factor 
(P1GF), as disclosed in International Publication Number WO 92/06194; Placental Growth Factor- 
2 (P1GF-2), as disclosed in Hauser et al., Growth Factors, 4:259-268 (1993); Vascular Endothelial 
Growth Factor (VEGF), as disclosed in International Publication Number WO 90/13649; Vascular 
5 Endothelial Growth Factor-A (VEGF-A), as disclosed in European Patent Number EP-506477; 
Vascular Endothelial Growth Factor-2 (VEGF-2), as disclosed in International Publication 
Number WO 96/39515; Vascular Endothelial Growth Factor B (VEGF-3); Vascular Endothelial 
Growth Factor B-186 (VEGF-B186), as disclosed in International Publication Number WO 
96/26736; Vascular Endothelial Growth Factor-D (VEGF-D), as disclosed in International 

10 Publication Number WO 98/02543; Vascular Endothelial Growth Factor-D (VEGF-D), as 
disclosed in International Publication Number WO 98/07832; and Vascular Endothelial Growth 
Factor-E (VEGF-E), as disclosed in German Patent Number DEI 9639601. The above mentioned 
references are herein incorporated by reference in their entireties. 

In an additional embodiment, the Therapeutics of the invention are administered in 

15 combination with Fibroblast Growth Factors. Fibroblast Growth Factors that may be administered 
with the Therapeutics of the invention include, but are not limited to, FGF-1, FGF-2, FGF-3, FGF- 
4, FGF-5, FGF-6, FGF-7, FGF-8, FGF-9, FGF-10, FGF-1 1, FGF-12, FGF-13, FGF-14, and FGF- 
15. 

In an additional embodiment, the Therapeutics of the invention are administered in 
20 combination with hematopoietic growth factors. Hematopoietic growth factors that may be 
administered with the Therapeutics of the invention include, but are not limited to, granulocyte 
macrophage colony stimulating factor (GM-CSF) (sargramostim, LEUKINE™, PROKINE™), 
granulocyte colony stimulating factor (G-CSF) (filgrastim, NEUPOGEN™), macrophage colony 
stimulating factor (M-CSF, CSF-1) erythropoietin (epoetin alfa, EPOGEN™, PROCRTT™), stem 
25 cell factor (SCF, c-kit ligand, steel factor), megakaryocyte colony stimulating factor, PIXY321 (a 
GMCSF/IL-3 fusion protein), interleukins, especially any one or more of EL-1 through BL-12, 
interferon-gamma, or thrombopoietin. 

In certain embodiments, Therapeutics of the present invention are administered in 
combination with adrenergic blockers, such as, for example, acebutolol, atenolol, betaxolol, 
30 bisoprolol, carteolol, labetalol, metoprolol, nadolol, oxprenolol, penbutolol, pindolol, propranolol, 
sotalol, and timolol. 

In another embodiment, the Therapeutics of the invention are administered in combination 
with an antiarrhythmic drug (e.g., adenosine, amidoarone, bretylium, digitalis, digoxin, digitoxin, 
diliazem, disopyramide, esmolol, flecainide, lidocaine, mexiletine, moricizine, phenytoin, 
35 procainamide, N-acetyl procainamide, propafenone, propranolol, quinidine, sotalol, tocainide, and 
verapamil). 
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In another embodiment, the Therapeutics of the invention are administered in combination 
with diuretic agents, such as carbonic anhydrase-inhibiting agents (e.g., acetazolamide, 
dichlorphenamide, and methazolamide), osmotic diuretics (e.g., glycerin, isosorbide, mannitol, 
and urea), diuretics that inhibit Na + -K + -2CT syrnport (e.g., rurosemide, bumetanide, azosemide, 
5 piretanide, tripamide, ethacrynic acid, muzolimine, and torsemide), thiazide and thiazide-like 
diuretics (e.g., bendroflumethiazide, benzthiazide, chlorothiazide, hydrochlorothiazide, 
hydroflumethiazide, methyclothiazide, polythiazide, - Irichorrnethiazide, chlorthalidone, 
indapamide, metolazone, and quinethazone), potassium sparing diuretics (e.g., amiloride and 
triamterene), and mineralcorticoid receptor antagonists (e.g., spironolactone, canrenone, and 

10 potassium canrenoate). 

In one embodiment, the Therapeutics of the invention are administered in combination 
with treatments for endocrine and/or hormone imbalance disorders. Treatments for endocrine 
and/or hormone imbalance disorders include, but are not limited to, n \ radioactive isotopes of 
iodine such as I and I; recombinant growth hormone, such as HUMATROPE™ (recombinant 

15 somatropin); growth hormone analogs such as PROTROPIN™ (somatrem); dopamine agonists 
such as PARLODEL™ (bromocriptine); somatostatin analogs such as SANDOSTATIN™ 
(octreotide); gonadotropin preparations such as PREGNYL™, A.P.L.™ and PROFASI™ (chorionic 
gonadotropin (CG)), PERGONAL™ (menotropins), and METRODIN™ (urofollitropin (uFSH)); 
synthetic human gonadotropin releasing hormone preparations such as FACTREL™ and 

20 LUTREPULSE™ (gonadorelin hydrochloride); synthetic gonadotropin agonists such as 
LUPRON™ (leuprolide acetate), SUPPRELIN™ (histrelin acetate), SYNAREL™ (nafarelin 
acetate), and ZOLADEX™ (goserelin acetate); synthetic preparations of thyrotropin-releasing 
hormone such as RELEFACT TRH™ and THYPINONE™ (protireUn); recombinant human TSH 
such as THYROGEN™; synthetic preparations of the sodium salts of the natural isomers of 

25 thyroid hormones such as L-T 4 ™, SYNTHROID™ and LEVOTHROED™ (levothyroxine sodium), 
L-T 3 ™, CYTOMEL™ and TRIOSTAT™ (liothyroine sodium), and THYROLAR™ (liotrix); 
antithyroid compounds such as 6-n-propylthiouracil (propylthiouracil), l-methyl-2- 
mercaptoimidazole and TAPAZOLE™ (methimazole), NEO-MERCAZOLE™ (carbimazole); 
beta-adrenergic receptor antagonists such as propranolol and esmolol; Ca 2+ channel blockers; 

30 dexamethasone and iodinated radiological contrast agents such as TELEPAQUE™ (iopanoic acid) 
and ORAGRAF1N™ (sodium ipodate). 

Additional treatments for endocrine and/or hormone imbalance disorders include, but are 
not limited to, estrogens or congugated estrogens such as ESTRACE™ (estradiol), ESTINYL™ 
(ethinyl estradiol), PREMARIN™, ESTRATAB™, ORTHO-EST™, OGEN™ and estropipate 

35 (estrone), ESTROVIS™ (quinestrol), ESTRADERM™ (estradiol), DELESTROGEN™ and 
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VALERGEN™ (estradiol valerate), DEPO-ESTRADIOL CYPIONATE™ and ESTROJECT LA™ 
(estradiol cypionate); antiestrogens such as NOLVADEX™ (tamoxifen), SEROPHENE™ and 
GLOME)™ (clomiphene); progestins such as DURALTJTIN™ (hydroxyprogesterone caproate), 
MPA™ and DEPO-PROVERA™ (medroxyprogesterone acetate), PROVERA™ and CYCRIN™ 
5 (MPA), MEGACE™ (megestrol acetate), NORLUTIN™ (norethindrone), and NORLUTATE™ 
and AYGESTIN™ (norethindrone acetate); progesterone implants such as NORPLANT 
SYSTEM™ (subdermal implants of norgestrel); antiprogestins such as RU 486™ (mifepristone); 
hormonal contraceptives such as ENOVBD™ (norethynodrel plus mestranol), PROGESTASERT™ 
(intrauterine device that releases progesterone), LOESTRLN™, BREVICON™, MODICON™, 

10 GENORA™, NELONA™, NORINYL™, OVACON-35™ and OVACON-50™ (ethinyl 
estradiol/norethindrone), LEVLEN™, NORDETTE™, TRI-LEVLEN™ and TRIPHASIL-21™ 
(ethinyl estradiol/levonorgestrel) LO/OVRAL™ and OVRAL™ (ethinyl estradiol/norgestrel), 
DEMULEN™ (ethinyl estradiol/ethynodiol diacetate), NORINYL™, ORTHO-NOVUM™, 
NORETHEN™, GENORA™, and NELOVA™ (norethindrone/rnestranol), DESOGEN™ and 

15 ORTHO-CEPT™ (ethinyl estradioydesogestrel), ORTHO-CY CLEN™ and ORTHO- 
TRICYCLEN™ (ethinyl estradioynorgestimate), MICRONOR™ and NOR-QD™ (norethindrone), 

and OVRETTE™ (norgestrel). 

Additional treatments for endocrine and/or hormone imbalance disorders include, but are 
not limited to, testosterone esters such as methenolone acetate and testosterone undecanoate; 

20 parenteral and oral androgens such as TESTOJECT-50™ (testosterone), TESTEX™ (testosterone 
propionate), DELATESTRYL™ (testosterone enanthate), DEPO-TESTOSTERONE™ 
(testosterone cypionate), DANOCRLNE™ (danazol), HALOTESTIN™ (fluoxymesterone), 
ORETON METHYL™, TESTRED™ and VIRILON™ (methyltestosterone), and OXANDRJN™ 
(oxandrolone); testosterone transdermal systems such as TESTODERM™; androgen receptor 

25 antagonist and 5-alpha-reductase inhibitors such as ANDROCUR™ (cyproterone acetate), 
EULEXIN™ (flutamide), and PROSCAR™ (finasteride); adrenocorticotropic hormone 
preparations such as CORTROSYN™ (cosyntropin); adrenocortical steroids and their synthetic 
analogs such as ACLOVATE™ (alclometasone dipropionate), CYCLOCORT™ (amcinonide), 
BECLOVENT™ and VANCERIL™ (beclomethasone dipropionate), CELESTONE™ 

30 (betamethasone), BENISONE™ and UTICORT™ (betamethasone benzoate), DIPROSONE™ 
(betamethasone dipropionate), CELESTONE PHOSPHATE™ (betamethasone sodium phosphate), 
CELESTONE SOLUSPAN™ (betamethasone sodium phosphate and acetate), BETA-VAL™ and 
VALISONE™ (betamethasone valerate), TEMOVATE™ (clobetasol propionate), CLODERM™ 
(clocortolone pivalate), CORTEF™ and HYDROCORTONE™ (Cortisol (hydrocortisone)), 
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HYDROCORTONE ACETATE™ (Cortisol (hydrocortisone) acetate), LOCOID™ (Cortisol 
(hydrocortisone) butyrate), HYDROCORTONE PHOSPHATE™ (Cortisol (hydrocortisone) 
sodium phosphate), A-HYDROCORT™ and SOLU CORTEF™ (Cortisol (hydrocortisone) sodium 
succinate), WESTCORT™ (Cortisol (hydrocortisone) valerate), CORTISONE ACETATE™ 
5 (cortisone acetate), DESOWEN™ and TRDDESILON™ (desonide), TOPICORT™ 
(desoximetasone), DECADRON™ (dexamethasone), DECADRON LA™ (dexamethasone 
acetate), DECADRON PHOSPHATE™ and HEXADROL PHOSPHATE™ (dexamethasone 
sodium phosphate), FLORONE™ and MAXIFLOR™ (diflorasone diacetate), FLORINEF 
ACETATE™ (fludrocortisone acetate), AEROBID™ and NASALIDE™ (flunisolide), 

10 FLUONID™ and SYNALAR™ (fluocinolone acetonide), LIDEX™ (fluocinonide), FLUOR-OP™ 
and FML™ (fluorometholone), CORDRAN™ (flurandrenolide), HALOG™ (halcinonide), HMS 
LIZUIFILM™ (medrysone), MEDROL™ (methylprednisolone), DEPO-MEDROL™ and 
MEDROL ACETATE™ (methylprednisone acetate), A-METHAPRED™ and SOLUMEDROL™ 
(methylprednisolone sodium succinate), ELOCON™ (mometasone furoate), HALDRONE™ 

15 (paramethasone acetate), DELTA-CORTEF™ (prednisolone), ECONOPRED™ (prednisolone 
acetate), HYDELTRASOL™ (prednisolone sodium phosphate), HYDELTRA-T.B . A™ 
(prednisolone tebutate), DELTASONE™ (prednisone), ARISTOCORT™ and KENACORT™ 
(triamcinolone), KENALOG™ (triamcinolone acetonide), ARISTOCORT™ and KENACORT 
DIACETATE™ (triamcinolone diacetate), and ARISTOSPAN™ (triamcinolone hexacetonide); 

20 inhibitors of biosynthesis and action of adrenocortical steroids such as CYTADREN™ 
(aminoglutethimide), NIZORAL™ (ketoconazole), MODRASTANE™ (trilostane), and 
METOPIRONE™ (metyrapone); bovine, porcine or human insulin or mixtures thereof; insulin 
analogs; recombinant human insulin such as HUMULLN™ and NOVOLIN™; oral hypoglycemic 
agents such as ORAJMQODE™ and ORINASE™ (tolbutamide), DIABINESE™ (chlorpropamide), 

25 TOLAMIDE™ and TOLINASE™ (tolazamide), DYMELOR™ (acetohexamide), gkbenclamide, 
MICRONASE™, DIBETA™ and GLYNASE™ (glyburide), GLUCOTROL™ (glipizide), and 
DIAMICRON™ (gliclazide), GLUCOPHAGE™ (metformin), ciglitazone, pioglitazone, and 
alpha-glucosidase inhibitors; bovine or porcine glucagon; somatostatins such as 
SANDOSTATIN™ (octreotide); and diazoxides such as PROGLYCEM™ (diazoxide). 

30 In an additional embodiment, the Therapeutics of the invention are administered in 

combination with drugs effective in treating iron deficiency and hypochromic anemias, including 
but not limited to, ferrous sulfate (iron sulfate, FEOSOL™), ferrous fumarate (e.g., FEOSTAT™), 
ferrous gluconate (e.g., FERGON™), polysaccharide-iron complex (e.g., NIFEREX™), iron 
dextran injection (e.g., INFED™), cupric sulfate, pyroxidine, riboflavin, Vitamin Bi2, 

35 cyancobalamin injection (e.g., REDISOL™, RUBRAMEN PC™), hydroxocobalamin, folic acid 
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(e.g., FOLVTTE™), leucovorin (folinic acid, 5-CHOH4PteGlu, citrovorum factor) or 
WELLCOVORIN (Calcium salt of leucovorin), transferrin or ferritin. 

In another embodiment, Therapeutics of the invention are administered in combination 
with vasodilating agents and/or calcium channel blocking agents. Vasodilating agents that may be 
5 administered with the Therapeutics of the invention include, but are not limited to, Angiotensin 
Converting Enzyme (ACE) inhibitors (e.g., papaverine, isoxsuprine, benazepril, captopril, 
cilazapril, enalapril, enalaprilat, fosinopril, lisinopril, moexipril, perindopril, quinapril, ramipril, 
spirapril, trandolapril, and nylidrin), and nitrates (e.g., isosorbide dinitrate, isosorbide mononitrate, 
and nitroglycerin). Examples of calcium channel blocking agents that may be administered in 
10 combination with the Therapeutics of the invention include, but are not limited to amlodipine, 
bepridil, diltiazem, felodipine, flunarizine, isradipine, nicardipine, nifedipine, nimodipine, and 
verapamil. 

In certain embodiments, the Therapeutics of the invention are administered in combination 
with treatments for gastrointestinal disorders. Treatments for gastrointestinal disorders that may be 

15 administered with the Therapeutic of the invention include, but are not limited to, H 2 histamine 
receptor antagonists (e.g., TAGAMET™ (cimetidine), ZANTAC™ (ranitidine), PEPCID™ 
(famotidine), and AXID™ (nizatidine)); inhibitors of H\ K + ATPase (e.g., PREVACID™ 
(lansoprazole) and PRILOSEC™ (omeprazole)); Bismuth compounds (e.g., PEPTO-BISMOL™ 
(bismuth subsalicylate) and DE-NOL™ (bismuth subcitrate)); various antacids; sucralfate; 

20 prostaglandin analogs (e.g. CYTOTEC™ (misoprostol)); muscarinic cholinergic antagonists; 
laxatives (e.g., surfactant laxatives, stimulant laxatives, saline and osmotic laxatives); antidiarrheal 
agents (e.g., LOMOTIL™ (diphenoxylate), MOTOFEN™ (diphenoxin), and IMODIUM™ 
Qoperamide hydrochloride)), synthetic analogs of somatostatin such as SANDOSTATIN™ 
(octreotide), antiemetic agents (e.g., ZOFRAN™ (ondansetron), KYTRIL™ (granisetron 

25 hydrochloride), tropisetron, dolasetron, metoclopramide, chlorpromazine, perphenazine, 
prochlorperazine, promethazine, thiethylperazine, triflupromazine, domperidone, haloperidol, 
droperidol, trimethobenzamide, dexamethasone, methylprednisolone, dronabinol, and nabilone); 
D2 antagonists (e.g., metoclopramide, trimethobenzamide and chlorpromazine); bile salts; 
chenodeoxycholic acid; ursodeoxycholic acid; and pancreatic enzyme preparations such as 

30 pancreatin and pancrelipase. 

In additional embodiments, the Therapeutics of the invention are administered in 
combination with other therapeutic or prophylactic regimens, such as, for example, radiation 
therapy. 



35 



692 



> 

WO 02/076488 PCT/US02/08276 

Example 14: Method of Treating Decreased Levels of (he Polypeptide 

The present invention relates to a method for treating an individual in need of an increased 
level of a polypeptide of the invention in the body comprising administering to such an individual 
5 a composition comprising a therapeutically effective amount of polypeptides (including agonists 
thereto), and/or antibodies of the invention. Moreover, it will be appreciated that conditions 
caused by a decrease in the standard or normal expression level of a polypeptide of the present 
invention in an individual may be treated by administering agonists of said polypeptide. Thus, the 
invention also provides a method of treatment of an individual in need of an increased level of the 
10 polypeptide comprising administering to such an individual a Therapeutic comprising an amount 
of the agonist (including polypeptides and antibodies of the present invention) to increase the 

m 

activity level of the polypeptide in such an individual. 

For example, a patient with decreased levels of a polypeptide receives a daily dose 0.1-100 
ug/kg of the agonist for six consecutive days. The exact details of the dosing scheme, based on 
15 administration and formulation, are provided in Example 13. 

Example 15: Method of Treating Increased Levels of the Polypeptide 

The present invention also relates to a method of treating an individual in need of a 
20 decreased level of a polypeptide of the invention in the body comprising administering to such an 
individual a composition comprising a therapeutically effective amount of an antagonist of the 
invention (including polypeptides and antibodies of the invention). 

In one example, antisense technology is used to inhibit production of a polypeptide of the 
present invention. This technology is one example of a method of decreasing levels of a 
25 polypeptide, due to a variety of etiologies, such as cancer. 

For example, a patient diagnosed with abnormally increased levels of a polypeptide is 
administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5, 2.0 and 3.0 mg/kg day for 21 
days. This treatment is repeated after a 7-day rest period if the treatment was well tolerated. The 
antisense polynucleotides of the present invention can be formulated using techniques and 
30 formulations described herein (e.g. see Example 13), or otherwise known in the art 

Example 16: Method of Treatment Using Gene Therapy-Ex Vivo 

One method of gene therapy transplants fibroblasts, which are capable of expressing a 
35 polypeptide, onto a patient. Generally, fibroblasts are obtained from a subject by skin biopsy. The 
resulting tissue is placed in tissue-culture medium and separated into small pieces. Small chunks 
of the tissue are placed on a wet surface of a tissue culture flask, approximately ten pieces are 
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placed in each flask. The flask is turned upside down, closed tight and left at room temperature 
over night. After 24 hours at room temperature, the flask is inverted and the chunks of tissue 
remain fixed to the bottom of the flask and fresh media (e.g., Ham's F12 media, with 10% FBS, 
penicillin and streptomycin) is added. The flasks are then incubated at 37 degree C for 
5 approximately one week. 

At this time, fresh media is added and subsequently changed every several days. After an 
additional two weeks in culture, a monolayer of fibroblasts emerge. The monolayer is tiypsinized 
and scaled into larger flasks. 

pMV-7 (Kirschmeier, P.T. et al., DNA, 7:219-25 (1988)), flanked by the long terminal 
. 10 repeats of the Moloney murine sarcoma virus, is digested with EcoRI and HindlTI and 
subsequently treated with calf intestinal phosphatase. The linear vector is fractionated on agarose 
gel and purified, using glass beads. 

The cDNA encoding a polypeptide of the present invention can be amplified using PCR 
primers which correspond to the 5' and 3' end sequences respectively as set forth in Example 1 
15 using primers and having appropriate restriction sites and initiation/stop codons, if necessary. 
Preferably, the 5' primer contains an EcoRI site and the 3' primer includes a Hindin site. Equal 
quantities of the Moloney murine sarcoma virus linear backbone and the amplified EcoRI and 
Hkidin fragment are added together, in the presence of T4 DNA ligase. The resulting mixture is 
maintained under conditions appropriate for ligation of the two fragments. The ligation mixture is 
20 then used to transform bacteria HB 101, which are then plated onto agar containing kanamycin for 
the purpose of conflnriing that the vector has the gene of interest properly inserted. 

The amphotropic pA317 or GP+aml2 packaging cells are grown in tissue culture to 
confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 10% calf serum (CS), 
penicillin and streptomycin. The MSV vector containing the gene is then added to the media and 
25 the packaging cells transduced with the vector. The packaging cells now produce infectious viral 
particles containing the gene (the packaging cells are now referred to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequendy, the media is 
harvested from a 10 cm plate of confluent producer cells. The spent media, containing the 
infectious viral particles, is filtered through a millipore filter to remove detached producer cells 
30 and this media is then used to infect fibroblast cells. Media is removed from a sub-confluent plate 
of fibroblasts and quickly replaced with the media from the producer cells. This media is removed 
and replaced with fresh media. If the titer of virus is high, then virtually all fibroblasts will be 
infected and no selection is required. If the titer is very low, then it is necessary to use a retroviral 
vector that has a selectable marker, such as neo or his. Once the fibroblasts have been efficiently 
35 infected, the fibroblasts are analyzed to determine whether protein is produced. 

The engineered fibroblasts are then transplanted onto the host, either alone or after having 
been grown to confluence on cytodex 3 microcarrier beads. 
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Example 17: Gene Therapy Using Endogenous Genes Corresponding To 

Polynucleotides of the Invention 

5 Another method of gene therapy according to the present invention involves operably 

associating the endogenous polynucleotide sequence of the invention with a promoter, via 
homologous recombination as described, for example, in U.S. Patent NO: 5,641,670, issued June 
24, 1997; International Publication NO: WO 96/29411, published September 26, 1996; 
International Publication NO: WO 94/12650, published August 4, 1994; Koller et al., Proc. Natl 

10 Acad, Set USA, 86:8932-8935 (1989); and Zijlstra et al., Nature, 342:435-438 (1989). This 
method involves the activation of a gene which is present in the target cells, but which is not 
expressed in the cells, or is expressed at a lower level than desired. 

Polynucleotide constructs are made which contain a promoter and targeting sequences, 
which are homologous to the 5* non-coding sequence of endogenous polynucleotide sequence, 

15 flanking the promoter. The targeting sequence will be sufficiently near the 5' end of the 
polynucleotide sequence so the promoter will be operably linked to the endogenous sequence upon 
homologous recombination. The promoter and the targeting sequences can be amplified using 
PCR. Preferably, the amplified promoter contains distinct restriction enzyme sites on the 5* and 3' 
ends. Preferably, the 3' end of the first targeting sequence contains the same restriction enzyme 

20 site as the 5' end of the amplified promoter and the 5' end of the second targeting sequence 
contains the same restriction site as the 3* end of the amplified promoter. 

The amplified promoter and the amplified targeting sequences are digested with the 
appropriate restriction enzymes and subsequently treated with calf intestinal phosphatase. The 
digested promoter and digested targeting sequences are added together in the presence of T4 DNA 

25 ligase. The resulting mixture is maintained under conditions appropriate for ligation of the two 
fragments. The construct is size fractionated on an agarose gel, then purified by phenol extraction 
and ethanol precipitation. 

In this Example, the polynucleotide constructs are administered as naked polynucleotides 
via electroporation. However, the polynucleotide constructs may also be administered with 

30 transfection-facilitating agents, such as liposomes, viral sequences, viral particles, precipitating 
agents, etc. Such methods of delivery are known in the art 

Once the cells are transfected, homologous recombination will take place which results in 
the promoter being operably linked to the endogenous polynucleotide sequence. This results in the 
expression of polynucleotide corresponding to the polynucleotide in the cell. Expression may be 

35 detected by immunological staining, or any other method known in the art 

Fibroblasts are obtained from a subject by skin biopsy. The resulting tissue is placed in 
DMEM + 10% fetal calf serum. Exponentially growing or early stationary phase fibroblasts are 
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trypsinized and rinsed from the plastic surface with nutrient medium. An aliquot of the cell 
suspension is removed for counting, and the remaining cells are subjected to centrifugation. The 
supernatant is aspirated and the pellet is resuspended in 5 ml of electroporation buffer (20 mM 
HEPES pH 7.3, 137 mM NaCl, 5 mM KC1, 0.7 mM Na 2 HP0 4 , 6 mM dextrose). The cells are 
5 recentrifuged, the supernatant aspirated, and the cells resuspended in electroporation buffer 
containing 1 mg/ml acetylated bovine serum albumin. The final cell suspension contains 
approximately 3X10 6 cells/ml. Electroporation should be performed immediately following 
resuspension. 

Plasmid DNA is prepared according to standard techniques. For example, to construct a 

10 plasmid for targeting to the locus corresponding to the polynucleotide of the invention, plasmid 
pUC18 (MBI Fermentas, Amherst, NY) is digested with Hindm The CMV promoter is 
amplified by PCR with an Xbal site on the 5' end and a BamHI site on the 3' end. Two non-coding 
sequences are amplified via PCR: one non-coding sequence (fragment 1) is amplified with a 
Hindin site at the 5 1 end and an Xba site at the 3'end; the other non-coding sequence (fragment 2) 

15 is amplified with a BamHI site at the 5'end and a HindUl site at the 3'end. The CMV promoter and 
the fragments (1 and 2) are digested with the appropriate enzymes (CMV promoter - Xbal and 
BamHI; fragment 1 - Xbal; fragment 2 - BamHI) and ligated together. The resulting ligation 
product is digested with Hindin, and ligated with the Hrndlll-digested pUC18 plasmid. 

Plasmid DNA is added to a sterile cuvette with a 0.4 cm electrode gap (Bio-Rad). The 

20 final DNA concentration is generally at least 120 /xg/ml. 0.5 ml of the cell suspension (containing 
approximately 1.5JQ0 6 cells) is then added to the cuvette, and the cell suspension and DNA 
solutions are gently mixed. Electroporation is performed with a Gene-Pulser apparatus (Bio-Rad). 
Capacitance and voltage are set at 960 /iF and 250-300 V, respectively. As voltage increases, cell 
survival decreases, but the percentage of surviving cells that stably incorporate the introduced 

25 DNA into their genome increases dramatically. Given these parameters, a pulse time of 
approximately 14-20 mSec should be observed. 

Electroporated cells are maintained at room temperature for approximately 5 min, and the 
contents of the cuvette are then gently removed with a sterile transfer pipette. The cells are added 
directly to 10 ml of prewarmed nutrient media (DMEM with 15% calf serum) in a 10 cm dish and 

30 incubated at 37 degree C. The following day, the media is aspirated and replaced with 10 ml of 
fresh media and incubated for a further 16-24 hours. 

The engineered fibroblasts are then injected into the host, either alone or after having been 
grown to confluence on cytodex 3 microcarrier beads. The fibroblasts now produce the protein 
product. The fibroblasts can then be introduced into a patient as described above. 

35 

Example 18: Metiwd of Treatment Using Gene Therapy - In Vivo 
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Another aspect of the present invention is using in vivo gene therapy methods to prevent, 
treat, and/or ameliorate gastrointestinal diseases and disorders. The gene therapy method relates to 
the introduction of naked nucleic acid (DNA, RNA, and antisense DNA or RNA) sequences into 
an animal to increase or decrease the expression of the polypeptide. The polynucleotide of the 
5 present invention may be operatively linked to (i.e., associated with) a promoter or any other 
genetic elements necessary for the expression of the polypeptide by the target tissue. Such gene 
therapy and delivery techniques and methods are known in the art, see, for example, WO90/11092, 
W098/11779; U.S. Patent NO. 5693622, 5705151, 5580859; Tabata et al., Cardiovasc. Res. 
35(3):470479 (1997); Chao et al., Pharmacol. Res. 35(6):517-522 (1997); Wolff, Neuromuscul. 

10 Disord. 7(5):314-318 (1997); Schwartz et al., Gene Ther. 3(5):405-411 (1996); Tsurumi et al, 
Circulation 94(12):328 1-3290 (1996) (incorporated herein by reference). 

The polynucleotide constructs may be delivered by any method that delivers injectable 
materials to the cells of an animal, such as, injection into the interstitial space of tissues (heart, 
muscle, skin, lung, liver, intestine and the like). The polynucleotide constructs can be delivered in 

15 a pharmaceutically acceptable liquid or aqueous carrier. 

The term "naked" polynucleotide, DNA or RNA, refers to sequences that are free from 
any delivery vehicle that acts to assist, promote, or facilitate entry into the cell, including viral 
sequences, viral particles, liposome formulations, lipofectin or precipitating agents and the like. 
However, the polynucleotides of the present invention may also be delivered in liposome 

20 formulations (such as those taught in Feigner P.L. et al. (1995) Ann. NY Acad. Sci. 772:126-139 
and Abdallah B. et al. (1995) Biol. Cell 85(1): 1-7) which can be prepared by methods well known 
to those skilled in the art. 

The polynucleotide vector constructs used in the gene therapy method are preferably 
constructs that will not integrate into the host genome nor will they contain sequences that allow 

25 for replication. Any strong promoter known to those skilled in the art can be used for driving the 
expression of DNA. Unlike other gene therapy techniques, one major advantage of introducing 
naked nucleic acid sequences into target cells is the transitory nature of the polynucleotide 
synthesis in the cells. Studies have shown that non-replicating DNA sequences can be introduced 
into cells to provide production of the desired polypeptide for periods of up to six months. 

30 The polynucleotide construct can be delivered to the interstitial space of tissues within an 

animal, including muscle, skin, brain, lung, liver, spleen, bone marrow, thymus, heart, lymph, 
blood, bone, cartilage, pancreas, kidney, gall bladder, stomach, intestine, testis, ovary, uterus, 
rectum, nervous system, eye, gland, and connective tissue. Interstitial space of the tissues 
comprises the intercellular fluid, mucopolysaccharide matrix among the reticular fibers of organ 

35 tissues, elastic fibers in the walls of vessels or chambers, collagen fibers of fibrous tissues, or that 
same matrix within connective tissue enshea thing muscle cells or in the lacunae of bone. It is 
similarly the space occupied by the plasma of the circulation and the lymph fluid of the lymphatic 
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channels. Delivery to the interstitial space of muscle tissue is preferred for the reasons discussed 
below. They may be conveniently delivered by injection into the tissues comprising these cells. 
They are preferably delivered to and expressed in persistent, non-dividing cells which are 
differentiated, although delivery and expression may be achieved in non-differentiated or less 

5 completely differentiated cells, such as, for example, stem cells of blood or skin fibroblasts. In 
vivo muscle cells are particularly competent in their ability to take up and express polynucleotides. 

For the naked polynucleotide injection, an effective dosage amount of DNA or RNA will 
be in the range of from about 0.05 g/kg body weight to about 50 mg/kg body weight. Preferably 
the dosage will be from about 0.005 mg/kg to about 20 mg/kg and more preferably from about 

10 0.05 mg/kg to about 5 mg/kg. Of course, as the artisan of ordinary skill will appreciate, this 
dosage will vary according to the tissue site of injection. The appropriate and effective dosage of 
nucleic acid sequence can readily be determined by those of ordinary skill in the art and may 
depend on the condition being treated and the route of administration. The preferred route of 
administration is by the parenteral route of injection into the interstitial space of tissues. However, 

15 other parenteral routes may also be used, such as, inhalation of an aerosol formulation particularly 
for delivery to lungs or bronchial tissues, throat or mucous membranes of the nose. In addition, 
naked polynucleotide constructs can be delivered to arteries during angioplasty by the catheter 
used in the procedure. 

The dose response effects of injected polynucleotide in muscle in vivo is determined as 
20 follows. Suitable template DNA for production of mRNA coding for polypeptide of the present 
invention is prepared in accordance with a standard recombinant DNA methodology. The 
template DNA, which may be either circular or linear, is either used as naked DNA or complexed 
with liposomes. The quadriceps muscles of mice are then injected with various amounts of the 
template DNA. 

25 Five to six week old female and male Balb/C mice are anesthetized by intraperitoneal 

injection with 0.3 ml of 2.5% Avertrn. A 1.5 cm incision is made on the anterior thigh, and the 
quadriceps muscle is directiy visualized. The template DNA is injected in 0.1 ml of carrier in a 1 
cc syringe through a 27 gauge needle over one minute, approximately 0.5 cm from the distal 
insertion site of the muscle into the knee and about 0.2 cm deep. A suture is placed over the 

30 injection site for future localization, and the skin is closed with stainless steel clips. 

After an appropriate incubation time (e.g., 7 days) muscle extracts are prepared by 
excising the entire quadriceps. Every fifth 15 um cross-section of the individual quadriceps 
muscles is histochemically stained for protein expression. A time course for protein expression 
may be done in a similar fashion except that quadriceps from different mice are harvested at 

35 different times. Persistence of DNA in muscle following injection may be determined by Southern 
blot analysis after preparing total cellular DNA and HIRT snpernatants from injected and control 
mice. The results of the above experimentation in mice can be used to extrapolate proper dosages 
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and other treatment parameters in humans and other animals using naked DNA. 



Example 19: Transgenic Animals 



5 The polypeptides of the invention can also be expressed in transgenic animals. Animals of 

any species, including, but not limited to, mice, rats, rabbits, hamsters, guinea pigs, pigs, micro- 
pigs, goats, sheep, cows and non-human primates, e.g., baboons, monkeys, and chimpanzees may 
be used to generate transgenic animals. In a specific embodiment, techniques described herein or 
otherwise known in the art, are used to express polypeptides of the invention in humans, as part of 

10 a gene therapy protocol. 

Any technique known in the art may be used to introduce the transgene (i.e., 
polynucleotides of the invention) into animals to produce the founder lines of transgenic animals. 
Such techniques include, but are not limited to, pronuclear microinjection (Paterson et al., Appl. 
Microbiol. Biotechnol. 40:691-698 (1994); Carver et al., Biotechnology (NY) 11:1263-1270 

15 (1993); Wright et al., Biotechnology (NY) 9:830-834 (1991); and Hoppe et al., U.S. Pat. No. 
4,873,191 (1989)); retrovirus mediated gene transfer into germ lines (Van der Putten et al., Proc. 
Nad. Acad. Sci., USA 82:6148-6152 (1985)), blastocysts or embryos; gene targeting in embryonic 
stem cells (Thompson et al., Cell 56:313-321 (1989)); electroporation of cells or embryos (Lo, 
1983, Mol Cell. Biol. 3:1803-1814 (1983)); introduction of the polynucleotides of the invention 

20 using a gene gun (see, e.g., Ulmer et al., Science 259:1745 (1993); introducing nucleic acid 
constructs into embryonic pleuripotent stem cells and transferring the stem cells back into the 
blastocyst; and sperm-mediated gene transfer (Lavitrano et al., Cell 57:717-723 (1989); etc. For a 
review of such techniques, see Gordon, 'Transgenic Animals," Intl. Rev. Cytol. 115:171-229 
(1989), which is incorporated by reference herein in its entirety. 

25 Any technique known in the art may be used to produce transgenic clones containing 

polynucleotides of the invention, for example, nuclear transfer into enucleated oocytes of nuclei 
from cultured embryonic, fetal, or adult cells induced to quiescence (Campell et al., Nature 
380:64-66 (1996); Wilmut et al., Nature 385:810-813 (1997)). 

The present invention provides for transgenic animals that carry the transgene in all their 

30 cells, as well as animals which carry the transgene in some, but not all their cells, Le„ mosaic 
animals or chimeric. The transgene may be integrated as a single transgene or as multiple copies 
such as in concatamers, e.g., head-to-head tandems or head-to-tail tandems. The transgene may 
also be selectively introduced into and activated in a particular cell type by following, for example, 
the teaching of Lasko et al. (Lasko et al, Proc. Natl. Acad. Sci. USA 89:6232-6236 (1992)). The 

35 regulatory sequences required for such a cell-type specific activation will depend upon the 
particular cell type of interest, and will be apparent to those of skill in the art. When it is desired 
that the polynucleotide transgene be integrated into the chromosomal site of the endogenous gene, 

699 



WO 02/076488 PCT/US02/08276 

gene targeting is preferred. Briefly, when such a technique is to be utilized, vectors containing 
some nucleotide sequences homologous to the endogenous gene are designed for the purpose of 
integrating, via homologous recombination with chromosomal sequences, into and disrupting the 
function of the nucleotide sequence of the endogenous gene. The transgene may also be 
5 selectively introduced into a particular cell type, thus inactivating the endogenous gene in only that 
cell type, by following, for example, the teaching of Gu et al. (Gu et al., Science 265:103-106 
(1994)). The regulatory sequences required for such a cell-type specific inactivation will depend 
upon the particular cell type of interest, and will be apparent to those of skill in the art. 

Once transgenic animals have been generated, the expression of the recombinant gene 

10 may be assayed utilizing standard techniques. Initial screening may be accomplished by Southern 
blot analysis or PCR techniques to analyze animal tissues to verify that integration of the transgene 
has taken place. The level of mRNA expression of the transgene in the tissues of the transgenic 
aiiimals may also be assessed using techniques which include, but are not limited to, Northern blot 
analysis of tissue samples obtained from the animal, in situ hybridization analysis, and reverse 

15 transcriptase-PCR (rt-PCR). Samples of transgenic gene-expressing tissue may also be evaluated 
irnmunocytochemically or immunohistochemically using antibodies specific for the transgene 
product 

Once the founder animals are produced, they may be bred, inbred, outbred, or crossbred to 
produce colonies of the particular animal. Examples of such breeding strategies include, but are 

20 not limited to: outbreeding of founder animals with more than one integration site in order to 
establish separate lines; inbreeding of separate lines in order to produce compound transgenics that 
express the transgene at higher levels because of the effects of additive expression of each 
transgene; crossing of heterozygous transgenic animals to produce animals homozygous for a 
given integration site in order to both augment expression and eliminate the need for screening of 

25 ammals by DNA analysis; crossing of separate homozygous lines to produce compound 
heterozygous or homozygous lines; and breeding to place the transgene on a distinct background 
that is appropriate for an experimental model of interest. 

Transgenic animals of the invention have uses which include, but are not limited to, 
animal model systems useful in elaborating the biological function of polypeptides of the present 

30 invention, studying conditions and/or disorders associated with aberrant expression, and in 
screening for compounds effective in ameliorating such conditions and/or disorders. 

Example 20: Knock-Out Animals 

35 Endogenous gene expression can also be reduced by inactivating or "knocking out" the 

gene and/or its promoter using targeted homologous recombination, (e.g., see Smithies et al., 
Nature 317:230-234 (1985); Thomas & Capecchi, Cell 51:503-512 (1987); Thompson et al., Cell 

700 



s 

WO 02/076488 PCT/US02/08276 

5:313-321 (1989); each of which is incorporated by reference herein in its entirety). For example, 
a mutant, non-functional polynucleotide of the invention (or a completely unrelated DNA 
sequence) flanked by DNA homologous to the endogenous polynucleotide sequence (either the 
coding regions or regulatory regions of the gene) can be used, with or without a selectable marker 
5 and/or a negative selectable marker, to transfect cells that express polypeptides of the invention in 
vivo. In another embodiment, techniques known in the art are used to generate knockouts in cells 
that contain, but do not express the gene of interest. Insertion of the DNA construct, via targeted 
homologous recombination, results in inactivation of the targeted gene. Such approaches are 
particularly suited in research and agricultural fields where modifications to embryonic stem cells 
10 can be used to generate animal offspring with an inactive targeted gene (e.g., see Thomas & 
Capecchi 1987 and Thompson 1989, supra). However this approach can be routinely adapted for 
use in humans provided the recombinant DNA constructs are directly adrninistered or targeted to 
the required site in vivo using appropriate viral vectors that will be apparent to those of skill in the 
art. 

15 In further embodiments of the invention, cells that are genetically engineered to express 

the polypeptides of the invention, or alternatively, that are genetically engineered not to express 
the polypeptides of the invention (e.g., knockouts) are administered to a patient in vivo. Such cells 
may be obtained from the patient (i.e., animal, including human) or an MHC compatible donor and 
can include, but are not limited to fibroblasts, bone marrow cells, blood cells (e.g., lymphocytes), 

20 adipocytes, muscle cells, endothelial cells etc. The cells are genetically engineered in vitro using 
recombinant DNA techniques to introduce the coding sequence of polypeptides of the invention 
into the cells, or alternatively, to disrupt the coding sequence and/or endogenous regulatory 
. sequence associated with the polypeptides of the invention, e.g., by transduction (using viral 
vectors, and preferably vectors that integrate the transgene into the cell genome) or transfection 

25 procedures, including, but not hmited to, the use of plasmids, cosmids, YACs, naked DNA, 
electroporation, liposomes, etc. The coding sequence of the polypeptides of the invention can be 
placed under the control of a strong constitutive or inducible promoter or promoter/enhancer to 
achieve expression, and preferably secretion, of the polypeptides of the invention. The engineered 
cells which express and preferably secrete the polypeptides of the invention can be introduced into 

30 the patient systemically, e.g., in the circulation, or intraperitoneally. 

Alternatively, the cells can be incorporated into a matrix and implanted in the body, e.g., 
genetically engineered fibroblasts can be implanted as part of a skin graft; genetically engineered 
endothelial cells can be implanted as part of a lymphatic or vascular graft (See, for example, 
Anderson et al. U.S. Patent No. 5,399,349; and Mulligan & Wilson, U.S. Patent No. 5,460,959 

35 each of which is incorporated by reference herein in its entirety). 

When the cells to be administered are non-autologous or non-MHC compatible cells, they 
can be administered using well known techniques which prevent the development of a host 
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immune response against the introduced cells. For example, the cells may be introduced in an 
encapsulated form which, while allowing for an exchange of components with the irnmediate 
extracellular environment, does not allow the introduced cells to be recognized by the host 
immune system. 

5 Transgenic and "knock-out" animals of the invention have uses which include, but are not 

limited to, animal model systems useful in elaborating the biological function of polypeptides of 
the present invention, studying conditions and/or disorders associated with aberrant expression, 
and in screening for compounds effective in ameliorating such conditions and/or disorders. 

10 Example 21: Production Of Polypeptide of the Invention For High-Throughput 

Screening Assays 

The following protocol produces a supernatant containing polypeptide of the present 
invention to be tested. This supernatant can then be used in the Screening Assays described in 

1 5 Examples 32-4 1 . 

First, dilute Poly-D-Lysine (644 587 Boehringer-Marjnheim) stock solution (lmg/ml in 
PBS) 1:20 in PBS (w/o calcium or magnesium 17-5 16F Biowhittaker) for a working solution of 
50ug/ml. Add 200 ul of this solution to each well (24 well plates) and incubate at RT for 20 
minutes. Be sure to distribute the solution over each well (note: a 12-channel pipetter may be used 

20 with tips on every other channel). Aspirate off the Poly-D-Lysine solution and rinse with 1ml PBS 
(Phosphate Buffered Saline). The PBS should remain in the well until just prior to plating the 
cells and plates may be poly-lysine coated in advance for up to two weeks. 

Plate 293T. cells (do not carry cells past P+20) at 2 x 10 5 cells/well in .5ml 
DMEM(DuIbecco's Modified Eagle Medium)(with 4.5 G/L glucose and L-glutamine (12-604F 

25 Biowhittaker))/10% heat inactivated FBS(14-503F Biowhittaker)/lx Penstrep(17-602E 
Biowhittaker). Let the cells grow overnight. 

The next day, mix together in a sterile solution basin: 300 ul Lipofectarnine (18324-012 
Gibco/BRL) and 5ml Optimem I (31985070 Gibco/BRL)/96-well plate. With a small volume 
multi-channel pipetter, aliquot approximately 2ug of an expression vector containing a 

30 polynucleotide insert, produced by the methods described in Examples 8-10, into an appropriately 
labeled 96-well round bottom plate. With a multi-channel pipetter, add 50ul of the 
Lipofectamine/Optimem I mixture to each well. Pipette up and down gently to mix. Incubate at 
RT 15-45 minutes. After about 20 minutes, use a multi-channel pipetter to add ISOul Optimem I 
to each well. As a control, one plate of vector DNA lacking an insert should be transfected with 

35 each set of transfections. 

Preferably, the transfection should be performed by tag-teaming the following tasks. By 
tag-teaming, hands on time is cut in half, and the cells do not spend too much time on PBS. First, 
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person A aspirates off the media from four 24-well plates of cells, and then person B rinses each 
well with .5-lml PBS. Person A then aspirates off PBS rinse, and person B, using al2-channel 
pipetter with tips on every other channel, adds the 200ul of DNA/Lipofectamine/Optimem I 
complex to the odd wells first, then to the even wells, to each row on the 24-well plates. Incubate 
5 at 37 degree C for 6 hours. 

While cells are incubating, prepare appropriate media, either 1%BSA in DMEM with lx 
penstrep, or HGS CHO-5 media (116.6 mg/L of CaC12 (anhyd); 0.00130 mg/L CuS0 4 -5H 2 0; 

0.050 mg/L of Fe(N0 3 ) 3 -9H 2 0; 0.417 mg/L of FeS0 4 -7H 2 0; 311.80 mg/L of Kcl; 28.64 mg/L 

of MgCI 2 ; 48.84 mg/L of MgS0 4 ; 6995.50 mg/L of NaCl; 2400.0 mg/L of NaHC0 3 ; 62.50 mg/L 

10 of NaH 2 PO 4 -H 2 0; 71.02 mg/L of Na 2 HP04; .4320 mg/L of ZnSO 4 -7H 2 0; .002 mg/L of 

Arachidonic Acid ; 1.022 mg/L of Cholesterol; .070 mg/L of DL-alpha-Tocopherol-Acetate; 
0.0520 mg/L of Linoleic Acid; 0.010 mg/L of Linolenic Acid; 0.010 mg/L of Myristic Acid; 0.010 
mg/L of Oleic Acid; 0.010 mg/L of Palmitric Acid; 0.010 mg/L of Palmitic Acid; 100 mg/L of 
Pluronic F-68; 0.010 mg/L of Stearic Acid; 2.20 mg/L of Tween 80; 4551 mg/L of D-Glucose; 
15 130.85 mg/ml of L- Alanine; 147.50 mg/ml of L-Arginine-HCL; 7.50 mg/ml of L-Asparagine- 
H 2 0; 6.65 mg/ml of L-Aspartic Acid; 29.56 mg/ml of L-Cystine-2H<X-H 2 0; 31.29 mg/ml of L- 

Cystine-2HCL; 7.35 mg/ml of L-Glutamic Acid; 365.0 mg/ml of l^Glutamine; 18.75 mg/ml of 
Glycine; 52.48 mg/ml of L-Histidine-HCL-H 2 0; 106.97 mg/ml of L-Isoleucine; 111.45 mg/ml of 

L-Leucine; 163.75 mg/ml of L-Lysine HCL; 32.34 mg/ml of ^Methionine; 68.48 mg/ml of Lr 
20 Phenylalainine; 40.0 mg/ml of L-Proline; 26.25 mg/ml of L-Serine; 101.05 mg/ml of L-Threonine; 
19.22 mg/ml of L-Tryptophan; 91.79 mg/ml of J^Tryrosine-2Na-2H 2 0; and 99.65 mg/ml of L- 

Valine; 0.0035 mg/L of Biotin; 3.24 mg/L of D-Ca Pantothenate; 11.78 mg/L of Choline Chloride; 
4.65 mg/L of Folic Acid; 15.60 mg/L of i-Inositol; 3.02 mg/L of Niacinamide; 3.00 mg/L of 
Pyridoxal HCL; 0.031 mg/L of Pyridoxine HCL; 0.319 mg/L of Riboflavin; 3.17 mg/L of 
25 Thiamine HCL; 0.365 mg/L of Thymidine; 0.680 mg/L of Vitamin B 12 ; 25 mM of HEPES 

Buffer; 2.39 mg/L of Na Hypoxanthine; 0.105 mg/L of Lipoic Acid; 0.081 mg/L of Sodium 
Putrescine-2HCL; 55.0 mg/L of Sodium Pyruvate; 0.0067 mg/L of Sodium Seienite; 20uM of 
Ethanolamine; 0.122 mg/L of Ferric Citrate; 41.70 mg/L of Methyl-B-Cyclcdextrin complexed 
with Linoleic Acid; 33.33 mg/L of Methyl-B-Cyclodextrin complexed with Oleic Acid; 10 mg/L 
30 of Methyl-B-Cyclodextrin complexed with Retinal Acetate. Adjust osmolarity to 327 mOsm) with 
2mm glutamine and lx penstrep. (BSA (81-068-3 Bayer) lOOgm dissolved in 1L DMEM for a 
10% BSA stock solution). Filter the media and collect 50 ul for endotoxin assay in 15ml 
polystyrene conical. 

The transfection reaction is terminated, preferably by tag-teaming, at the end of the 
35 incubation period. Person A aspirates off the transfection media, while person B adds 1.5ml 
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appropriate media to each well. Incubate at 37 degree C for 45 or 72 hours depending on the 
media used: 1 %BS A for 45 hours or CHO-5 for 72 hours. 

On day four, using a 30Qul multichannel pipetter, aliquot 600ul in one 1ml deep well plate 
and the remaining supernatant into a 2ml deep well. The supernatants from each well can then be 
5 used in the assays described in Examples 32-39. 

It is specifically understood that when activity is obtained in any of the assays described 
below using a supernatant, the activity originates from either the polypeptide of the present 
invention directly (e.g., as a secreted protein) or by polypeptide of the present invention inducing 
expression of other proteins, which are then secreted into the supernatant. Thus, the invention 
10 further provides a method of identifying the protein in the supernatant characterized by an activity 
in a particular assay. 

Example 22: Construction of GAS Reporter Construct 

15 One signal transduction pathway involved in the differentiation and proliferation of cells is 

called the Jaks-STATs pathway. Activated proteins in the Jaks-STATs pathway bind to gamma 
activation site "GAS" elements or interferon-sensitive responsive element ("ISRE"), located in the 
promoter of many genes. The binding of a protein to these elements alter the expression of the 
associated gene. 

20 GAS and ISRE elements are recognized by a class of transcription factors called Signal 

Transducers and Activators of Transcription, or "STATs." There are six members of the STATs 
family. Statl and Stat3 are present in many cell types, as is Stat2 (as response to IFN-alpha is 
widespread). Stat4 is more restricted and is not in many cell types though it has been found in T 
helper class I, cells after treatment with IL-12. Stat5 was originally called mammary growth 

25 factor, but has been found at higher concentrations in other cells including myeloid cells. It can be 
activated in tissue culture cells by many cytokines. 

The STATs are activated to translocate from the cytoplasm to the nucleus upon tyrosine 
phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") family. Jaks represent a 
distinct family of soluble tyrosine kinases and include Tyk2, Jakl, Jak2, and Jak3. These kinases 

30 display significant sequence similarity and are generally catalytically inactive in resting cells. 

The Jaks are activated by a wide range of receptors summarized in the Table below. 
(Adapted from review by Schidler and Darnell, Ann. Rev. Biochem. 64:621-51 (1995)). A 
cytokine receptor family, capable of activating Jaks, is divided into two groups: (a) Class 1 
includes receptors for IL-2, IL-3, IL-4, IL-6, IL-7, EL-9, IL-11, IL-12, IL-15, Epo, PRL, GH, G- 

35 CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and (b) Class 2 includes IFN-a, IFN-g, and IL- 
10. The Class 1 receptors share a conserved cysteine motif (a set of four conserved cysteines and 
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one tryptophan) and a WSXWS motif (a membrane proximal region encoding Trp-Ser-Xaa-Trp- 
Ser (SEQ ID NO: 2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn activate 
STATs, which then translocate and bind to GAS elements. This entire process is encompassed in 
the Jaks-STATs signal transduction pathway. Therefore, activation of the Jaks-STATs pathway, 
reflected by the binding of the GAS or the ISRE element, can be used to indicate proteins involved 
in the proliferation and differentiation of cells. For example, growth factors and cytokines are 
known to activate the Jaks-STATs pathway (See Table below). Thus, by using GAS elements 
linked to reporter molecules, activators of the Jaks-STATs pathway can be identified. 
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To construct a synthetic GAS containing promoter element, which is used in the 
Biological Assays described in Examples 32-33, a PCR based strategy is employed to generate a 
GAS-SV40 promoter sequence. The 5* primer contains four tandem copies of the GAS binding 
site found in the IRF1 promoter and previously demonstrated to bind STATs upon induction with a 
5 range of cytokines (Rothman et al., Immunity 1:457-468 (1994).), although other GAS or ISRE 
elements can be used instead. The 5' primer also contains 18bp of sequence complementary to the 
SV40 early promoter sequence and is flanked with an Xhol site. The sequence of the 5* primer is: 
5' rGCGCCTCGAGATTTCCCCGAAATCTAGATTTCCCCGAAATGATTTCCCCG^ 
ATGATTTCCCCGAAATATCTGCCATCTCAATTAG:3' (SEQ ID NO: 3) 
10 The downstream primer is complementary to the SV40 promoter and is flanked with a 

Hind ffl site: 5 ' :GCGGCAAGCTITTTGCAAAGCCTAGGC:3 ' (SEQ ID NO: 4) 

PCR amplification is performed using the SV40 promoter template present in the B- 
gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is digested with 
Xhol/Hind HI and subcloned into BLSK2-. (Stratagene.) Sequencing with forward and reverse 
15 primers coirfirms that the insert contains the following sequence: 

5 ' : CTCGAGA TTTCCCCGAAATCTAGATTTCCCCGAAATGAT 
TTTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCX 
GCCCATCCCGC<XCTAACTCCGCCCAGTTCCGC^ 
TTTTTTTTATTTATGCAGAGGCCGAG 
20 TGAGGAGGCTTTTTTGGAGGCCTA (SEQ ID NO: 5) 

With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2 reporter 
construct is next engineered. Here, the reporter molecule is a secreted alkaline phosphatase, or 
"SEAP." Clearly, however, any reporter molecule can be instead of SEAP, in this or in any of the 
other Examples. Well known reporter molecules that can be used instead of SEAP include 
25 chloramphenicol acetyltransferase (CAT), luciferase, alkaline phosphatase, B-galactosidase, green 
fluorescent protein (GFP), or any protein detectable by an antibody. 

The above sequence confirmed synthetic GAS-SV40 promoter element is subcloned into 
the pSEAP-Promoter vector obtained from Clontech using Hindm and Xhol, effectively replacing 
the SV40 promoter with the amplified GAS:SV40 promoter element, to create the GAS-SEAP 
30 vector. However, this vector does not contain a neomycin resistance gene, and therefore, is not 
preferred for mammalian expression systems. 

Thus, in order to generate mammalian stable cell lines expressing the GAS-SEAP reporter, 
the GAS-SEAP cassette is removed from the GAS-SEAP vector using Sail and NotI, and inserted 
into a backbone vector containing the neomycin resistance gene, such as pGEP-1 (Clontech), using 
35 these restriction sites in the multiple cloning site, to create the GAS-SEAP/Neo vector. Once this 
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vector is transfected into mammalian cells, this vector can then be used as a reporter molecule for 
GAS binding as described in Examples 32-33. 

Other constructs can be made using the above description and replacing GAS with a 
different promoter sequence. For example, construction of reporter molecules containing EGR and 
5 NF-KB promoter sequences are described in Examples 34 and 35. However, many other 
promoters can be substituted using the protocols described in these Examples. For instance, SRE, 
IL-2, NFAT, or Osteocalcin promoters can be substituted, alone or in combination (e.g., GAS/NF- 
KB/EGR, GAS/NF-KB, U-2/NFAT, or NF-KB/GAS). Similarly, other cell lines can be used to 
test reporter construct activity, such as HELA (epithelial), HUVEC (endothelial), Reh (B-cell), 
10 Saos-2 (osteoblast), HUVAC (aortic), or Cardiomyocyte. 

Example 23: Assay for SEAP Activity 

As a reporter molecule for the assays described in Examples 32-35, SEAP activity is 
assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to the following general 
procedure. The Tropix Phospho-light Kit supplies the Dilution, Assay, and Reaction Buffers used 
below. 

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 ul of 2.5x dilution buffer 
into Optiplates containing 35 ul of a supernatant. Seal the plates with a plastic sealer and incubate 
at 65 degree C for 30 min. Separate the Optiplates to avoid uneven heating. 

Cool the samples to room temperature for 15 minutes. Empty the dispenser and prime 
with the Assay Buffer. Add 50 ml Assay Buffer and incubate at room temperature 5 min. Empty 
the dispenser and prime with the Reaction Buffer (see the Table below). Add 50 ul Reaction 
Buffer and incubate at room temperature for 20 minutes. Since the intensity of the 
chemilurninescent signal is time dependent, and it takes about 10 minutes to read 5 plates on a 
luminometer, thus one should treat 5 plates at each time and start the second set 10 minutes later. 

Read the relative light unit in the luminometer. Set H12 as blank, and print the results. 
An increase in chemilurninescence indicates reporter activity. 

30 Reaction Buffer Formulation: 



# of plates Rxn buffer diluent (ml) CSPD (ml) 

10 60 3 

11 65 3.25 

12 70 3.5 

13 75 3.75 

14 80 4 

15 85 4.25 

16 90 4.5 



15 



20 



25 
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Example 24: High-Throughput Screening Assay Identifying Changes in Small 
Molecule Concentration and Membrane Permeability 

5 

Binding of a ligand to a receptor is known to alter intracellular levels of small molecules, 
such as calcium, potassium, sodium, and pH, as well as alter membrane potential. These 
alterations can be measured in an assay to identify supernatants which bind to receptors of a 
particular cell. Although the following protocol describes an assay for calcium, this protocol can 
10 easily be modified to detect changes in potassium, sodium, pH, membrane potential, or any other 
small molecule which is detectable by a fluorescent probe. 

The following assay uses Fluorometric Imaging Plate Reader ("FLIPR") to measure 
changes in fluorescent molecules (Molecular Probes) that bind small molecules. Clearly, any 
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fluorescent molecule detecting a small molecule can be used instead of the calcium fluorescent 
molecule, fluo-4 (Molecular Probes, Inc.; catalog no. F-14202), used here. 

For adherent cells, seed the cells at 10,000 -20,000 cells/well in a Co-star black 96-well 
plate with clear bottom The plate is incubated in a C0 2 incubator for 20 hours. The adherent cells 
5 are washed two times in Biotek washer with 200 ul of HBSS (Hank's Balanced Salt Solution) 
leaving 100 ul of buffer after the final wash. 

A stock solution of 1 mg/ml fluo-4 is made in 10% pluronic acid DMSO. To load the 
cells with fluo-4, 50 ul of 12 ug/ml fluo-4 is added to each well. The plate is incubated at 37 
degrees C in a C0 2 incubator for 60 min. The plate is washed four times in the Biotek washer with 
10 HBSS leaving 100 ul of buffer. 

For non-adherent cells, the cells are spun down from culture media. Cells are re- 
suspended to 2-5x1 0 6 cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml fluo-4 solution 
in 10% pluronic acid DMSO is added to each ml of cell suspension. The tube is then placed in a 
37 degrees C water bath for 30-60 min. The cells are washed twice with HBSS, resuspended to 
15 lxlO 6 cells/ml, and dispensed into a microplate, 100 ul/well. The plate is centrifuged at 1000 rpm 
for 5 min. The plate is then washed once in Denley Cell Wash with 200 ul, followed by an 
aspiration step to 100 ul final volume. 

For a non-cell based assay, each well contains a fluorescent molecule, such as fluo-4 . The 
supernatant is added to the well, and a change in fluorescence is detected. 
20 To measure the fluorescence of intracellular calcium, the FLIPR is set for the following 

parameters: (1) System gain is 300-800 raW; (2) Exposure time is 0.4 second; (3) Camera F/stop 
is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; and (6) Sample addition is 50 ul. 
Increased emission at 530 nm indicates an extracellular signaling event caused by the a molecule, 
either polypeptide of the present invention or a molecule induced by polypeptide of the present 

25 invention, which has resulted in an increase in the intracellular Ca ++ concentration. 

Example 25: High-Ttiroughput Screening Assay Identifying Tyrosine Kinase Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of transmembrane and 
30 cytoplasmic kinases. Within the Receptor Protein Tyrosine Kinase RPTK) group are receptors for 
a range of mitogenic and metabolic growth factors including the PDGF, FGF, EGF, NGF, HGF 
and Insulin receptor subfamilies. In addition there are a large family of RPTKs for which the 
corresponding ligand is unknown. Ligands for RPTKs include mainly secreted small proteins, but 
also membrane-bound and extracellular matrix proteins. 
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Activation of RPTK by ligands involves ligand-mediated receptor dimerization, resulting 
in transphosphorylation of the receptor subunits and activation of the cytoplasmic tyrosine kinases. 
The cytoplasmic tyrosine kinases include receptor associated tyrosine kinases of the src-family 
(e.g., src, yes, lck, lyn, fyn) and non-receptor linked and cytosolic protein tyrosine kinases, such as 
5 the Jak family, members of which mediate signal transduction triggered by the cytokine 
superfamily of receptors (e.g., the Interleukins, Interferons, GM-CSF, and Leptin). 

Because of the wide range of known factors capable of stimulating tyrosine kinase 
activity, identifying whether polypeptide of the present invention or a molecule induced by 
polypeptide of the present invention is capable of activating tyrosine kinase signal transduction 

10 pathways is of interest. Therefore, the following protocol is designed to identify such molecules 
capable of activating the tyrosine kinase signal transduction pathways. 

Seed target cells (e.g., primary keratinocytes) at a density of approximately 25,000 cells 
per well in a 96 well Loprodyne Silent Screen Plates purchased from Nalge Nunc (Naperville, IL). 
The plates are sterilized with two 30 minute rinses with 100% ethanol, rinsed with water and dried 

15 overnight Some plates are coated for 2 hr with 100 ml of cell culture grade type I collagen (50 
mg/ml), gelatin (2%) or polylysine (50 mg/ml), all of which can be purchased from Sigma 
Chemicals (St Louis, MO) or 10% Matrigel purchased from Becton Dickinson (Bedford,MA), or 
calf serum, rinsed with PBS and stored at 4 degree C. Cell growth on these plates is assayed by 
seeding 5,000 cells/well in growth medium and indirect quantitation of cell number through use of 

20 alamarBlue as described by the manufacturer Alamar Biosciences, Inc. (Sacramento, CA) after 48 
hr. Falcon plate covers #3071 from Becton Dickinson (Bedford,MA) are used to cover the 
Loprodyne Silent Screen Plates. Falcon Microtest IE cell culture plates can also be used jn some 
proliferation experiments. 

To prepare extracts, A431 cells are seeded onto the nylon membranes of Loprodyne plates 

25 (20,000/200nu7well) and cultured overnight in complete medium. Cells are quiesced by 
incubation in serum-free basal medium for 24 hr. After 5-20 minutes treatment with EGF 
(60ng/ml) or 50 ill of the supernatant produced in Example 21, the medium was removed and 100 
ml of extraction buffer ((20 mM HEPES pH 7.5, 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 
mM Na3V04, 2 mM Na4P207 and a cocktail of protease inhibitors (# 1836170) obtained from 

30 Boeheringer Mannheim (Indianapolis, IN)) is added to each well and the plate is shaken on a 

rotating shaker for 5 minutes at 4°C. The plate is then placed in a vacuum transfer manifold and 
the extract filtered through the 0.45 mm membrane bottoms of each well using house vacuum. 
Extracts are collected in a 96-well catch/assay plate in the bottom of the vacuum manifold and 
immediately placed on ice. To obtain extracts clarified by centrifugation, the content of each well, 
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after detergent solubilization for 5 minutes, is removed and centrifuged for 15 minutes at 4 degree 
Cat 16,000 xg. 

Test the filtered extracts for levels of tyrosine kinase activity. Although many methods of 
detecting tyrosine kinase activity are known, one method is described here. 
5 Generally, the tyrosine kinase activity of a supernatant is evaluated by determining its 

ability to phosphorylate a tyrosine residue on a specific substrate (a biotinylated peptide). 
Biotinylated peptides that can be used for this purpose include PSK1 (corresponding to amino 
acids 6-20 of the cell division kinase cdc2-p34) and PSK2 (corresponding to amino acids 1-17 of 
gastrin). Both peptides are substrates for a range of tyrosine kinases and are available from 
10 Boehringer Mannheim. 

The tyrosine kinase reaction is set up by adding the following components in order. First, 
add lOul of 5uM Biotinylated Peptide, then lOul ATP/Mg2+ (5mM ATP/50mM MgCl2), then 

lOul of 5x Assay Buffer (40mM imidazole hydrochloride, pH7.3, 40 mM beta-glycerophosphate, 
ImM EGTA, lOOmM MgQ^ 5 mM MnCl 2j 0.5 mg/ml BSA), then 5ul of Sodium 

15 Vanadate(lmM), and then 5ul of water. Mix the components gently and preincubate the reaction 
mix at 30 degree C for 2 min. Initial the reaction by adding lOul of the control enzyme or the 
filtered supernatant. 

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 120mm EDTA 
and place the reactions on ice. 

20 Tyrosine kinase activity is determined by transferring 50 ul aliquot of reaction mixture to a 

microtiter plate (MTP) module and incubating at 37 degree C for 20 min. This allows the 
streptavidin coated 96 well plate to associate with the biotinylated peptide. Wash the MTP module 
with 300ul/well of PBS four times. Next add 75 ul of anti-phospotyrosine antibody conjugated to 
horse radish peroxidase(anti-P-Tyr-POD(0.5u/ml)) to each well and incubate at 37 degree C for 

25 one hour. Wash the well as above. 

Next add lOOul of peroxidase substrate solution (Boehringer Mannheim) and incubate at 
room temperature for at least 5 mins (up to 30 min). Measure the absorbance of the sample at 405 
nm by using ELISA reader. The level of bound peroxidase activity is quantitated using an ELISA 
reader and reflects the level of tyrosine kinase activity. 

30 

Example 26: High-Throughput Screening Assay Identifying Phosphorylation Activity 

As a potential alternative and/or complement to the assay of protein tyrosine kinase 
activity described in Example 25, an assay which detects activation (phosphorylation) of major 
35 intracellular signal transduction intermediates can also be used. For example, as described below 
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one particular assay can detect tyrosine phosphorylation of the Erk-1 and Erk-2 kinases. However, 
phosphorylation of other molecules, such as Raf, JNK, p38 MAP, Map kinase kinase (MEK), 
MEK kinase, Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any other 
phosphoserine, phosphotyrosme, or phosphothreonine molecule, can be detected by substituting 
5 these molecules for Erk-1 or Erk-2 in the following assay. 

Specifically, assay plates are made by coating the wells of a 96-well ELISA plate with 
0.1ml of protein G (lug/ml) for 2 hr at room temp, (RT). The plates are then rinsed with PBS and 
blocked with 3% BSA/PBS for 1 hr at RT. The protein G plates are then treated with 2 
commercial monoclonal antibodies (lOOng/well) against Erk-1 and Erk-2 (1 hr at RT) (Santa Cruz 

10 Biotechnology). (To detect other molecules, this step can easily be modified by substituting a 
monoclonal antibody detecting any of the above described molecules.) After 3-5 rinses with PBS, 
the plates are stored at 4 degree C until use. 

A431 cells are seeded at 20,000/well in a 96-well Loprodyne filterplate and cultured 
overnight in growth medium. The cells are then starved for 48 hr in basal medium (DMEM) and 

15 then treated with EGF (6ng/well) or 50 ul of the supernatants obtained in Example 21 for 5-20 
minutes. The cells are then solubilized and extracts filtered directly into the assay plate. 

After incubation with the extract for 1 hr at RT, the wells are again rinsed. As a positive 
control, a commercial preparation of MAP kinase (lOng/well) is used in place of A431 extract. 
Plates are then treated with a commercial polyclonal (rabbit) antibody (lug/ml) which specifically 

20 recognizes the phosphorylated epitope of the Erk-1 and Erk-2 kinases (1 hr at RT). This antibody 
is biotinylated by standard procedures. The bound polyclonal antibody is then quantitated by 
successive incubations with Europium-streptavidin and Europium fluorescence enhancing reagent 
in the Wallac DELFIA instrument (time-resolved fluorescence). An increased fluorescent signal 
over background indicates a phosphorylation by polypeptide of the present invention or a molecule 

25 induced by polypeptide of the present invention. 

Example 27: Cellular Adhesion Molecule (CAM) Expression on Endothelial Cells 

The recruitment of lymphocytes to areas of inflammation and angiogenesis involves 
30 specific receptor-ligand interactions between cell surface adhesion molecules (CAMs) on 
lymphocytes and the vascular endothelium. The adhesion process, in both normal and 
pathological settings, follows a multi-step cascade that involves intercellular adhesion molecule- 1 
(ICAM-1), vascular cell adhesion molecule-1 (VCAM-1), and endothelial leukocyte adhesion 
molecule-1 (E-selectin) expression on endothelial cells (EC). The expression of these molecules 
35 and others on the vascular endothelium deterrnines the efficiency with which leukocytes may 
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adhere to the local vasculature and extravasate into the local tissue during the development of an 
inflammatory response. The local concentration of cytokines and growth factor participate in the 
modulation of the expression of these CAMs. 

Briefly, endothelial cells (e.g., Human Umbilical Vein Endothelial cells (HUVECs)) are 
5 grown in a standard 96 well plate to confluence, growth medium is removed from the cells and 
replaced with 100 (i\ of 199 Medium (10% fetal bovine serum (FBS)). Samples for testing and 
positive or negative controls are added to the plate in triplicate (in 10 jxl volumes). Plates are then 
incubated at 37°C for either 5 h (selectin and integrin expression) or 24 h (integrin expression 
only). Plates are aspirated to remove medium and 100 /xl of 0.1% paraforrnaldehyde-PBS(with 

10 Ca++ and Mg++) is added to each well. Plates are held at 4°C for 30 min. Fixative is removed 
from the wells and wells are washed IX with PBS(+CaJSlg) + 0.5% BSA and drained. 10 jii of 
diluted primary antibody is added to the test and control wells. Anti-ICAM-l-Biotin, Anti- 
VCAM-l-Biotin and Anti-E-selectin-Biotin are used at a concentration of 10 /ig/ml (1:10 dilution 
of 0.1 mg/ml stock antibody). Cells are incubated at 37°C for 30 min. in a humidified 

15 environment. Wells are washed three times with PBS(+Ca,Mg) + 0.5% BSA. 20 \i\ of diluted 
ExtrAvidin-Alkaline Phosphatase (1:5,000 dilution, referred to herein as the working dilution) are 
added to each well and incubated at 37°C for 30 min. Wells are washed three times with 
PBS(+Ca,Mg)+0.5% BSA. Dissolve 1 tablet of p-Nitrophenol Phosphate pNPP per 5 ml of 
glycine buffer (pH 10.4). 100 /xl of pNPP substrate in glycine buffer is added to each test well. 

20 Standard wells in triplicate are prepared from the working dilution of the ExtrAvidin-Alkaline 
Phosphotase in glycine buffer: 1:5,000 (10°) > 10 ° 5 > 10' 1 > 10 -1 5 . 5 /xl of each dilution is added to 
triplicate wells and the resulting AP content in each well is 5.50 ng, 1.74 ng, 0.55 ng, 0.18 ng. 
100 /xl of pNNP reagent is then added to each of the standard wells. The plate is incubated at 37°C 
for 4h. A volume of 50 ul of 3M NaOH is added to all wells. The plate is read on a plate reader at 

25 405 nm using the background subtraction option on blank wells filled with glycine buffer only. 
Additionally, the template is set up to indicate the concentration of AP-conjugate in each standard 
well [ 5.50 ng; 1.74 ng; 0.55 ng; 0.18 ng]. Results are indicated as amount of bound AP-conjugate 
in each sample. 

■ 

30 Example 28: Alamar Blue Endothelial Cells Proliferation Assay 

* 

This assay may be used to quantitatively determine protein mediated inhibition of bFGF- 
induced proliferation of Bovine Lymphatic Endothelial Cells (DECs), Bovine Aortic Endothelial 
Cells (BAECs) or Human Microvascular Uterine Myometrial Cells (UTMECs). This assay 
35 incorporates a fluorometric growth indicator based on detection of metabolic activity. A standard 
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Alamar Blue Proliferation Assay is prepared in EGM-2MV with 10 ng /ml of bFGF added as a 
source of endothelial cell stimulatiorL This assay may be used with a variety of endothelial cells 
with slight changes in growth medium and cell concentration. Dilutions of the protein batches to 
be tested are diluted as appropriate. Serum-free medium (GIBCO SFM) without bFGF is used as a 
5 non-stimulated control and Angiostatin or TSP-1 are included as a known inhibitory controls. 

Briefly, LEC, BAECs or UTMECs are seeded in growth media at a density of 5000 to 
2000 cells/well in a 96 well plate and placed at 37 degreesC overnight. After the overnight 
incubation of the cells, the growth media is removed and replaced with GIBCO EC-SFM. The 
cells are treated with the appropriate dilutions of the protein of interest or control protein sample(s) 
10 (prepared in SFM ) in triplicate wells with additional bFGF to a concentration of 10 ng/ ml. Once 
the cells have been treated with the samples, the plate(s) is/are placed back in the 37° C incubator 
for three days. After three days 10 ml of stock alamar blue (Biosource Cat# DAL1 100) is added to 
each well and the plate(s) is/are placed back in the 37°C incubator for four hours. The plate(s) are 
then read at 530nm excitation and 590nm emission using the CytoFluor fluorescence reader. 
15 Direct output is recorded in relative fluorescence units. 

Alamar blue is an oxidation-reduction indicator that both fluoresces and changes color in 
response to chemical reduction of growth medium resulting from cell growth. As cells grow in 
culture, innate metabolic activity results in a chemical reduction of the immediate surrounding 
environment. Reduction related to growth causes the indicator to change from oxidized (non- 
20 fluorescent blue) form to reduced (fluorescent red) form (i.e., stimulated proliferation will produce 
a stronger signal and inhibited proliferation will produce a weaker signal and the total signal is 
proportional to the total number of cells as well as their metabolic activity). The background level 
of activity is observed with the starvation medium alone. This is compared to the output observed 
from the positive control samples (bFGF in growth medium) and protein dilutions. 

25 

Example 29: Detection of Inhibition of a Mixed Lymphocyte Reaction 

This assay can be used to detect and evaluate inhibition of a Mixed Lymphocyte Reaction 
(MLR) by gene products (e.g., isolated polypeptides). Inhibition of a MLR may be due to a direct 

30 effect on cell proliferation and viability, modulation of costimulatory molecules on interacting 
cells, modulation of adhesiveness between lymphocytes and accessory cells, or modulation of 
cytokine production by accessory cells. Multiple cells may be targeted by these polypeptides since 
the peripheral blood mononuclear fraction used in this assay includes T, B and natural killer 
lymphocytes, as well as monocytes and dendritic cells. 

35 Polypeptides of interest found to inhibit the MLR may find application in diseases 
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associated with lymphocyte and monocyte activation or proliferation. These include, but are not 
limited to, diseases such as asthma, arthritis, diabetes, inflammatory skin conditions, psoriasis, 
eczema, systemic lupus erythematosus, multiple sclerosis, glomerulonephritis, inflammatory bowel 
disease, Crohn's disease, ulcerative colitis, arteriosclerosis, cirrhosis, graft vs. host disease, host vs. 
5 graft disease, hepatitis, leukemia and lymphoma. 

Briefly, PBMCs from human donors are purified by density gradient centrifugation using 
Lymphocyte Separation Medium (LSM®, density 1.0770 g/ml, Organon Teknika Corporation, 
West Chester, PA). PBMCs from two donors are adjusted to 2 x 10 6 cells/ml in RPMM640 (Life 
Technologies, Grand Island, NY) supplemented with 10% FCS and 2 rnM glutamine. PBMCs 

10 from a third donor is adjusted to 2 x 10 5 cells/ml. Fifty microliters of PBMCs from each donor is 
added to wells of a 96-weIl round bottom microliter plate. Dilutions of test materials (50 jil) is 
added in triplicate to microtiter wells. Test samples (of the protein of interest) are added for final 
dilution of 1:4; rhuIL-2 (R&D Systems, Minneapolis, MN, catalog number 202-IL) is added to a 
final concentration of 1 Hg/ml; anti-CD4 mAb (R&D Systems, clone 34930.11, catalog number 

15 • MAB379) is added to a final concentration of 10 u,g/ml. Cells are cultured for 7-8 days at 37°C in 
5% CO2, and 1 u,C of [ 3 H] thymidine is added to wells for the last 16 hrs of culture. Cells are 
harvested and thymidine incorporation determined using a Packard TopCount Data is expressed 
as the mean and standard deviation of triplicate determinations. 

Samples of the protein of interest are screened in separate experiments and compared to 

20 the negative control treatment, anti-CD4 mAb, which inhibits proliferation of lymphocytes and the 
positive control treatment, IL-2 (either as recombinant material or supernatant), which enhances 
proliferation of lymphocytes. 

One skilled in the art could easily modify the exemplified studies to test the activity of 
polynucleotides (e.g., gene therapy), antibodies, agonists, and/or antagonists and fragments and 

25 variants thereof. 



Example 30: Assays for Protease Activity 

30 The following assay may be used to assess protease activity of the polypeptides of the 

invention. 

Gelatin and casein zymography are performed essentially as described (Heusen et al., 
Anal BwcherrL, 102:196-202 (1980); Wilson et al., Journal of Urology, 149:653-658 (1993)). 
Samples are run on 10% polyacryamide/0.1% SDS gels containing 1% gelain orcasein, soaked in 
35 2.5% triton at room temperature for 1 hour, and in 0.1M glycine, pH 8.3 at 37°C 5 to 16 hours. 
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After staining in arnido black areas of proteolysis apear as clear areas agains the blue-black 
background. Trypsin (Sigma T8642) is used as a positive control. 

Protease activity is also determined by monitoring the cleavage of n-a-benzoyl-L-arginine 
ethyl ester (BAEE) (Sigma B-4500. Reactions are set up in (25rrMfNraP0 4 ,lmM EDTA, and ImM 
5 BAEE), pH 7.5. Samples are added and the change in adsorb ance at 260am is monitored on the 
Beckman DU-6 spectrophotometer in the time-drive mode. Trypsin is used as a positive control. 

Additional assays based upon the release of acid-soluble peptides from casein or 
hemoglobin measured as adsorbance at 280 nm or colorimetrically using the Folin method are 
performed as described in Bergmeyer, et aL, Methods of Enzymatic Analysis, 5 (1984). Other 
10 assays involve the solubilization of chromogenic substrates (Ward, Applied Science, 251-317 
(1983)). 

Example 31: Identifying Serine Protease Substrate Specificity 

15 Methods known in the art or described herein may be used to determine the substrate 

specificity of the polypeptides of the present invention having serine protease activity. A preferred 
method of determining substrate specificity is by the use of positional scanning synthetic 
combinatorial libraries as described in GB 2 324 529 (incorporated herein in its entirety). 

20 Example 32: Ligand Binding Assays 

The following assay may be used to assess ligand binding activity of the polypeptides of 
the invention. 

Ligand binding assays provide a direct method for ascertaining receptor pharmacology and 

■ 

25 are adaptable to a high throughput format. The purified ligand for a polypeptide is radiolabeled to 
high specific activity (50-2000 Ci/mmol) for binding studies. A determination is then made that 
the process of radiolabeling does not diminish the activity of the ligand towards its polypeptide. 
Assay conditions for buffers, ions, pH and other modulators such as nucleotides are optimized to 
establish a workable signal to noise ratio for both membrane and whole cell polypeptide sources. 

30 For these assays, specific polypeptide binding is defined as total associated radioactivity minus the 
radioactivity measured in the presence of an excess of unlabeled competing ligand. Where 
possible, more than one competing ligand is used to define residual nonspecific binding. 

Example 33: Functional Assay in Xenopus Oocytes 

35 
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Capped RNA transcripts from linearized plasmid templates encoding the polypeptides of 
the invention are synthesized in vitro with RNA polymerases in accordance with standard 
procedures. In vitro transcripts are suspended in water at a final concentration of 0.2 mg/mi. 
Ovarian lobes are removed from adult female toads, Stage V defolliculated oocytes are obtained, 
5 and RNA transcripts (10 ng/oocytc) are injected in a 50 nl bolus using a microinjection apparatus. 
Two electrode voltage clamps are used to measure the currents from individual Xenopus oocytes in 
response polypeptides and polypeptide agonist exposure. Recordings are made in Ca2+ free 
Barth's medium at room temperature. The Xenopus system can be used to screen known ligands 
and tissue/cell extracts for activating ligands. 

10 

Example 34: Microphysiometric Assays 

Activation of a wide variety of secondary messenger systems results in extrusion of small 
amounts of acid from a cell. The acid formed is largely as a result of the increased metabolic 
15 activity required to fuel the intracellular signaling process. The pH changes in the media 
surrounding the cell are very small but are detectable by the CYTOSENSOR microphysiometer 
(Molecular Devices Ltd., Menlo Park, Calif.). The CYTOSENSOR is thus capable of detecting the 
activation of polypeptide which is coupled to an energy utilizing intracellular signaling pathway. 

20 Example 35: Extract/Cell Supernatant Screening 

A large number of mammahan receptors exist for which there remains, as yet, no cognate 
activating ligand (agonist). Thus, active ligands for these receptors may not be included within the 
ligands banks as identified to date. Accordingly, the polypeptides of the invention can also be 
25 functionally screened (using calcium, cAMP, microphysiometer, oocyte electrophysiology, etc., 
functional screens) against tissue extracts to identify its natural ligands. Extracts that produce 
positive functional responses can be sequentially subfractionated until an activating ligand is 
isolated and identified. 

■ 

30 

Example 36: Calcium and cAMP Functional Assays 

Seven transmembrane receptors which are expressed in HEK 293 cells have been shown 
to be coupled functionally to activation of PLC and calcium mobilization and/or cAMP stimulation 
35 or inhibition. Basal calcium levels in the HEK 293 cells in receptor-transfected or vector control 
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cells were observed to be in the normal, 100 nM to 200 nM, range. HEK 293 cells expressing 
recombinant receptors are loaded with fura 2 and in a single day >150 selected ligands or 
tissue/cell extracts are evaluated for agonist induced calcium mobilization. Similarly, HEK 293 
cells expressing recombinant receptors are evaluated for the stimulation or inhibition of cAMP 
5 production using standard cAMP quantitation assays. Agonists presenting a calcium transient or 
cAMP fluctuation are tested in vector control cells to determine if the response is unique to the 
transfected cells expressing receptor. 

Example 37: ATP-binding assay 

10 

The following assay may be used to assess ATP-binding activity of polypeptides of the 
invention. 

ATP-binding activity of the polypeptides of the invention may be detected using the ATP- 
binding assay described in U.S. Patent 5,858,719, which is herein incorporated by reference in its 

15 entirety. Briefly, ATP-binding to polypeptides of the invention is measured via photoaffinity 
labeling with 8-azido-ATP in a competition assay. Reaction mixtures containing 1 mg/ml of the 
ABC transport protein of the present invention are incubated with varying concentrations of ATP, 
or the non-hydrolyzable ATP analog adenyl-S'-imidodiphosphate for 10 minutes at 4°C. A mixture 
of 8-azido-ATP (Sigma Chem. Corp., St. Louis, MO.) plus 8-azido-ATP ( 32 P-ATP) (5 mCiAimol, 

20 ICN, Irvine CA.) is added to a final concentration of 100 jiM and 0.5 ml aliquots are placed in the 
wells of a porcelain spot plate on ice. The plate is irradiated using a short wave 254 ran UV lamp 
at a distance of 2.5 cm from the plate for two one-minute intervals with a one-minute cooling 
interval in between. The reaction is stopped by addition of dithiothreitol to a final concentration of 
2mM. The incubations are subjected to SDS-PAGE electrophoresis, dried, and autoradiographed. 

25 Protein bands corresponding to the particular polypeptides of the invention are excised, and the 
radioactivity quantified. A decrease in radioactivity with increasing ATP or adenly-5 1 - 
imidodiphosphate provides a measure of ATP affinity to the polypeptides. 



30 Example 38: Small Molecule Screening 

This invention is particularly useful for screening therapeutic compounds by using the 
polypeptides of the invention, or binding fragments thereof, in any of a variety of drug screening 
techniques. The polypeptide or fragment employed in such a test may be affixed to a solid support, 
35 expressed on a cell surface, free in solution, or located intracellularly. One method of drug 
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screening utilizes eukaryotic or prokaryotic host cells which are stably transformed with 
recombinant nucleic acids expressing the polypeptide or fragment. Drugs are screened against such 
transformed cells in competitive binding assays. One may measure, for example, the formulation 
of complexes between the agent being tested and polypeptide of the invention. 
5 Thus, the present invention provides methods of screening for drugs or any other agents 

which affect activities mediated by the polypeptides of the invention. These methods comprise 
contacting such an agent with a polypeptide of the invention or fragment thereof and assaying for 
the presence of a complex between the agent and the polypeptide or fragment thereof, by methods 
well known in the art. In such a competitive binding assay, the agents to screen are typically 

10 labeled. Following incubation, free agent is separated from that present in bound form, and the 
amount of free or uncomplexed label is a measure of the ability of a particular agent to bind to the 
polypeptides of the invention. 

Another technique for drug screening provides high throughput screening for compounds 
having suitable binding affinity to the polypeptides of the invention, and is described in great detail 

15 in European Patent Application 84/03564, published on September 13, 1984, which is herein 
incorporated by reference in its entirety. Briefly stated, large numbers of different small molecule 
test compounds are synthesized on a solid substrate, such as plastic pins or some other surface. The 
test compounds are reacted with polypeptides of the invention and washed. Bound polypeptides 
are then detected by methods well known in the art Purified polypeptides are coated directly onto 

20 plates for use in the aforementioned drug screening techniques. In addition, non-neutralizing 
antibodies may be used to capture the peptide and immobilize it on the solid support. 

This invention also contemplates the use of competitive drug screening assays in which 
neutralizing antibodies capable of binding polypeptides of the invention specifically compete with 
a test compound for binding to the polypeptides or fragments thereof. In this manner, the 

25 antibodies are used to detect the presence of any peptide which shares one or more antigenic 
epitopes with a polypeptide of the invention. 

Example 39: Phosphorylation Assay 

« 

30 In order to assay for phosphorylation activity of the polypeptides of the invention, a 

phosphorylation assay as described in U.S. Patent 5,958,405 (which is herein incorporated by 
reference) is utilized. Briefly, phosphorylation activity may be measured by phosphorylation of a 
protein substrate using gamma-labeled 32 P-ATP and quantitation of the incorporated radioactivity 
using a gamma radioisotope counter. The polypeptides of the invention are incubated with the 

35 protein substrate, ^P-ATP, and a kinase buffer. The 32 P incorporated into the substrate is then 
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separated from free P-ATP by electrophoresis, and the incorporated P is counted and compared 
to a negative control. Radioactivity counts above the negative control are indicative of 
phosphorylation activity of the polypeptides of the invention. 

5 Example 40: Detection of Phosphorylation Activity (Activation) of the Polypeptides of 

the Invention in the Presence of Polypeptide Ugands 

Methods known in the art or described herein may be used to determine the 
phosphorylation activity of the polypeptides of the invention. A preferred method of determining 
10 phosphorylation activity is by the use of the tyrosine phosphorylation assay as described in US 
5,817,471 (incorporated herein by reference). 

Example 41: Identification Of Signal Transduction Proteins That Interact With 
1 5 Polypeptides Of The Present Invention 

The purified polypeptides of the invention are research tools for the identification, 
characterization and purification of additional signal transduction pathway proteins or receptor 
proteins. Briefly, labeled polypeptides of the invention are useful as reagents for the purification of 

20 molecules with which it interacts. In one embodiment of affinity purification, polypeptides of the 
invention are covalently coupled to a chromatography column. Cell-free extract derived from 
putative target cells, such as carcinoma tissues, is passed over the column, and molecules with 
appropriate affinity bind to the polypeptides of the invention. The protein complex is recovered 
from the column, dissociated, and the recovered molecule subjected to N^erminal protein 

25 sequencing. This amino acid sequence is then used to identify the captured molecule or to design 
degenerate oligonucleotide probes for cloning the relevant gene from an appropriate cDNA library. 

Example 42: Assay for Phosphatase Activity 

30 The following assay may be used to assess serine/threonine phosphatase (PTPase) activity 

of the polypeptides of the invention. 

In order to assay for serine/threonine phosphatase (PTPase) activity, assays can be utilized 
which are widely known to those skilled in the art. For example, the serine/threonine phosphatase 
(PSPase) activity is measured using a PSPase assay kit from New England Biolabs, hie. Myelin 

35 basic protein (MyBP), a substrate for PSPase, is phosphorylated on serine and threonine residues 
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with cAMP-dependent Protein Kinase in the presence of [ 32 P]ATP. Protein serine/threonine 
phosphatase activity is then determined by measuring the release of inorganic phosphate from 32P- 
labeled MyBP. 

5 Example 43: Interaction of Serine/Threonine Phospliatases with other Proteins 

The polypeptides of the invention with serine/threonine phosphatase activity as determined 
in Example 42 are research tools for the identification, characterization and purification of 
additional interacting proteins or receptor proteins, or other signal transduction pathway proteins. 

10 Briefly, labeled polypeptide(s) of the invention is useful as a reagent for the purification of 
molecules with which it interacts. In one embodiment of affinity purification, polypeptide of the 
invention is covalently coupled to a chromatography column. Cell-free extract derived from 
putative target cells, such as neural or liver cells, is passed over the column, and molecules with 
appropriate affinity bind to the polypeptides of the invention. The polypeptides of the invention - 

15 complex is recovered from the column, dissociated, and the recovered molecule subjected to N- 
terminal protein sequencing. This amino acid sequence is then used to identify the captured 
molecule or to design degenerate oligonucleotide probes for cloning the relevant gene from an 
appropriate cDNA library. 

20 Example 44: Assaying for Heparanase Activity 

In order to assay for heparanase activity of the polypeptides of the invention, the 
heparanase assay described by Vlodavsky et al is utilized (Vlodavsky, L, et al., Nat. Med., 5:793- 
802 (1999)). Briefly, cell lysates, conditioned media or intact cells (1 x 10 6 cells per 35-mm dish) 

25 are incubated for 18 hrs at 37°C, pH 6.2-6.6, with 35 S-labeled ECM or soluble ECM derived peak I 
proteoglycans. The incubation medium is centrifuged and the supernatant is analyzed by gel 
filtration on a Sepharose CL-6B column (0.9 x 30 cm). Fractions are eluted with PBS and their 
radioactivity is measured. Degradation fragments of heparan sulfate side chains are eluted from 
Sepharose 6B at 0.5 < K av < 0.8 (peak H). Each experiment is done at least three times. 

30 Degradation fragments corresponding to "peak n," as described by Vlodavsky et al., is indicative 
of the activity of the polypeptides of the invention in cleaving heparan sulfate. 

Example 45: Immobilization ofbiomolecnles 

35 This example provides a method for the stabilization of polypeptides of the invention in 

non-host cell lipid bilayer constucts (see, e.g., Bieri et al., Nature Biotech 17:1105-1108 (1999), 
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hereby incorporated by reference in its entirety herein) which can be adapted for the study of 
polypeptides of the invention in the various functional assays described above. Briefly, 
carbohydrate-specific chemistry for biotinylation is used to confine a biotin tag to the extracellular 
domain of the polypeptides of the invention, thus allowing uniform orientation upon 
5 immobilization. A 50uM solution of polypeptides of the invention in washed membranes is 
incubated with 20 mM NaI04 and 1.5 mg/ml (4mM) BACH or 2 mg/ml (7.5mM) biotin- 
hydrazide for 1 hr at room temperature (reaction volume, 150ul). Then the sample is dialyzed 
(Pierce Slidealizer Cassett, 10 kDa cutoff; Pierce Chemical Co., Rockford DL) at 4C first for 5 h, 
exchanging the buffer after each hour, and finally for 12 h against 500 ml buffer R (0.15 M NaCl, 
10 1 mM MgC12, 10 mM sodium phosphate, pH7). Just before addition into a cuvette, the sample is 
diluted 1:5 in buffer ROG50 (Buffer R supplemented with 50 mM octylglucoside). 



Example 46: TAQMAN 

15 

Quantitative PCR (QPCR). Total RNA from cells in culture are extracted by Trizol 
separation as recommended by the supplier (LifeTechnologies). (Total RNA is treated with DNase 
I (Life Technologies) to remove any contaminating genomic DNA before reverse transcription.) 
Total RNA (50 ng) is used in a one-step, 50ul, RT-QPCR, consisting of Taqman Buffer A (Perkin- 

20 Elmer; 50 mM KC1/10 mM Tris, pH 8.3), 5.5 mM MgCl 2 , 240 jtM each dNTP, 0.4 units RNase 
inhibitor(Promega), 8%glycerol, 0.012% Tween-20, 0.05% gelatin, 0.3uM primers, O.luM probe, 
0.025units Amplitaq Gold (Perkin-Elmer) and 2.5 units Superscript II reverse transcriptase (Life 
Technologies). As a control for genomic contamination, parallel reactions are setup without 
reverse transcriptase. The relative abundance of (unknown) and 18S RNAs are assessed by using 

25 the Applied Biosystems Prism 7700 Sequence Detection System (Livak, K. J., Flood, S. J., 
Marmaro, J., Giusti, W. & Deetz, K. (1995) PCR Methods Appl. 4, 357-362). Reactions are 
carried out at 48°C for 30 min, 95°C for 10 min, followed by 40 cycles of 95°C for 15s, 60°C for 1 
min. Reactions are performed in triplicate. 

Primers (f & r) and FRET probes sets are designed using Primer Express Software 

30 (Perkin-Elmer). Probes are labeled at the 5'-end with the reporter dye 6-FAM and on the 3'-end 
with the quencher dye TAMRA (Biosource International, Camarillo, CA or Perkin-Elmer). 



Example 47: Assays for Metalloproteinase Activity 

Metalloproteinases (EC 3.4.24.-) are peptide hydrolases which use metal ions, such as 
35 Zn 2+ , as the catalytic mechanism. Metalloproteinase activity of polypeptides of the present 
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invention can be assayed according to the following methods. 
Proteolysis of alpha-2-macroglobulin 

To confirm protease activity, purified polypeptides of the invention are mixed with the 
5 substrate alpha-2-macroglobulin (0.2 unit/ml; Boehringer Mannheim, Germany) in Ix assay buffer 
(50 mM HEPES, pH 7.5, 0.2 M NaCl, 10 mM CaCl 2 , 25 jiM ZnCl 2 and 0.05% Brij-35) and 
incubated at 37°C for 1-5 days. Trypsin is used as positive control. Negative controls contain only 
alpha-2-macroglobulin in assay buffer. The samples are' collected and boiled in SDS-PAGE 
sample buffer containing 5% 2-mercaptoethanol for 5-min, then loaded onto 8% SDS- 
10 polyacrylamide gel. After electrophoresis the proteins are visualized by silver staining. Proteolysis 
is evident by the appearance of lower molecular weight bands as compared to the negative control. 

Inhibition of alpha-2-macro globulin proteolysis by inhibitors of metalloproteinases 
Known metalloproteinase inhibitors (metal chelators (EDTA, EGTA, AND HgCl 2 ), 

15 peptide metalloproteinase inhibitors (TIMP-1 and TIMP-2), and commercial small molecule MMP 
inhibitors) are used to characterize the proteolytic activity of polypeptides of the invention. The 
three synthetic MMP inhibitors used are: MMP inhibitor I, [IC50 =1.0 |iM against MMP-1 and 
MMP-8; IC 50 = 30 *iM against MMP-9; IC 50 = 150 pM against MMP-3]; MMP-3 (stromelysin-1) 
inhibitor I [IC 50 = 5 pM against MMP-3], and MMP-3 inhibitor E [Kj = 130 nM against MMP-3]; 

20 inhibitors available through Calbiochem, catalog # 444250, 444218, and 444225, respectively). 
Briefly, different concentrations of the small molecule MMP inhibitors are mixed with purified 
polypeptides of the invention (50ug/ml) in 22.9 uJ of lx HEPES buffer (50 mM HEPES, pH 7.5, 
0.2 M NaCl, 10 mM CaCl 2 , 25 pM ZnCl 2 and 0.05%Brij-35) and incubated at room temperature 
(24 °Q for 2-hr, then 7.1 fo.1 of substrate alpha-2-macroglobulin (0.2 unit/ml) is added and 

25 incubated at 37°C for 20-hr. The reactions are stopped by adding 4x sample buffer and boiled 
immediately for 5 minutes. After SDS-PAGE, the protein bands are visualized by silver stain. 

Synthetic Fluorogenic Peptide Substrates Cleavage Assay 

The substrate specificity for polypeptides of the invention with demonstrated 
30 metalloproteinase activity can be determined using synthetic fluorogenic peptide substrates 
(purchased from BACHEM Bioscience Inc). Test substrates include, M-1985, M-2225, M-2105, 
M-21 10, and M-2255. The first four are MMP substrates and the last one is a substrate of tumor 
necrosis factor-a (TNF-a) converting enzyme (TACE). All the substrates are prepared in 1:1 
dimethyl sulfoxide (DMSO) and water. The stock solutions are 50-500 pM. Fluorescent assays are 
35 performed by using a Perkin Elmer LS SOB luminescence spectrometer equipped with a constant 
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temperature water bath. The excitation X is 328 am and the emission X is 393 nm. Briefly, the 
assay is carried out by incubating 176 ul lx HEPES buffer (0.2 M NaCl, 10 mM CaCl 2 , 0.05% 
Brij-35 and 50 mM HEPES, pH 7.5) with 4 ul of substrate solution (50 jiM) at 25 °C for 15 
minutes, and then adding 20 \i\ of a purified polypeptide of the invention into the assay cuvett. The 
5 final concentration of substrate is 1 jiM. Initial hydrolysis rates are monitored for 30-min. 

Example 48: Characterization of the cDNA contained in a deposited plasmid 
The size of the cDNA insert contained in a deposited plasmid may be routinely determined 
using techniques known in the art, such as PCR amplification using synthetic primers hybridizable 

10 to the 3' and 5' ends of the cDNA sequence. For example, two primers of 17-30 nucleotides 
derived from each end of the cDNA (i.e., hybridizable to the absolute 5* nucleotide or the 3* 
nucleotide end of the sequence of SEQ ID NO:X, respectively) are synthesized and used to 
amplify the cDNA using the deposited cDNA plasmid as a template. The polymerase chain 
reaction is carried out under routine conditions, for instance, in 25 ul of reaction mixture with 0.5 

15 ug of the above cDNA template. A convenient reaction mixture is 1.5-5 mM MgCl 2 , 0.01% (w/v) 
gelatin, 20 uM each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 0.25 Unit of Taq 
polymerase. Thirty five cycles of PCR (denaturation at 94 degree C for 1 rnin; annealing at 55 
degree C for 1 min; elongation at 72 degree C for 1 rnin) are performed with a PerkhvElmer Cetus 
automated thermal cycler. The amplified product is analyzed by agarose gel electrophoresis. The 

20 PCR product is verified to be the selected sequence by subcloning and sequencing the DNA 
product.lt will be clear that the invention may be practiced otherwise than as particularly described 
in the foregoing description and examples. Numerous modifications and variations of the present 
invention are possible in light of the above teachings and, therefore, are within the scope of the 
appended claims. 

25 

Incorporation by Reference 

The entire disclosure of each document cited (including patents, patent applications, 
journal articles, abstracts, laboratory manuals, books, or other disclosures) in the Background of 
the Invention, Detailed Description, and Examples is hereby incorporated herein by reference. In 

30 addition, the sequence listing submitted herewith is incorporated herein by reference in its entirety. 
The specification and sequence listing of each of the following U.S. and PCT applications are 
herein incorporated by reference in their entirety: U.S. Appln. No. 60/040,162 filed on 07-Mar- 
1997, U.S. Appln. No. 60/043,576 filed on ll-Apr-1997, U.S. Appln. No. 60/047,601 filed on 23- 
May-1997, U.S. Appln. No. 60/056,845 filed on 22-Aug-1997, U.S. Appln. No. 60/043,580 filed 

35 on ll-Apr-1997, U.S. Appln. No. 60/047,599 filed on 23-May-1997, U.S. Appln. No. 60/056,664 
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filed on 22-Aug-1997, U.S. Appln. No. 60/043,314 filed on ll-Apr-1997, U.S. Appln. No. 
60/047,632 filed on 23-May-1997, U.S. Appln. No. 60/056,892 filed on 22-Aug-1997, U.S. Appln. 
No. 60/043,568 filed on ll-Apr-1997, U.S. Appln. No. 60/047,595 filed on 23-May-1997, U.S. 
Appln. No. 60/056,632 filed on 22-Aug-1997, U.S. Appln. No. 60/043,578 filed on ll-Apr-1997, 
5 U.S. Appln. No. 60/040333 filed on 07-Mar-1997, U.S. Appln. No. 60/043,670 filed on ll-Apr- 
1997, U.S. Appln. No. 60/047,596 filed on 23 -May- 1997, U.S. Appln. No. 60/056,864 filed on 22- 
Aug-1997, U.S. Appln. No. 60/043,674 filed on ll-Apr-1997, U.S. Appln. No. 60/047,612 filed 
on 23-May-1997, U.S. Appln. No. 60/056,631 filed on 22-Aug-1997, U.S. Appln. No. 60/043,569 
filed on ll-Apr-1997, U.S. Appln. No. 60/047,588 filed on 23-May-1997, U.S. Appln. No. 

10 60/056,876 filed on 22-Aug-1997, U.S. Appln. No. 60/043,671 filed on ll-Apr-1997, U.S. Appln. 
No. 60/043,311 filed on ll-Apr-1997, U.S. Appln. No. 60/038,621 filed on 07-Mar~1997, U.S. 
Appln. No. 60/043,672 filed on ll-Apr-1997, U.S. Appln. No. 60/047,613 filed on 23-May-1997, 
U.S. Appln. No. 60/056,636 filed on 22-Aug-1997, U.S. Appln. No. 60/043,669 filed on ll-Apr- 
1997, U.S. Appln. No. 60/047,582 filed on 23-May-1997, U.S. Appln. No. 60/056,910 filed on 22- 

15 Aug-1997, U.S. Appln. No. 60/043,315 filed on ll-Apr-1997, U.S. Appln. No. 60/047,598 filed 
on 23-May-1997, U.S. Appln. No. 60/056,874 filed on 22-Aug-1997, U.S. Appln. No. 60/043,312 
filed on ll-Apr-1997, U.S. Appln. No. 60/047,585 filed on 23-May-1997, U.S. Appln. No. 
60/056,881 filed on 22-Aug-1997, U.S. Appln. No. 60/043,313 filed on ll-Apr-1997, U.S. Appln. 
No. 60/047,586 filed on 23-May-1997, U.S. Appln. No. 60/056,909 filed on 22-Aug-1997, U.S. 

20 Appln. No. 60/040,161 filed on 07-Mar-1997, U.S. Appln. No. 60/047,587 filed on 23-May-1997, 
U.S. Appln. No. 60/056,879 filed on 22-Aug-1997, U.S. Appln. No. 60/047,500 filed on 23-May- 
1997, U.S. Appln. No. 60/056,880 filed on 22-Aug-1997, U.S. Appln. No. 60/047,584 filed on 23- 
May-1997, U.S. Appln. No. 60/056,894 filed on 22-Aug-1997, U.S. Appln. No. 60/047,492 filed 
on 23-May-1997, U.S. Appln. No. 60/056,911 filed on 22-Aug-1997, U.S. Appln. No. 60/040,626 

25 filed on 07-Mar-1997, U.S. Appln. No. 60/047,503 filed on 23-May-1997, U.S. Appln. No. 
60/056,903 filed on 22-Aug-1997, U.S. Appln. No. 60/047,501 filed on 23-May-1997, U.S. Appln. 
No. 60/056,637 filed on 22-Aug-1997, U.S. Appln. No. 60/047,590 filed on 23-May-1997, U.S. 
Appln. No. 60/056,875 filed on 22-Aug-1997, U.S. Appln. No. 60/047,581 filed on 23-May-1997, 
U.S. Appln. No. 60/056,882 filed on 22-Aug-1997, U.S. Appln. No. 60/047,592 filed on 23-May- 

30 1997, U.S. Appln. No. 60/056,888 filed on 22-Aug-1997, U.S. Appln. No. 60/040,334 filed on 07- 
Mar-1997, U.S. Appln. No. 60/047,618 filed on 23-May-1997, U.S. Appln. No. 60/056,872 filed 
on 22-Aug-1997, U.S. Appln. No. 60/047,617 filed on 23-May-1997, U.S. Appln. No. 60/056,662 
filed on 22-Aug-1997, U.S. Appln. No. 60/047,589 filed on 23-May-1997, U.S. Appln. No. 
60/056,862 filed on 22-Aug-1997, U.S. Appln. No. 60/047,594 filed on 23-May-1997, U.S. Appln. 

35 No. 60/056,884 filed on 22-Aug-1997, U.S. Appln. No. 60/047,583 filed on 23-May-1997, U.S. 
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Appln. No. 60/056,878 filed on 22-Aug-1997, U.S. Appln. No. 60/040,336 filed on 07-Mar-1997, 
U.S. Appln. No. 60/047,502 filed on 23-May-1997, U.S. Appln. No. 60/056,893 filed on 22-Aug- 
1997, U.S. Appln. No. 60/047,633 filed on 23-May-1997, U.S. Appln. No. 60/056,630 filed on 22- 
Aug-1997, U.S. Appln. No. 60/047,593 filed on 23-May-1997, U.S. Appln. No. 60/056,887 filed 
5 on 22-Aug-1997, U.S. Appln. No. 60/040,163 filed on 07-Mar-1997, U.S. Appln. No. 60/047,597 
filed on 23-May-1997, U.S. Appln. No. 60/056,889 filed on 22-Aug-1997, U.S. Appln. No. 
60/047,615 filed on 23-May-1997, U.S. Appln. No. 60/056,877 filed on 22-Aug-1997, U.S. Appln. 
No. 60/047,600 filed on 23-May-1997, U.S. Appln. No. 60/056,886 filed on 22-Aug-1997, U.S. 
Appln. No. 60/047,614 filed on 23-May-1997, U.S. Appln. No. 60/056,908 filed on 22-Aug-1997, 

10 U.S. Appln. No. 60/040,710 filed on 14-Mar-1997, U.S. Appln. No. 60/050,934 filed on 30-May- 
1997, U.S. Appln. No. 60/048,100 filed on 30-May-1997, U.S. Appln. No. 60/040,762 filed on 14- 
Mar-1997, U.S. Appln. No. 60/048,357 filed on 30-May-1997, U.S. Appln. No. 60/048,189 filed 
on 30-May-1997, U.S. Appln. No. 60/041,277 filed on 21-Mar-1997, U.S. Appln. No. 60/048,188 
filed on 30-May-1997, U.S. Appln. No. 60/048,094 filed on' 30-May-1997, U.S. Appln. No. 

15 60/048350 filed on 30-May-1997, U.S. Appln. No. 60/048,135 filed on 30-May-1997, U.S. 
Appln. No. 60/042,344 filed on 21-Mar-1997, U.S. Appln. No. 60/048,187 filed on 30-May-1997, 
U.S. Appln. No. 60/048,099 filed on 30-May-1997, U.S. Appln. No. 60/050,937 filed on 30-May- 
1997, U.S. Appln. No. 60/048,352 filed on 30-May-1997, U.S. Appln. No. 60/041,276 filed on 21- 
Mar-1997, U.S. Appln. No. 60/048,069 filed on 30-May-1997, U.S. Appln. No. 60/048,131 filed 

20 on 30-May-1997, U.S. Appln. No. 60/048,186 filed on 30-May-1997, U.S. Appln. No. 60/048,095 
filed on 30-May-1997, U.S. Appln. No. 60/041,281 filed on 21-Mar-1997, U.S. Appln. No. 
60/048355 filed on 30-May-1997, U.S. Appln. No. 60/048,096 filed on 30-May-1997, U.S. 
Appln. No. 60/048,351 filed on 30-May-1997, U.S. Appln. No. 60/048,154 filed on 30-May-1997, 
U.S. Appln. No. 60/048,160 filed on 30-May-1997, U.S. Appln. No. 60/042,825 filed on 08-Apr- 

25 1997, U.S. Appln. No. 60/048,070 filed on 30-May-1997, U.S. Appln. No. 60/042,727 filed on 08- 
Apr-1997, U.S. Appln. No. 60/048,068 filed on 30-May-1997, U.S. Appln. No. 60/042,726 filed 
on 08-Apr-1997, U.S. Appln. No. 60/048,184 filed on 30-May-1997, U.S. Appln. No. 60/042,728 
filed on 08-Apr-1997, U.S. Appln. No. 60/042,754 filed on 08-Apr-1997, U.S. Appln. No. 
60/048,190 filed on 30-May-1997, U.S. Appln. No. 60/044,039 filed on 30-May-1997, U.S. 

30 Appln. No. 60/048,093 filed on 30-May-1997, U.S. Appln. No. 60/048,885 filed on 06-Jun-1997, 
U.S. Appln. No. 60/057,645 filed on 05-Sep-1997, U.S. Appln. No. 60/049,375 filed on OMun- 
1997, U.S. Appln. No. 60/057,642 filed on 05-Sep-1997, U.S. Appln. No. 60/048,881 filed on 06- 
Jun-1997, U.S. Appln. No. 60/057,668 filed on 05-Sep-1997, U.S. Appln. No. 60/048,880 filed on 
06-Jun-1997, U.S. Appln. No. 60/057,635 filed on 05-Sep-1997, U.S. Appln. No. 60/048,896 filed 

35 on 06-Jun-1997, U.S. Appln. No. 60/057,627 filed on 05nSep-1997, U.S. Appln. No. 60/049,020 
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filed on 06-Jun-1997, U.S. Appln. No. 60/057,667 filed on 05-Sep-1997, U.S. Appln. No. 
60/048,876 filed on 06-Jun-1997, U.S. Appln. No. 60/057,666 filed on 05-Sep-1997, U.S. Appln. 
No. 60/048,895 filed on 06-Jun-1997, U.S. Appln. No. 60/057,764 filed on 05-Sep-1997, U.S. 
Appln. No. 60/048,884 filed on 06-Jun-1997, U.S. Appln. No. 60/057,643 filed on 05-Sep-1997, 
5 U.S. Appln. No. 60/048,894 filed on 06-Jun-1997, U.S. Appln. No. 60/057,769 filed on 05-Sep- 
1997, U.S. Appln. No. 60/048,971 filed on 06-Jun-1997, U.S. Appln. No. 60/057,763 filed on 05- 
Sep-1997, U.S. Appln. No. 60/048,964 filed on 06-Jun-1997, U.S. Appln. No. 60/057,650 filed on 

05- Sep-1997, U.S. Appln. No. 60/048,882 filed on 06-Jun-1997, U.S. Appln. No. 60/057,584 filed 
on 05-Sep-1997, U.S. Appln. No. 60/048,899 filed on 06-Jun-1997, U.S. Appln. No. 60/057,647 

10 filed on 05-Sep-1997, U.S. Appln. No. 60/048,893 filed on 06-Jun-1997, U.S. Appln. No. 

60/057,661 filed on 05-Sep-1997, U.S. Appln. No. 60/048,900 filed on 06-Jun-1997, U.S. Appln. 

No. 60/057,662 filed on 05-Sep-1997, U.S. Appln. No. 60/048,901 filed on 06-Jun-1997, U.S. 

Appln. No. 60/057,646 filed on 05-Sep-1997, U.S. Appln. No. 60/048,892 filed on 06~Jun-1997, 

U.S. Appln. No. 60/057,654 filed on 05-Sep~1997, U.S. Appln. No. 60/048,915 filed on 06-Jun- 
15 1997, U.S. Appln. No. 60/057,651 filed on 05-Sep-1997, U.S. Appln. No. 60/049,019 filed on 06- 

Jun-1997, U.S. Appln. No. 60/057,644 filed on 05-Sep-1997, U.S. Appln. No. 60/048,970 filed on 

06- Jun-1997, U.S. Appln. No. 60/057,765 filed on 05-Sep-1997, U.S. Appln. No. 60/048,972 filed 
on 06-Jun-1997, U.S. Appln. No. 60/057,762 filed on 05-Sep-1997, U.S. Appln. No. 60/048,916 
filed on 06Jun-1997, U.S. Appln. No. 60/057,775 filed on 05-Sep-1997, U.S. Appln. No. 

20 60/049,373 filed on 06-Jun-1997, U.S. Appln. No. 60/057,648 filed on 05-Sep-1997, U.S. Appln. 
No. 60/048,875 filed on 06-Jun-1997, U.S. Appln. No. 60/057,774 filed on 05-Sep-1997, U.S. 
Appln. No. 60/049,374 filed on 06-Jun-1997, U.S. Appln. No. 60/057,649 filed on 05^Sep-1997, 
U.S. Appln. No. 60/048,917 filed on 06-Jun-1997, U.S. Appln. No. 60/057,770 filed on 05-Sep- 
1997, U.S. Appln. No. 60/048,949 filed on 06-Jun-1997, U.S. Appln. No. 60/057,771 filed on 05- 

25 Sep-1997, U.S. Appln. No. 60/048,974 filed on 06-Jun-1997, U.S. Appln. No. 60/057,761 filed on 
05-Sep-1997, U.S. Appln. No. 60/048,883 filed on 06-Jun-1997, U.S. Appln. No. 60/057,760 filed 
on 05-Sep-1997, U.S. Appln. No. 60/048,897 filed on 06-Jun-1997, U.S. Appln. No. 60/057,776 
filed on 05-Sep-1997, U.S. Appln. No. 60/048,898 filed on 06-Jun-1997, U.S. Appln. No. 
60/057,778 filed on 05-Sep-1997, U.S. Appln. No. 60/048,962 filed on 06-Jun-1997, U.S. Appln. 

30 No. 60/057,629 filed on 05-Sep-1997, U.S. Appln. No. 60/048,963 filed on 06-Jun-1997, U.S. 
Appln. No. 60/057,628 filed on 05-Sep-1997, U.S. Appln. No. 60/048,877 filed on 06-Jun-1997, 
U.S. Appln. No. 60/057,777 filed on 05-Sep-1997, U.S. Appln. No. 60/048,878 filed on 06-Jun- 
1997, U.S. Appln. No. 60/057,634 filed on 05-Sep-1997, U.S. Appln. No. 60/049,608 filed on 13- 
Jun-1997, U.S. Appln. No. 60/058,669 filed on 12-Sep-1997, U.S. Appln. No. 60/049,566 filed on 

35 13-Jun-1997, U.S. Appln. No. 60/058,668 filed on 12-Sep-1997, U.S. Appln. No. 60/052,989 filed 
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on 13-Jun-1997, U.S. Appln. No. 60/058,750 filed on 12-Sep-1997, U.S. Appln. No. 60/049,607 
filed on 13-Jun-1997, U.S. Appln. No. 60/058,665 filed on 12-Sep-1997, U.S. Appln. No. 
60/049,611 filed on 13-Jun-1997, U.S. Appln. No. 60/058,971 filed on 12-Sep-1997, U.S. Appln. 
No. 60/050,901 filed on 13-Jun-1997, U.S. Appln. No. 60/058,972 filed on 12-Sep-1997, U.S. 
5 Appln. No. 60/049,609 filed on 13-Jun-1997, U.S. Appln. No. 60/058,975 filed on 12-Sep-1997, 
U.S. Appln. No. 60/048,356 filed on 30-May-1997, U.S. Appln. No. 60/056,296 filed on 29-Aug- 
1997, U.S. Appln. No. 60/048,101 filed on 30-May-1997, US. Appln. No. 60/056,293 filed on 29- 
Aug-1997, U.S. Appln. No. 60/050,935 filed on 30-May-1997, U.S. Appln. No. 60/056,250 filed 
on 29-Aug-1997, U.S. Appln. No. 60/049,610 filed on 13-Jun-1997, U.S. Appln. No. 60/061,060 

10 filed on 02-Oct-1997, U.S. Appln. No. 60/049,606 filed on 13-Jun-1997, U.S. Appln. No. 
60/060,841 filed on 02-Oct-1997, U.S. Appln. No. 60/049,550 filed on 13-Jun-1997, U.S. Appln. 
No. 60/060,834 filed on 02-Oct-1997, U.S. Appln. No. 60/049,549 filed on 13-Jun-1997, U.S. 
Appln. No. 60/060,865 filed on 02-Oct-1997, U.S. Appln. No. 60/049,548 filed on 13Jun-1997, 
U.S. Appln. No. 60/060,844 filed on 02-Oct-1997, U.S. Appln. No. 60/049,547 filed on 13-Jun- 

15 1997, U.S. Appln. No. 60/061,059 filed on 02-Oct-1997, U.S. Appln. No. 60/051381 filed on 01- 
Jul-1997, U.S. Appln. No. 60/058,598 filed on 12-Sep-1997, U.S. Appln. No. 60/051,480 filed on 
Ol-Jul-1997, U.S. Appln. No. 60/058,663 filed on 12-Sep-1997, U.S. Appln. No. 60/051,926 filed 
on 08-Jul-1997, U.S. Appln. No. 60/058,785 filed on 12-Sep-1997, U.S. Appln. No. 60/052,793 
filed on 08-Jul-1997, U.S. Appln. No. 60/058,664 filed on 12-Sep-1997, U.S. Appln. No. 

20 60/051,925 filed on 08-JuM997, U.S. Appln. No. 60/058,660 filed on 12-Sep-1997, U.S. Appln. 
No. 60/051,929 filed on 08-Jul-1997, U.S. Appln. No. 60/058,661 filed on 12-Sep-1997, U.S. 
Appln. No. 60/052,803 filed on 08 -Jul-1997, U.S. Appln. No. 60/055,722 filed on 18-Aug-1997, 
U.S. Appln. No. 60/052,732 filed on 08-Jul-1997, U.S. Appln. No. 60/055,723 filed on 18-Aug- 
1997, U.S. Appln. No. 60/051,932 filed on 08-Jul-1997, U.S. Appln. No. 60/055,948 filed on 18- 

25 Aug-1997, U.S. Appln. No. 60/051,931 filed on 08-M-1997, U.S. Appln. No. 60/055,949 filed on 
18-Aug-1997, U.S. Appln. No. 60/051,916 filed on 08-M-1997, U.S. Appln. No. 60/055,953 filed 
on i8-Aug-1997, U.S. Appln. No. 60/051,930 filed on 08-M-1997, U.S. Appln. No. 60/055,950 
filed on 18-Aug-1997, U.S. Appln. No. 60/051,918 filed on 08-M-1997, U.S. Appln. No. 
60/055,947 filed on 18-Aug-1997, U.S. Appln. No. 60/051,920 filed on 08-M-1997, U.S. Appln. 

30 No. 60/055,964 filed on 18-Aug-1997, U.S. Appln. No. 60/052,733 filed on 08-M-1997, U.S. 
Appln. No. 60/056,360 filed on 18-Aug-1997, U.S. Appln. No. 60/052,795 filed on 08-M-1997, 
U.S. Appln. No. 60/055,684 filed on 18-Aug-1997, U.S. Appln. No. 60/051,919 filed on 08-Jul- 
1997, U.S. Appln. No. 60/055,984 filed on 18-Aug-1997, U.S. Appln. No. 60/051,928 filed on 08- 
Jul-1997, U.S. Appln. No. 60/055,954 filed on 18-Aug-1997, U.S. Appln. No. 60/052,870 filed on 

35 16-M-1997, U.S. Appln. No. 60/055,952 filed on 18-Aug-1997, U.S. Appln. No. 60/052,871 filed 
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on 16-M-1997, U.S. Appln. No. 60/055,725 filed on 18-Aug-1997, U.S. Appln. No. 60/052,872 
filed on 16-M-1997, U.S. Appln. No. 60/056,359 filed on 18-Aug-1997, U.S. Appln. No. 
60/052,661 filed on 16-M-1997, U.S. Appln. No. 60/055,985 filed on 18-Aug-1997, U.S. Appln. 
No. 60/052,874 filed on 16-M-1997, U.S. Appln. No. 60/055,724 filed on 18-Aug-1997, U.S. 
5 Appln. No. 60/052,873 filed on 16-Jul-1997, U.S. Appln. No. 60/055,726 filed on 18-Aug-1997, 
U.S. Appln. No. 60/052,875 filed on 16-M-1997, U.S. Appln. No. 60/056,361 filed on 18-Aug- 
1997, U.S. Appln. No. 60/053,440 filed on 22-M-1997, U.S. Appln. No. 60/055,989 filed on 18- 
Aug-1997, U.S. Appln. No. 60/053,441 filed on 22-M-1997, U.S. Appln. No. 60/055,946 filed on 
18-Aug-1997, U.S. Appln. No. 60/053,442 filed on 22-JuM997, U.S. Appln. No. 60/055,683 filed 

10 on 18-Aug-1997, U.S. Appln. No. 60/054,212 filed on 30-M-1997, U.S. Appln. No. 60/055,968 
filed on 18-Aug-1997, U.S. Appln. No. 60/054,209 filed on 30-JuM997, U.S. Appln. No. 
60/055,972 filed on 18-Aug-1997, U.S. Appln. No. 60/054,234 filed on 30-M-1997, U.S. Appln. 
No. 60/055,969 filed on 18-Aug-1997, U.S. Appln. No. 60/055,386 filed on 05-Aug-1997, U.S. 
Appln. No. 60/055,986 filed on 18-Aug-1997, U.S. Appln. No. 60/054,807 filed on 05-Aug-1997 5 . 

15 U.S. Appln. No. 60/055,970 filed on 18-Aug-1997, U.S. Appln. No. 60/054,215 filed on 30-Jul- 
1997, U.S. Appln. No. 60/056,543 filed on 19-Aug-1997, U.S. Appln. No. 60/054,218 filed on 30- 
Jul-1997, U.S. Appln. No. 60/056,561 filed on 19-Aug-1997, U.S. Appln. No. 60/054,214 filed on 
30-Jul-1997, U.S. Appln. No. 60/056,534 filed on 19-Aug-1997, U.S. Appln. No. 60/054,236 filed 
on 30-M-1997, U.S. Appln. No. 60/056,729 filed on 19-Aug-1997, U.S. Appln. No. 60/054,213 

20 filed on 30-M-1997, U.S. Appln. No. 60/056,727 filed on 19-Aug~1997, U.S. Appln. No. 
60/054,211 filed on 30-M-1997, U.S. Appln. No. 60/056,554 filed on 19-Aug-1997, U.S. Appln. 
No. 60/054,217 filed on 30-M-1997, U.S. Appln. No. 60/056,730 filed on 19-Aug-1997, U.S. 
Appln. No. 60/055,312 filed on 05-Aug-1997, U.S. Appln. No. 60/056,563 filed on 19-Aug-1997, 
U.S. Appln. No. 60/055,309 filed on 05-Aug-1997, U.S. Appln. No. 60/056,557 filed on 19-Aug- 

25 1997, U.S. Appln. No. 60/055,310 filed on 05-Aug-1997, U.S. Appln. No. 60/056,371 filed on 19- 
Aug-1997, U.S. Appln. No. 60/054,798 filed on 05-Aug-1997, U.S. Appln. No. 60/056,732 filed 
on 19-Aug-1997, U.S. Appln. No. 60/056,369 filed on 19-Aug-1997, U.S. Appln. No. 60/056,535 
filed on 19-Aug-1997, U.S. Appln. No. 60/056,556 filed on 19-Aug-1997, U.S. Appln. No. 
60/056,555 filed on 19-Aug-1997, U.S. Appln. No. 60/054,806 filed on 05-Aug-1997, U.S. Appln. 

30 No. 60/056,366 filed on 19-Aug-1997, U.S. Appln. No. 60/054,809 filed on 05-Aug-1997, U.S. 
Appln. No. 60/056,364 filed on 19-Aug-1997, U.S. Appln. No. 60/054,804 filed on 05-Aug-1997, 
U.S. Appln. No. 60/056,370 filed on 19-Aug-1997, U.S. Appln. No. 60/054,803 filed on 05-Aug- 
1997, U.S. Appln. No. 60/056,731 filed on 19-Aug-1997, U.S. Appln. No. 60/055,311 filed on 05- 
Aug-1997, U.S. Appln. No. 60/056365 filed on 19-Aug-1997, U.S. Appln. No. 60/054,808 filed 

35 on 05-Aug-1997, U.S. Appln. No. 60/056,367 filed on 19-Aug-1997, U.S. Appln. No. 60/056,726 
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filed on 19-Aug-1997, U.S. Appln. No. 60/056,368 filed on 19-Aug-1997, U.S. Appln. No. 
60/056,728 filed on 19-Aug-1997, U.S. Appln. No. 60/056,628 filed on 19-Aug-1997, U.S. Appln. 
No. 60/056,629 filed on 19-Aug-1997, U.S. Appln. No. 60/056,270 filed on 29-Aug-1997, U.S. 
Appln. No. 60/056,271 filed on 29-Aug-1997, U.S. Appln. No. 60/056,247 filed on 29-Aug-1997, 
5 U.S. Appln. No. 60/056,073 filed on 29-Aug-1997, U.S. Appln. No. 60/057,669 filed on 05-Sep- 
1997, U.S. Appln. No. 60/057,663 filed on 05-Sep-1997, U.S. Appln. No. 60/057,626 filed on 05- 
Sep-1997, U.S. Appln. No. 60/058,666 filed on 12-Sep-1997, U.S. Appln. No. 60/058,973 filed on 
12-Sep-1997, U.S. Appln. No. 60/058,974 filed on 12-Sep-1997, U.S. Appln. No. 60/058,667 filed 
on 12-Sep-1997, U.S. Appln. No. 60/060,837 filed on 02-Oct-1997, U.S. Appln. No. 60/060,862 

10 filed on 02-Oct-1997, U.S. Appln. No. 60/060,839 filed on 02-Oct-1997, U.S. Appln. No. 
60/060,866 filed on 02-Oct-1997, U.S. Appln. No. 60/060,843 filed on 02-Oct-1997, U.S. Appln. 
No. 60/060,836 filed on 02-Oct-1997, U.S. Appln. No. 60/060,838 filed on 02-Oct-1997, U.S. 
Appln. No. 60/060,874 filed on 02Oct-1997, U.S. Appln. No. 60/060,833 filed on 02-Oct-1997, 
U.S. Appln. No. 60/060,884 filed on 02-Oct-1997, U.S. Appln. No. 60/060,880 filed on 02-Oct- 

15 1997, U.S. Appln. No. 60/061,463 filed on 09-Oct-1997, U.S. Appln. No. 60/061,529 filed on 09- 
Oct-1997, U.S. Appln. No. 60/071,498 filed on 09-Oct-1997, U.S. Appln. No. 60/061,527 filed on 
09-Oct-1997, U.S. Appln. No. 60/061,536 filed on 09-Oct-1997, U.S. Appln. No. 60/061432 filed 
on 09-Oct-1997, U.S. Appln. No. 60/063,099 filed on 24-Oct-1997, U.S. Appln. No. 60/063,088 
filed on 24-Oct-1997, U.S. Appln. No. 60/063,100 filed on 24-Oct-1997, U.S. Appln. No. 

20 60/063,387 filed on 24-Oct-1997, US. Appln. No. 60/063,148 filed on 24-Oct-1997, US. Appln. 
No. 60/063,386 filed on 24-Oct-1997, U.S. Appln. No. 60/062,784 filed on 24-Oct-1997, U.S. 
Appln. No. 60/063,091 filed on 24-Oct-1997, U.S. Appln. No. 60/063,090 filed on 24-Oct-1997, 
U.S. Appln. No. 60/063,089 filed on 24-Oct-1997, U.S. Appln. No. 60/063,092 filed on 24-Oct- 
1997, U.S. Appln. No. 60/063,111 filed on 24-Oct-1997, U.S. Appln. No. 60/063,101 filed on 24- 

25 Oct-1997, U.S. Appln. No. 60/063,109 filed on 24-Oct-1997, U.S. Appln. No. 60/063,1 10 filed on 
24-Oct-1997, U.S. Appln. No. 60/063,098 filed on 24-Oct-1997, U.S. Appln. No. 60/063,097 filed 
on 24-Oct-1997, U.S. Appln. No. 60/064,911 filed on 07-Nov-1997, U.S. Appln. No. 60/064,912 
filed on 07-Nov-1997, U.S. Appln. No. 60/064,983 filed on 07-Nov-1997, U.S. Appln. No. 
60/064,900 filed on 07-Nov-1997, U.S. Appln. No. 60/064,988 filed on 07-Nov-1997, U.S. Appln. 

30 No. 60/064,987 filed on 07-Nov-1997, U.S. Appln. No. 60/064,908 filed on 07-Nov-1997, U.S. 
Appln. No. 60/064,984 filed on 07-Nov-1997, U.S. Appln. No. 60/064,985 filed on 07-Nov-1997, 
U.S. Appln. No. 60/066,094 filed on 17-Nov-1997, U.S. Appln. No. 60/066,100 filed on 17-Nov- 
1997, U.S. Appln. No. 60/066,089 filed on 17-Nov-1997, U.S. Appln. No. 60/066,095 filed on 17- 
Nov-1997, U.S. Appln. No. 60/066,090 filed on 17-Nov-1997, U.S. Appln. No. 60/068,006 filed 

35 on 18-Dec-1997, U.S. Appln. No. 60/068,057 filed on 18-Dec-1997, U.S. Appln. No. 60/068,007 
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filed on 18-Dec-1997, U.S. Appln. No. 60/068,008 filed on 18-Dec-1997, U.S. Appln. No. 
60/068,054 filed on 18-Dec-1997, U.S. Appln. No. 60/068,064 filed on 18-Dec-1997, U.S. Appln. 
No. 60/068,053 filed on 18-Dec-1997, U.S. Appln. No. 60/070,923 filed on 18-Dec-1997, U.S. 
Appln. No. 60/068,365 filed on 19-Dec-1997, U.S. Appln. No. 60/068,169 filed on 19-Dec-1997, 
5 U.S. Appln. No. 60/068,367 filed on 19-Dec-1997, U.S. Appln. No. 60/068,369 filed on 19-Dec- 

1997, U.S. Appln. No. 60/068,368 filed on 19-Dec-1997, U.S. Appln. No. 60/070,657 filed on 07- 
Jan-1998, U.S. Appln. No. 60/070,692 filed on 07-Jan-1998, U.S. Appln. No. 60/070,704 filed on 
07-Jan-1998, U.S. Appln. No. 60/070,658 filed on 07-Jan-1998, U.S. Appln. No. 60/073,160 filed 
on 30-Jan-1998, U.S. Appln. No. 60/073,159 filed on 30-Jan-1998, U.S. Appln. No. 60/073,165 

10 filed on 30Jan-1998, U.S. Appln. No. 60/073,164 filed on 30Jan-1998, U.S. Appln. No. 
60/073,167 filed on 30-Jan-1998, U.S. Appln. No. 60/073,162 filed on 30-Jan-1998, U.S. Appln. 
No. 60/073,161 filed on 30-Jan-1998, U.S. Appln. No. 60/073,170 filed on 30-Jan-1998, U.S. 
Appln. No. 60/074,141 filed on 09-Feb-1998, U.S. Appln. No. 60/074,341 filed on 09-Feb-1998, 
U.S. Appln. No. 60/074,037 filed on 09-Feb-1998, U.S. Appln. No. 60/074,157 filed on 09-Feb- 

15 1998, U.S. Appln. No. 60/074,118 filed on 09-Fet>-1998, US. Appln. No. 60/076,051 filed on 26- 
Feb-1998, U.S. Appln. No. 60/076,053 filed on 26-Feb-1998, U.S. Appln. No. 60/076,054 filed on 
26-Feb-1998, U.S. Appln. No. 60/076,052 filed on 26-Feb-1998, U.S. Appln. No. 60/076,057 filed 
on 26-Feb-1998, U.S. Appln. No. 60/077,714 filed on 12-Mar-1998, U.S. Appln. No. 60/077,687 
filed on 12-Mar-1998, U.S. Appln. No. 60/077,686 filed on 12-Mar-1998, U.S. Appln. No. 

20 60/077,696 filed on 12-Mar-1998, U.S. Appln. No. 60/078,566 filed on 19-Mar-1998, U.S. Appln. 
No. 60/078,574 filed on 19-Mar-1998, U.S. Appln. No. 60/078,576 filed on 19-Mar-1998, U.S. 
Appln: No. 60/078,579 filed on 19-Mar-1998, U.S. Appln. No. 60/078,563 filed on 19-Mar-1998, 
U.S. Appln. No. 60/078,573 filed on 19-Mar-1998, U.S. Appln. No. 60/078,578 filed on 19-Mar- 

1998, U.S. Appln. No. 60/078,581 filed on 19-Mar-1998, U.S. Appln. No. 60/078,577 filed on 19- 
25 Mar-1998, U.S. Appln. No. 60/080,314 filed on 01-Apr-1998, U.S. Appln. No. 60/080,312 filed on 

Ol-Apr-1998, U.S. Appln. No. 60/080313 filed on Ol-Apr-1998, U.S. Appln. No. 60/085,180 filed 
on 12-May-1998, U.S. Appln. No. 60/085,105 filed on 12-May-1998, U.S. Appln. No. 60/085,094 
filed on 12-May-1998, U.S. Appln. No. 60/085,093 filed on 12-May-1998, U.S. Appln. No. 
60/085,924 filed on 18-May-1998, U.S. Appln. No. 60/085,906 filed on 18-May-1998, U.S. 

30 Appln. No. 60/085,927 filed on 18-May-1998, U.S. Appln. No. 60/085,920 filed on 18-May-1998, 
U.S. Appln. No. 60/085,928 filed on 18-May-1998, U.S. Appln. No. 60/085,925 filed on 18-May- 
1998, U.S. Appln. No. 60/085,921 filed on 18-May-1998, U.S. Appln. No. 60/085,923 filed on 18- 
May-1998, U.S. Appln. No. 60/085,922 filed on 18-May-1998, U.S. Appln. No. 60/090,112 filed 
on 22-Jun-1998, U.S. Appln. No. 60/089,508 filed on 16-Jun-1998, U.S. Appln. No* 60/089,507 

35 filed on 16-Jun-1998, U.S. Appln. No. 60/089,510 filed on 16-Jun-1998, U.S. Appln. No. 
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60/089,509 filed on 16-Jun-1998, U.S. Appln. No. 60/090,113 filed on 22-Jun-1998, U.S. Appln. 
No. 60/092,956 filed on 15-M-1998, U.S. Appln. No. 60/092,921 filed on 15-Jul-1998, U.S. 
Appln. No. 60/092,922 filed on 15-Jul-1998, U.S. Appln. No. 60/094,657 filed on 30-Jul-1998, 
U.S. Appln. No. 60/095,486 filed on 05-Aug-1998, U.S. Appln. No. 60/096,319 filed on 12-Aug- 
5 1998, U.S. Appln. No. 60/095,455 filed on 06-Aug-1998, U.S. Appln. No. 60/095,454 filed on 06- 
Aug-1998, U.S. Appln. No. 60/097,917 filed on 25-Aug-1998, U.S. Appln. No. 60/098,634 filed 
on 31-Aug-1998, U.S. Appln. No. 60/101,546 filed on 23-Sep-1998, U.S. Appln. No. 60/102,895 
filed on 02-Oct-1998, U.S. Appln. No. 60/108,207 filed on 12-Nov-1998, U.S. Appln. No. 
60/113,006 filed on 18-Dec-1998, U.S. Appln. No. 60/112,809 filed on 17-Dec-1998, U.S. Appln. 
10 No. 60/116,330 filed on 19-Jan-1999, U.S. Appln. No. 60/119,468 filed on 10-Feb-1999, U.S. 
Appln. No. 60/125,055 filed on 18-Mar-1999, U.S. Appln. No. 60/128,693 filed on 09-Apr-1999, 
U.S. Appln. No. 60/130,991 filed on 26-Apr-1999, U.S. Appln. No. 60/137,725 filed on 07-Jun- 

1999, U.S. Appln. No. 60/145,220 filed on 23-M-1999, U.S. Appln. No. 60/149,182 filed on 17- 
Aug-1999, U.S. Appln. No. 60/152,317 filed on 03-Sep-1999, U.S. Appln. No. 60/152,315 filed on 

15 03-Sep-1999, U.S. Appln. No. 60/155,709 filed on 24-Sep-1999, U.S. Appln. No. 60/163,085 filed 
on 02-Nov-1999, U.S. Appln. No. 60/172,411 filed on 17-Dec-1999, U.S. Appln. No. 60/162,239 
filed on 29-Oct-1999, U.S. Appln. No. 60/215,139 filed on 30-Jun-2000, U.S. Appln. No. 
60/162,211 filed on 29-Oct-1999, U.S. Appln. No. 60/215,138 filed on 30-Jun-2000, U.S. Appln. 
No. 60/162,240 filed on 29-Oct-1999, U.S. Appln. No. 60/215,131 filed on 30-Jun-2000, U.S. 

20 Appln. No. 60/162,237 filed on 29-Oct-1999, U.S. Appln. No. 60/219,666 filed on 21-Jul-2000, 
U.S. Appln. No. 60/162,238 filed on 29-Oct-1999, U.S. Appln. No. 60/215,134 filed on 3(Mun- 

2000, U.S. Appln. No. 60/163,580 filed on 05-Nov-1999, U.S. Appln. No. 60/215,130 filed on 30- 
Jun-2000, U.S. Appln. No. 60/163,577 filed on 05-Nov-1999, U.S. Appln. No. 60/215,137 filed on 
30-Jun-2000, U.S. Appln. No. 60/163,581 filed on 05-Nov-1999, U.S. Appln. No. 60/215,133 filed 

25 on 30-Jun-2000, U.S. Appln. No. 60/163,576 filed on 05-Nov-1999, U.S. Appln. No. 60/221,366 
filed on 27-M-2000, U.S. Appln. No. 60/164,344 filed on 09-Nov-1999, U.S. Appln. No. 
60/195,296 filed on O7-Apr-2000, U.S. Appln. No. 60/221,367 filed on 27Jul-2000, U.S. Appln. 
No. 60/164,835 filed on 12-Nov-1999, U.S. Appln. No. 60/221,142 filed on 27-Jul-2000, U.S. 
Appln. No. 60/164,744 filed on 12-Nov-1999, U.S. Appln. No. 60/215,140 filed on 30-Jun-2000, 

30 U.S. Appln. No. 60/164,735 filed on 12-Nov-1999, U.S. Appln. No. 60/221,193 filed on 27-Jul- 
2000, U.S. Appln. No. 60/164,825 filed on 12-Nov-1999, U.S. Appln. No. 60/222,904 filed on 03- 
Aug-2000, U.S. Appln. No. 60/164,834 filed on 12-Nov-1999, U.S. Appln. No. 60/224,007 filed 
on 04-Aug-2000, U.S. Appln. No. 60/164,750 filed on 12-Nov-1999, U.S. Appln. No. 60/215,128 
filed on 30-Jun-2000, U.S. Appln. No. 60/166,415 filed on 19-Nov-1999, U.S. Appln. No. 

35 60/215,136 filed on 30-Jun-2000, U.S. Appln. No. 60/166,414 filed on 19-Nov-1999, U.S. Appln. 
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No. 60/219,665 filed on 21-Jul-2000, U.S. Appln. No. 60/164,731 filed on 12-Nov-1999, U.S. 
Appln. No. 60/215,132 filed on 30-Jun-2000, U.S. Appln. No. 60/226,280 filed on 18-Aug-2000, 
U.S. Appln. No. 60/256,968 filed on 21-Dec-2000, U.S. Appln. No. 60/226,380 filed on 18-Aug- 

2000, U.S. Appln. No. 60/259,803 filed on 05-Jan-2001, U.S. Appln. No. 60/228,084 filed on 28- 
5 Aug-2000, U.S. Appln. No. 09/915,582 filed on 27-M-2001, U.S. Appln. No. 60/231,968 filed on 

12-Sep-2000, U.S. Appln. No. 60/236,326 filed on 29-Sep-2000, U.S. Appln. No. 60/234,211 filed 
on 20-Sep-200a U.S. Appln. No. 60/226,282 filed on 18-Aug-2000, U.S. Appln. No. 60/232,104 
filed on 12-Sep-2000, U.S. Appln. No. 60/234,210 filed on 20-Sep-2000, U.S. Appln. No. 
60/226,278 filed on 18-Aug-2000, U.S. Appln. No. 60/259,805 filed on 05-Jan~2001, U.S. Appln. 
. 10 No. 60/226,279 filed on 18-Aug-2000, U.S. Appln. No. 60/259,678 filed on 05-Jan-2001, U.S. 
Appln. No. 60/226,281 filed on 18-Aug-2000, U.S. Appln. No. 60/231,969 filed on 12-Sep-2000, 
U.S. Appln. No. 60/228,086 filed on 28-Aug-2000, U.S. Appln. No. 60/259,516 filed on 04-Jan- 

2001, U.S. Appln. No. 60/228,083 filed on 28-Aug-2000, U.S. Appln. No. 60/259,804 filed on 05- 
Jan-2001, U.S. Appln. No. 60/270,658 filed on 23-Fet>-2001, U.S. Appln. No. 60/304,444 filed on 

15 12-M-2001, U.S. Appln. No. 60/270,625 filed on 23-Feb-2001, U.S. Appln. No. 60/304,417 filed 
on 12-M-2001, U.S. Appln. No. 60/295,869 filed on 06-Jun-2001, U.S. Appln. No. 60/304,121 
filed on ll-Jul-2001, U.S. Appln. No. 60/311,085 filed on 10-Aug-2001, U.S. Appln. No. 
60/325,209 filed on 28-Sep-2001, U.S. Appln. No. 60/330,629 filed on 26-Oct-2G01, U.S. Appln. 
No. 60/331,046 filed on 07-Nov-2001, U.S. Appln. No. 60/358,554 filed on 22-Feb-2002, U.S. 

20 Appln. No. 60/358,714 filed on 25-Feb-2002, U.S. Appln. No. 60/277,340 filed on 21-Mar-2001, 
U.S. Appln. No. 60/306,171 filed on 19-M-2001, U.S. Appln. No. 60/278,650 filed on 27-Mar- 
2001, U.S. Appln. No. 60/331,287 filed on 13-Nov-2001, U.S. Appln. No. 09/950,082 filed on 12- 
Sep-2001, U.S. Appln. No. 09/950,083 filed on 12-Sep-2001, PCT Appln. No. US00/29363 filed 
on 25-Oct-2000, PCT Appln. No. US00/29360 filed on 25-Oct-2000, PCT Appln. No. 

25 USOO/29362 filed on 25-Oct-2000, PCT Appln. No. US00/29365 filed on 25-Oct-20Q0, PCT 
Appln. No. US00/29364 filed on 25-Oet-2000, PCT Appln. No. US00/30040 filed on 01-Nov- 
2000, PCT Appln. No. USOO/30037 filed on Ol-Nov-2000, PCT Appln. No. US00/30O45 filed on 
Ol-Nov-2000, PCT Appln. No. USOO/30036 filed on Ol-Nov-2000, PCT Appln. No. US00/30039 
filed on Ol-Nov-2000, PCT Appln. No. US00/30654 filed on 08-Nov-2000, PCT Appln. No. 

30 US00/30628 filed on 08-Nov-2000, PCT Appln. No. US00/30653 filed on 08-Nov-2000, PCT 
Appln. No. US00/30629 filed on 08-Nov-2000, PCT Appln. No. USOO/30679 filed on 08-Nov- 
2000, PCT Appln. No. US00/30674 filed on 08-Nov-2000, PCT Appln. No. US00/31162 filed on 
15-Nov-2000, PCT Appln. No. US00/31282 filed on 15-Nov-2000, PCT Appln. No. US00/30657 
filed on 08-Nov-2000, PCT Appln. No. US01/01396 filed on 17-Jan-2001, PCT Appln. No. 

35 US01/01387 filed on 17-Jan-2001, PCT Appln. No. US01/01567 filed on 17-Jan-2001, PCT 
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Appln. No. US01/01431 filed on 17-Jan-2001, PCT Appln. No. US01/01432 filed on 17-Jan-2Q01, 
PCT Appln. No. US01/00544 filed on 09-Jan-2001, PCT Appln. No. US01/01435 filed on 17-Jan- 
2001, PCT Appln. No. US01/01386 filed on 17-Jan-2001, PCT Appln. No. US01/01565 filed on 
17-Jan-2001, PCT Appln. No. US01/01394 filed on 17-Jan-2001, PCT Appln. No. US01/01434 
5 filed on 17-Jan-2001, PCT Appln. No. US01/01397 filed on 17-Jan-2001, PCT Appln. No. 
US01/01385 filed on 17-Jan-2001, PCT Appln. No. US01/01384 filed on 17-Jan-2001, PCT 
Appln. No. US01/01383 filed on 17-Jan-2001, PCT Appln. No. (Atty. Dkt. No. PS735; 
unassigned) filed on 21-Feb-2002, PCT Appln. No. (Atty. Dkt. No. PS736; unassigned) filed on 
21-Feb-2002, U.S. Appln. No. 09/148,545 filed on 04-Sep-1998, U.S. Appln. No. 09/621,011 filed 

10 on 20-Jul-2000, U.S. Appln. No. 09/981,876 filed on 19-Oct-2001, U.S. Appln. No. 09/149,476 
filed on 08-Sep-1998, U.S. Appln. No. 09/809,391 filed on 16-Mar-2001, U.S. Appln. No. 
09/882,171 filed on 18-Jun-2001, U.S. Appln. No. 60/190,068 filed on 17-Mar-2000, U.S. Appln. 
No. 09/152,060 filed on ll-Sep-1998, U.S. Appln. No. 09/852,797 filed on ll-May-2001, U.S. 
Appln. No. 09/853,161 filed on ll-May-2001, U.S. Appln. No. 09/852,659 filed on ll-May-2001, 

15 U.S. Appln. No. 10/058,993 filed on 30-Jan-2002, U.S. Appln. No. 60/265,583 filed on 02-Feb- 
2001, U.S. Appln. No. 09/154,707 filed on 17-Sep-1998, U.S. Appln. No. 09/966,262 filed on 01- 
Oct-2001, U.S. Appln. No. 09/983,966 filed on 26-Oct-2001, U.S. Appln. No. 10/059,395 filed on 
31-Jan-2002, U.S. Appln. No. 09/984,245 filed on 29-Oct-2001, U.S. Appln. No. 09/166,780 filed 
on 06-Oct-1998, U.S. Appln. No. 09/577,145 filed on 24-May-2000, U.S. Appln. No. 09/814,122 

20 filed on 22-Mar-2001, U.S. Appln. No. 09/189,144 filed on 10-Nov-1998, U.S. Appln. No. 
09/690,454 filed on 18-Oct-2000, U.S. Appln. No. (Atty. Dkt. No. PZ006G13A; unassigned) filed 
on 05-Feb-2002, U.S. Appln. No. 10/062,599 filed on 05-Feb-2002, U.S. Appln. No. 09/205,258 
filed on 04-Dec-1998, U.S. Appln. No. 09/933,767 filed on 22-Aug-2001, U.S. Appln. No. 
60/184,836 filed on 24-Feb-2000, U.S. Appln. No. 60/193,170 filed on 29-Mar-2000, U.S. Appln. 

25 No. 10/023,282 filed on 20-Dec-2001, U.S. Appln. No. 10/004,860 filed on 07-Dec-2001, U.S. 
Appln. No. 09/209,462 filed on ll-Dec-1998, U.S. Appln. No. 09/213,365 filed on 17-Dec-1998, 
U.S. Appln. No. 09/627,081 filed on 27-Jul-2000, U.S. Appln. No. 09/227,357 filed on 08-Jan- 
1999, U.S. Appln. No. 09/983,802 filed on 23-Oct-2001, U.S. Appln. No. 09/973,278 filed on 10- 
Oct-2001, U.S. Appln. No. 60/239,899 filed on 13-Oct-2000, U.S. Appln. No. 09/984,490 filed on 

30 30-Oct-2001 , U.S. Appln. No. 09/776,724 filed on 06-Feb-2001, U.S. Appln. No. 09/229,982 filed 
on 14-Jan-1999, U.S. Appln. No. 09/669,688 filed on 26-Sep-2000, U.S. Appln. No. 60/180,909 
filed on 08-Feb-2000, U.S. Appln. No. 09/236,557 filed on 26-Jan-1999, U.S. Appln. No. 
09/666,984 filed on 21-Sep-2000, U.S. Appln. No. 09/820,649 filed on 30-Mar-2001, U.S. Appln. 
No. 60/295,558 filed on 05-Jun-2001, U.S. Appln. No. 09/244,112 filed on 04-Feb-1999, U.S. 

35 Appln. No. 09/774,639 filed on Ol-Feb-2001, U.S. Appln. No. 09/969,730 filed on 04-Oct-2001, 
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U.S. Appln. No. 60/238,291 filed on 06-Oct-2000, U.S. Appln. No. 09/251329 filed on 17-Feb- 
1999, U.S. Appln. No. 09/716,128 filed on 17-Nov-2000, U.S. Appln. No. 09/257,179 filed on 25- 
Feb-1999, U.S. Appln. No. 09/729,835 filed on 06-Dec-2000, U.S. Appln. No. 09/262,109 filed on 
04-Max-1999, U.S. Appln. No. 09/722,329 filed on 28-Nov-2000, U.S. Appln. No. (Atty. Dkt. No. 
5 PZ016P1C1; unassigned) filed on 17Jan-2002, U.S. Appln. No. 60/262,066 filed on 18-Jan-2001, 
U.S. Appln. No. 09/281,976 filed on 31-Mar-1999, U.S. Appln. No. 09/288,143 filed on 08-Apr- 
1999, U.S. Appln. No. 09/984,429 filed on 30-Oct-2001, U.S. Appln. No. 60/244,591 filed on 01- 
Nov-2000, U.S. Appln. No. 09/296,622 filed on 23-Apr-1999, U.S. Appln. No. 09/305,736 filed 
on 05-May-1999, U.S. Appln. No. 09/818,683 filed on 28-Mar-2001, U.S. Appln. No. 09/974,879 

10 filed on 12-Oct-2001, U.S. Appln. No. 60/239,893 filed on 13-Oct-2000, U.S. Appln. No. 
09/334,595 filed on 17-Jun-1999, U.S. Appln. No. 09/348,457 filed on 07-M-1999, U.S. Appln. 
No. 09/739,907 filed on 20-Dec-2000, U.S. Appln. No. 09/938,671 filed on 27-Aug-2001, U.S. 
Appln. No. 09/363,044 filed on 29-M-1999, U.S. Appln. No. 09/813,153 filed on 21-Mar-2001, 
U.S. Appln. No. 09/949,925 filed on 12-Sep-2001, U.S. Appln. No. 60/232,150 filed on 12-Sep- 

15 2000, U.S. Appln. No. 09/369,247 filed on 05-Aug-1999, U.S. Appln. No. 10/062,548 filed on 05- 
Feb-2002, U.S. Appln. No. 09/382,572 filed on 25-Aug-1999, U.S. Appln. No. 09/716,129 filed on 
17-Nov-2000, U.S. Appln. No. 09/393,022 filed on 09-Sep-1999, U.S. Appln. No. 09/798,889 
filed on 06-Mar-2001, U.S. Appln. No. 09/397,945 filed on 17-Sep-1999, U.S. Appln. No. 
09/437,658 filed on 10-Nov-1999, U.S. Appln. No. 09/892,877 filed on 28-Jun-2001, U.S. Appln. 

20 No. 09/948,783 filed on lO^Sep-2001, U.S. Appln. No. 60/231,846 filed on ll-Sep-2000, U.S. 
Appln. No. 09/461,325 filed on 14-Dec-1999, U.S. Appln. No. 10/050,873 filed on 18-Jan-2002, 
U.S. Appln. No. 60/263,230 filed on 23-Jan-2001, U.S. Appln. No. 60/263,681 filed on 24-Jan- 
2001, U.S. Appln. No. 10/012,542 filed on 12~Dec-2001, U.S. Appln. No. 09/482,273 filed on 13- 
Jan-2000, U.S. Appln. No. 60/234,925 filed on 25-Sep-2000, U.S. Appln. No. 09/984,276 filed on 

25 29-Oct-2001, U.S. Appln. No. 09/984,271 filed on 29-Oct-2001, U.S. Appln. No. 09/489,847 filed 
on 24-Jan-2000, U.S. Appln. No. 60/350,898 filed on 25-Jan-2002, U.S. Appln. No. 09/511,554 
filed on 23-Feb-2000, U.S. Appln. No. 09/739,254 filed on 19-Dec-2000, U.S. Appln. No. 
09/904,615 filed on 16-M-2001, U.S. Appln. No. 10/054,988 filed on 25-Jan-2002, U.S. Appln. 
No. 09/531,119 filed on 20-Mar-2000, U.S. Appln. No. 09/820,893 filed on 30-Mar-2001, U.S. 

30 Appln. No. 09/565,391 filed on 05-May-2000, U.S. Appln. No. 09/948,820 filed on 10-Sep-2001, 
U.S. Appln. No. 09/591,316 filed on 09-Jun-2000, U.S. Appln. No. 09/895,298 filed on 02-Jul- 
2001, U.S. Appln. No. 09/618,150 filed on 17-M-2Q00, U.S. Appln. No. 09/985,153 filed on 01- 
Nov-2001, U.S. Appln. No. 09/628,508 filed on 28-Jul-2000, U.S. Appln. No. 09/997,131 filed on 
30-Nov-2001, U.S. Appln. No. 09/661,453 filed on 13-Sep-2000, U.S. Appln. No. 10/050,882 

35 filed on 18-Jan-2002, U.S. Appln. No. 09/684,524 filed on 10-Oct-2000, U.S. Appln. No. 
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10/050,704 filed on 18-Jan-2002, U.S. Appln. No. 09/726,643 filed on 01-Dec-2000, U.S. Appln. 
No. 10/042,141 filed on ll-Jan-2002, U.S. Appln. No. 09/756,168 filed on 09-Jan-2001, U.S. 
Appln. No. 09/781,417 filed on 13-Feb-2001, U.S. Appln. No. (Atty. Dkt. No. PZ042P1C1; 
unassigned) filed on Ol-Feb-2002, U.S. Appln. No. 09/789,561 filed on 22-Feb-2001, U.S. Appln. 
5 No. 09/800,729 filed on 08-Mar-2001, U.S. Appln. No. 09/832,129 filed on ll-Apr-2001, PCT 
Appln. No.US98/04482 filed on 06-Mar-1998, PCT Appln. No.US98/04493 filed on 06-Mar-1998, 
PCT Appln. No.US98/04858 filed on 12-Mar-1998, PCT Appln. No.US98/05311 filed on 19-Mar- 
1998, PCT Appln. No.US98/06801 filed on 07-Apr-1998, PCT Appln. No.US98/10868 filed on 
28-May-1998, PCT Appln. No.US98/11422 filed on 04-Jun-1998, PCT Appln. No.US01/05614 

10 filed on 21-Feb-2001, PCT Appln. No.US98/12125 filed on ll-Jun-1998, PCT Appln. 
No.US98/13608 filed on 30-Jun-1998, PCT Appln. No.US98/13684 filed on 07-Jul-1998, PCT 
Appln. No.US98/14613 filed on 15-Jul-1998, PCT Appln. No.US98/15949 filed on 29-M-1998, 
PCT Appln. No.US98/16235 filed on 04-Aug-1998, PCT Appln. No.US98/17044 filed on 18-Aug- 
1998, PCT Appln. No.US98/17709 filed on 27-Aug-1998, PCT Appln. No.US98/18360 filed on 

15 03-Sep-1998, PCT Appln. No.(Atty. Dkt. No. PZ016PCT2; unassigned) filed on 17-Jan-2002, 
PCT Appln. No.US98/20775 filed on 01-Oct-1998, PCT Appln. No.US98/21142 filed on 08-Oct- 

1998, PCT Appln. No.US98/22376 filed on 23-Oct-1998, PCT Appln. No.US98/23435 filed on 
04-Nov-1998, PCT Appln. No.US98/27059 filed on 17-Dec-1998, PCT Appln. No.US99/00108 
filed on 06-Jan-1999, PCT Appln. No.US99/01621 filed on 27-Jan-1999, PCT Appln. 

20 NO.US99/02293 filed on 04-Feb-1999, PCT Appln. No.US99/03939 filed on 24-Feb-1999, PCT 
Appln. No.US99/05721 filed on ll-Mar-1999, PCT Appln. No.US99/05804 filed on 18-Mar-1999, 
PCT Appln. No.US99/09847 filed on 06-May-1999, PCT Appln. No.US99/13418 filed on 15-Jun- 

1999, PCT Appln. No.US99/15849 filed on 14-Jul-1999, PCT Appln. No.US01/00911 filed on 12- 
Jan-2001, PCT Appln. No.US01/29871 filed on 24-Sep-2001, PCT Appln. No.US99/17130 filed 

25 on 29-Jul-1999, PCT Appln. No.US99/19330 filed on 24-Aug-1999, PCT Appln. No.US99/22012 
filed on 22-Sep~1999> PCT Appln. No.US99/26409 filed on 09-Nov-1999, PCT Appln. 
No.US99/29950 filed on 16-Dec-1999, PCT Appln. No.US00/00903 filed on 18-Jan-2000, PCT 
Appln. No.US00/03062 filed on 08-Feb-2000, PCT Appln. No.US(M)/06783 filed on 16-Mar-2000, 
PCT Appln. No.US00/08979 filed on 06-Apr-2000, PCT Appln. No.USOO/15187 filed on 02-Jun- 

30 2000, PCT Appln. No.US00/19735 filed on 20-Jul-2000, PCT Appln. No,US00/22325 filed on 16- 
Aug-2000, PCT Appln. No.US00/24008 filed on 31-Aug-2000, PCT Appln. No.US00/26013 filed 
on 22-Sep-2000, PCT Appln. No.US00/28664 filed on 17-Oct-2Q00, US Appln. No. 09/833,245 
filed on 12-Apr-2001, and PCT Appln. No. US01/11988 filed on 12-Apr-2001. 
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Applicant's File 
Reference Number: 



PS906PCT 



International Application 

Number: Unassigned 



INDICATIONS RELATING TO DEPOSITED BIOLOGICAL MATERIAL 

(PCT Rule 13bis) 

A. The indications made below relate to the deposited biological material referred to in Table 1 A 

of the description. 

B. IDENTIFICATION OF DEPOSIT : Further deposits are identified 

on an additional sheet: 

Name of Depository: American Type Culture Collection 
Address of Depository: 10801 University Boulevard 

Manassas, Virginia 20110-2209 

United States of America 
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EUROPE 

In respect of those designations in which a European Patent is sought a sample of the 
deposited microorganism will be made available until the publication of the mention of the 
grant of the European patent or until the date on which the application has been refused or 
withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert 
nominated by the person requesting the sample (Rule 28(4) EPC). 

CANADA 

The applicant requests that, until either a Canadian patent has been issued on the basis of an 
application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the Commissioner of Patents only authorizes the furnishing of 
a sample of the deposited biological material referred to in the application to an independent 
expert nominated by the Commissioner, the applicant must, by a written statement, inform the 
International Bureau accordingly before completion of technical preparations for publication 
of the international application. 

NORWAY 

The applicant hereby requests that the application has been laid open to public inspection (by 
the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available 
to the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has 
been filed by the applicant, any request made by a third party for the furnishing of a sample 
shall indicate the expert to be used That expert may be any person entered on the list of 
recognized experts drawn up by the Norwegian Patent Office or any person approved by the 
applicant in the individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall 
only be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of 
the application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the National Board of Patents and Regulations), or has been finally decided 
upon by the National Board of Patents and Registration without having been laid open to 
public inspection, the furnishing of a sample shall only be effected to an expert in the art. 
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UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only 
be made available to an expert. The request to this effect must be filed by the applicant with 
the International Bureau before the completion of the technical preparations for the 
international publication of the application. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Danish Patent Office), or has been finally decided upon by the Danish 
Patent office without having been laid open to public inspection, the fimiishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the Danish Patent Office not later that at the time when the application is made 
available to the public under Sections 22 and 33(3) of the Danish Patents Act. If such a 
request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list 
of recognized experts drawn up by the Danish Patent Office or any person by the applicant in 
the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public 
inspection (by the Swedish Patent Office), or has been finally decided upon by the Swedish 
Patent Office without having been laid open to public inspection, the furnishing of a sample 
shall only be effected to an expert in the art. The request to this effect shall be filed by the 
applicant with the International Bureau before the expiration of 1 6 months from the priority 
date (preferably on the Form PCT/RO/1 34 reproduced in annex Z of Volume I of the PCT 
Applicant's Guide). If such a request has been filed by the applicant any request made by a 
third party for the furnishing of a sample shall indicate the expert to be used. That expert may 
be any person entered on a list of recognized experts drawn up by the Swedish Patent Office 
or any person approved by a applicant in the individual case. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in the 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 
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What Is Claimed Is: 

1. Use of a polypeptide for the preparation of a diagnostic or pharmaceutical composition 
for diagnosing or treating a gastrointestinal disorder, wherein said polypeptide comprises an amino 
acid sequence at least 95% identical to a sequence selected from the group consisting of: 

(a) a full length polypeptide of SEQ ID NO: Y or a full length polypeptide encoded by 
the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in 
Table 1 A; - 

(b) a predicted secreted form of SEQ ID NO:Y or a secreted form of the polypeptide 
encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as 
referenced in Table 1 A; 

(c) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A; 

(d) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1 A, wherein said fragment has biological activity; 

(e) a polypeptide domain of SEQ JD NO: Y as referenced in Table IB; 

(f) a polypeptide domain of SEQ ID NO:Y as referenced in Table 2; and 

(g) a predicted epitope of SEQ ID NO:Y as referenced in Table IB. 

2. Use of the polypeptide of claim 1, wherein said wherein said polypeptide comprises a 
heterologous amino acid sequence. 

3. Use of a polypeptide for the preparation of a diagnostic or pharmaceutical composition 
for diagnosing or treating a gastrointestinal disorder, wherein said polypeptide comprises an amino 
acid sequence selected from the group consisting of: 

(a) a full length polypeptide of SEQ ID NO: Y or a full length polypeptide encoded by 
the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in 
Table 1A; 

(b) a predicted secreted form of SEQ ID NO: Y or a secreted form of the polypeptide 
encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as 
referenced in Table 1 A; 
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(c) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A; 

(d) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A, wherein said fragment has biological activity; 

(e) a polypeptide domain of SEQ ID NO:Y as referenced in Table IB; 

(f) a polypeptide domain of SEQ ID NO: Y as referenced in Table 2; and 

(g) a predicted epitope of SEQ ID NO:Y as referenced in Table IB. 

4. Use of the polypeptide of claim 3, wherein said polypeptide comprises a heterologous 
amino acid sequence. 

5. Use of an antibody or fragment thereof for the preparation of a diagnostic or 
pharmaceutical composition for diagnosing or treating a gastrointestinal disorder, wherein said 
antibody or fragment thereof binds a polypeptide comprising an amino acid sequence at least 95% 
identical to a sequence selected from the group consisting of: 

(a) a full length polypeptide of SEQ ID NO: Y or a full length polypeptide encoded by 
the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in 
Table 1A; 

(b) a predicted secreted form of SEQ ID NO: Y or a secreted form of the polypeptide 
encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as 
referenced in Table 1 A; 

(c) a polypeptide fragment of SEQ ID NO:Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A; 

(d) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1 A, wherein said fragment has biological activity; 

(e) a polypeptide domain of SEQ ID NO: Y as referenced in Table IB; 

(f) a polypeptide domain of SEQ ID NO: Y as referenced in Table 2; and 

(g) a predicted epitope of SEQ ID NO:Y as referenced in Table IB. 
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6. Use of an antibody or fragment thereof for the preparation of a diagnostic or 
pharmaceutical composition for diagnosing or treating a gastrointestinal disorder, wherein said 
antibody or fragment thereof binds a polypeptide selected from the group consisting of: 

(a) a full length polypeptide of SEQ ID NO: Y or a full length polypeptide encoded by 
the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in 
Table 1A; 

(b) a predicted secreted form of SEQ ID NO:Y or a secreted form of the polypeptide 
encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as 
referenced in Table 1A; 

(c) a polypeptide fragment of SEQ ID NO:Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A; 

(d) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1 A, wherein said fragment has biological activity; 

(e) a polypeptide domain of SEQ ID NO: Y as referenced in Table IB; 

(f) a polypeptide domain of SEQ ID NO: Y as referenced in Table 2; and 

(g) a predicted epitope of SEQ ID NO:Y as referenced in Table IB. 

7. Use of a nucleic acid molecule for the preparation of a diagnostic or pharmaceutical 
composition for diagnosing or treating a gastrointestinal disorder, wherein said nucleic acid 
molecule comprises a polynucleotide sequence at least 95% identical to a sequence selected from 
the group consisting of: 

(a) a polynucleotide fragment of SEQ ID NO:X as referenced in Table 1 A; 

(b) a polynucleotide encoding a full length polypeptide of SEQ ID NO:Y or a full 
length polypeptide encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ 
ID NO:Y as referenced in Table 1A; 

(c) a polynucleotide encoding a predicted secreted form of SEQ ID NO: Y or a secreted 
form of the polypeptide encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to 
SEQ ED NO:Y as referenced in Table 1 A; 

(d) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a 
polypeptide fragment encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to 
SEQ ID NO: Y as referenced in Table 1 A; 
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(e) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a 
polypeptide fragment encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to 
SEQ ID NO:Y as referenced in Table 1A, wherein said fragment has biological activity; 

(f) a polynucleotide encoding a polypeptide domain of SEQ ID NO: Y as referenced in 

Table IB; 

(g) a polynucleotide encoding a polypeptide domain of SEQ ID NO:Y as referenced in 
Table 2; and 

(h) a polynucleotide encoding a predicted epitope of SEQ ID NO:Y as referenced in 

Table IB. 

* 

8. Use of the nucleic acid molecule of claim 7, wherein said nucleic acid molecule 
comprises a heterologous polynucleotide sequence. 

9. Use of a nucleic acid molecule for the preparation of a diagnostic or pharmaceutical 
composition for diagnosing or treating a gastrointestinal disorder, wherein said nucleic acid 
molecule comprises a polynucleotide sequence selected from the group consisting of: 

(a) a polynucleotide fragment of SEQ ID NO:X as referenced in Table 1 A; 

(b) a polynucleotide encoding a full length polypeptide of SEQ ID NO:Y or a full 
length polypeptide encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ 
ID NO: Y as referenced in Table 1A; 

(c) a polynucleotide encoding a predicted secreted form of SEQ ID NO:Y or a secreted 
form of the polypeptide encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to 
SEQ ID NO:Y as referenced in Table 1A; 

(d) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a 
polypeptide fragment encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to 
SEQ ID NO: Y as referenced in Table 1A; 

(e) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a 
polypeptide fragment encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to 
SEQ ID NO:Y as referenced in Table 1A, wherein said fragment has biological activity; 

(f) a polynucleotide encoding a polypeptide domain of SEQ ID NO:Y as referenced in 

Table IB; 

(g) a polynucleotide encoding a polypeptide domain of SEQ ID NO: Y as referenced in 
Table 2; and 

(h) a polynucleotide encoding a predicted epitope of SEQ ID NO:Y as referenced in 

Table IB. 
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10. Use of the nucleic acid molecule of claim 9, wherein said nucleic acid molecule 
comprises a heterologous polynucleotide sequence. 

11. Use of an agonist or antagonist for the preparation of a pharmaceutical composition 
for treating a gastrointestinal disorder, wherein said agonist or antagonist binds a polypeptide 
comprising an amino acid sequence at least 95% identical to a sequence selected from the group 
consisting of: 

(a) a full length polypeptide of SEQ ID NO: Y or a full length polypeptide encoded by the 
cDNA Clone ID in ATCC Deposit No:Z coiresponding to SEQ ID NO:Y as referenced in Table 
1A; 

(b) a predicted secreted form of SEQ ID NO:Y or a secreted form of the polypeptide 
encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as 
referenced in Table 1 A; 

(c) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A; 

(d) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1 A, wherein said fragment has biological activity; 

(e) a polypeptide domain of SEQ ID NO:Y as referenced in Table IB; 

(f) a polypeptide domain of SEQ ID NO:Y as referenced in Table 2; and 

(g) a predicted epitope of SEQ ID NO: Y as referenced in Table IB. 

12. Use of an agonist or antagonist for the preparation of a pharmaceutical composition 
for treating a gastrointestinal disorder, wherein said agonist or antagonist binds a polypeptide 
selected from the group consisting of: 

(a) a full length polypeptide of SEQ ID NO:Y or a full length polypeptide encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A; 

(b) a predicted secreted form of SEQ ID NO:Y or a secreted form of the polypeptide 
encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as 
referenced in Table 1 A; 
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(c) a polypeptide fragment of SEQ ID NO:Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO.Y as referenced in Table 
1A; 

(d) a polypeptide fragment of SEQ ID NO:Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A } wherein said fragment has biological activity; 

(e) a polypeptide domain of SEQ ID NO:Y as referenced in Table IB; 

(f) a polypeptide domain of SEQ ID NO: Y as referenced in Table 2; and 

(g) a predicted epitope of SEQ ID NO:Y as referenced in Table IB. 

13. A polypeptide comprising an amino acid sequence at least 95% identical to a sequence 
selected from the group consisting of: 

(a) a full length polypeptide of SEQ ID NO:Y or a full length polypeptide encoded by 
the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in 
Table 1A; 

(b) a predicted secreted form of SEQ ID NO:Y or a secreted form of the polypeptide 
encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as 
referenced in Table 1 A; 

(c) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A; 

(d) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A, wherein said fragment has biological activity; 

(e) a polypeptide domain of SEQ ID NO:Y as referenced in Table IB; 

(f) a polypeptide domain of SEQ ID NO:Y as referenced in Table 2; and 

(g) a predicted epitope of SEQ ID NO: Y as referenced in Table IB. 

14. The polypeptide of claim 13, wherein said polypeptide comprises a heterologous 
amino acid sequence. 

15. Use of the polypeptide of claim 13 for identifying a binding partner comprising: 

(a) contacting the polypeptide of claim 13 with a binding partner, and 

(b) determining whether the binding partner increases or decreases activity of the 
polypeptide. 
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16. A polypeptide comprising an amino acid sequence selected from the group consisting 

of: 

(a) a full length polypeptide of SEQ ID NO: Y or a full length polypeptide encoded by 
the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in 
Table 1A; 

(b) a predicted secreted form of SEQ ID NO: Y or a secreted form of the polypeptide 
encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as 
referenced in Table 1 A; 

(c) a polypeptide fragment of SEQ ID NO:Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A; 

(d) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1 A, wherein said fragment has biological activity; 

(e) a polypeptide domain of SEQ ID NO: Y as referenced in Table IB; 

(f) a polypeptide domain of SEQ ID NO:Y as referenced in Table 2; and 

(g) a predicted epitope of SEQ ID NO:Y as referenced in Table IB. 

17. The polypeptide of claim 16, wherein said polypeptide comprises a heterologous 
polypeptide sequence. 

18. Use of the polypeptide of claim 16 for identifying a binding partner comprising: 

(a) contacting the polypeptide of claim 16 with a binding partner, and 

(b) detenrining whether the binding partner increases or decreases activity of the 
polypeptide. 

19. An antibody or fragment thereof that binds a polypeptide comprising an amino acid 
sequence at least 95% identical to a sequence selected from the group consisting of: 

(a) a full length polypeptide of SEQ ID NO: Y or a full length polypeptide encoded by 
the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in 
Table 1A; 

(b) a predicted secreted form of SEQ ID NO: Y or a secreted form of the polypeptide 
encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as 
referenced in Table 1 A; 
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(c) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A; 

(d) a polypeptide fragment of SEQ ED NO:Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1 A, wherein said fragment has biological activity; 

(e) a polypeptide domain of SEQ ID NO:Y as referenced in Table IB; 

(f) a polypeptide domain of SEQ ID NO: Y as referenced in Table 2; and 

(g) a predicted epitope of SEQ ID NO: Y as referenced in Table IB. 

20. An antibody or fragment thereof that binds a polypeptide selected from the group 
consisting of: 

(a) a full length polypeptide of SEQ ID NO: Y or a full length polypeptide encoded by 
the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in 
Table 1A; 

(b) a predicted secreted form of SEQ ID NO:Y or a secreted form of the polypeptide 
encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as 
referenced in Table 1A; 

(c) a polypeptide fragment of SEQ ID NO: Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1A; 

(d) a polypeptide fragment of SEQ ID NO:Y or a polypeptide fragment encoded by the 
cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ ID NO:Y as referenced in Table 
1 A, wherein said fragment has biological activity; 

(e) a polypeptide domain of SEQ ID NO:Y as referenced in Table IB; 

(f) a polypeptide domain of SEQ ID NO: Y as referenced in Table 2; and 

(g) a predicted epitope of SEQ ID NO:Y as referenced in Table IB. 

21. A nucleic acid molecule comprising a polynucleotide sequence at least 95% identical 
to a sequence selected from the group consisting of: 

(a) a polynucleotide fragment of SEQ ID NO:X as referenced in Table 1 A; 

(b) a polynucleotide encoding a full length polypeptide of SEQ ID NO:Y or a full 
length polypeptide encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ 
ID NO:Y as referenced in Table I A; 
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(c) a polynucleotide encoding a predicted secreted form of SEQ ID NO:Y or a secreted 
form of the polypeptide encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to 
SEQ ID NO:Y as referenced in Table 1A; 

(d) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a 
polypeptide fragment encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to 
SEQ ID NO:Y as referenced in Table 1A; 

(e) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a 
polypeptide fragment encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to 
SEQ ID NO:Y as referenced in Table 1A, wherein said fragment has biological activity; 

(f) a polynucleotide encoding a polypeptide domain of SEQ ID NO: Y as referenced in 

Table IB; 

(g) a polynucleotide encoding a polypeptide domain of SEQ ID NO:Y as referenced in 
Table 2; and 

(h) a polynucleotide encoding a predicted epitope of SEQ ID NO:Y as referenced in 

Table IB. 

22. The nucleic acid molecule of claim 21, wherein said nucleic acid molecule comprises 
a heterologous polynucleotide sequence. 

23. A recombinant vector comprising the nucleic acid molecule of claim 21. 

24. A recombinant vector comprising the nucleic acid molecule of claim 22. 

25. A recombinant host cell comprising the recombinant vector of claim 23. 

26. A recombinant host cell comprising the recombinant vector of claim 24. 

27. A nucleic acid molecule comprising a polynucleotide sequence selected from the 
group consisting of: 

(a) a polynucleotide fragment of SEQ ID NO:X as referenced in Table 1 A; 

(b) a polynucleotide encoding a full length polypeptide of SEQ ID NO:Y or a full 
length polypeptide encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to SEQ 
ID NO:Y as referenced in Table 1 A; 



750 



WO 02/076488 



PCT/US02/08276 



(c) a polynucleotide encoding a predicted secreted form of SEQ ID NO:Y or a secreted 
form of the polypeptide encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to 
SEQ ID NO:Y as referenced in Table 1 A; 

(d) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a 
polypeptide fragment encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to 
SEQ ID NO:Y as referenced in Table 1 A; 

(e) a polynucleotide encoding a polypeptide fragment of SEQ ID NO:Y or a . 
polypeptide fragment encoded by the cDNA Clone ID in ATCC Deposit No:Z corresponding to 
SEQ ID NO: Y as referenced in Table 1 A, wherein said fragment has biological activity; 

(f) a polynucleotide encoding a polypeptide domain of SEQ ID NO:Y as referenced in 

Table IB; 

(g) a polynucleotide encoding a polypeptide domain of SEQ ID NO:Y as referenced in 
Table 2; and 

(h) a polynucleotide encoding a predicted epitope of SEQ ID NO: Y as referenced in 

Table IB. 

28. The nucleic acid molecule of claim 27, wherein said nucleic acid molecule comprises 
a heterologous polynucleotide sequence. 

r 

29. A recombinant vector comprising the nucleic acid molecule of claim 27. 

30. A recombinant vector comprising the nucleic acid molecule of claim 28. 

31. A recombinant host cell comprising the recombinant vector of claim 29. 

32. A recombinant host cell comprising the recombinant vector of claim 30. 
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Sequence List 

<110> Human Genome Sciences, Inc. 

<120> Human Secreted Proteins 

<130> PS906PCT 

<150> US 60/331,287 
<151> 2001-11-13 

<150> US 60/306,171 
<151> 2001-07-19 

<150> US 60/277,340 
<151> 2001-03-21 

<160> 650 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 733 
<212> DNA 
<213> Homo 

<400> 1 
gggatccgga 
aattcgaggg 
tctcccggac 
tcaagttcaa 
aggagcagta 
ggctgaatgg 
agaaaaccat 
catcccggga 
atccaagcga 
ccacgcctcc 
acaagagcag 
acaaccacta 
gactctagag 

<210> 2 
<211> 5 
<212> PRT 
<213> Homo 

<220> 

<221> Site 
<222> (3) 

<223> Xaa equals any amino acid 

• 

<400> 2 

Trp Ser Xaa Trp Ser 
1 5 

<210> 3 
<211> 86 
<212> DNA 

<213> Artificial Sequence 
<220> 



sapiens 



gcccaaatct tctgacaaaa ctcacacatg 
tgcaccgtca gtcttcctct tccccccaaa 
tcctgaggtc acatgcgtgg tggtggacgt 
ctggtacgtg gacggcgtgg aggtgcataa 
caacagcacg taccgtgtgg tcagcgtcct 
caaggagtac aagtgcaagg tctccaacaa 
ctccaaagcc aaagggcagc cccgagaacc 
tgagctgacc aagaaccagg tcagcctgac 
catcgccgtg gagtgggaga gcaatgggca 
cgtgctggac tccgacggct ccttcttcct 
gtggcagcag gggaacgtct tctcatgctc 
cacgcagaag agcctctccc tgtctccggg 
gat 



sapiens 



cccaccgtgc 


ccagcacctg 


60 


acccaaggac 


accctcatga 


120 


aagccacgaa 


gaccctgagg 


180 


tgccaagaca 


aagccgcggg 


240 


caccgtcctg 


caccaggact 


300 


agccctccca 


acccccatcg 


360 


acaggtgtac 


accctgcccc 


420 


ctgcctggtc 


aaaggcttct 


480 


gccggagaac 


aactacaaga 


540 


ctacagcaag 


ctcaccgtgg 


600 


cgtgatgcat 


gaggctctgc 


660 


taaatgagtg 


cgacggccgc 


720 






733 



1 



WO 02/076488 



PCT/US02/08276 



<221> Primer_Bind 

<223> Synthetic sequence with 4 tandem copies of the GAS binding site 
found in the IRFl promoter (Rothman et al., Immunity 1:457-468 
(1994)), 18 nucleotides complementary to the SV40 early promoter, 
and a Xho I restriction site. 

<400> 3 

gcgcctcgag atttccccga aatctagatt tccccgaaat gatttccccg aaatgatttc 60 
cccgaaatat ctgccatctc aattag 86 

<210> 4 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> PrimerJBind 

<223> Synthetic sequence complementary to the SV40 promter; includes a 
Hind III restriction site. 

<400> 4 

gcggcaagct ttttgcaaag cctaggc 

<210> 5 
<211> 271 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> Protein_Bind 
<223> Synthetic promoter for use 

binding sites found in the 

1:457-468 (1994)). 

<400> 5 

ctcgagattt ccccgaaatc tagatttccc 
aaatatctgc catctcaatt agtcagcaac 
gcccctaact ccgcccagtt ccgcccattc 
ttatgcagag gccgaggccg cctcggcctc 
ttttggaggc ctaggctttt gcaaaaagct 

<210> 6 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> Primer_Bind 
<223> Synthetic primer complementary to human genomic EGR-1 promoter 

sequence (Sakamoto et al., Oncogene 6:867-871 (1991)); includes a 

Xho I restriction site. 

<400> 6 

gcgctcgagg gatgacagcg atagaacccc gg 32 

<210> 7 
<211> 31 
<212> DNA 

<213> Artificial Sequence 



27 



in biological assays; includes GAS 
IRFl promoter (Rothman et al.. Immunity 



cgaaatgatt tccccgaaat gatttccccg 60 

catagtcccg cccctaactc cgcccatccc 120 

tccgccccat ggctgactaa ttttttttat 180 

tgagctattc cagaagtagt gaggaggctt 240 

t 271 



<220> 
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<221> Primer_Bind 

<223> Synthetic primer complementary to human genomic EGR-1 promoter 

sequence (Sakamoto et al., Oncogene 6:867-871 (1991)); includes a 
Hind III restriction site. 

<400> 7 

gcgaagcttc gcgactcccc ggatccgcct c 31 



<210> 8 
<211> 12 
<212> DNA 

<213> Homo sapiens 
<400> 8 

ggggactttc cc 12 



<210> 9 
<211> 73 
<212> DNA 

<213> Artificial Sequence 



<220> 

<221> Primer_Bind 

<223> Synthetic primer with 4 tandem copies of the NF-KB binding site 

(GGGGACTTTCCC) , 18 nucleotides complementary to the 5' end of the 
SV40 early promoter sequence, and a Xhol restriction site. 

<400> 9 

gcggcctcga ggggactttc ccggggactt tccggggact ttccgggact ttccatcctg 60 
ccatctcaat tag 73 



<210> 10 
<211> 256 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<221> Protein_Bind 

<223> Synthetic promoter for use in biological assays; includes NF-KB 
binding sites. 



<400> 10 

ctcgagggga ctttcccggg gactttccgg ggactttccg ggactttcca tctgccatct 60 

caattagtca gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc 120 

cagttccgcc cattctccgc cccatggctg actaattttt tttatttatg cagaggccga 180 

ggccgcctcg gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg 240 

cttttgcaaa aagctt 256 



<210> 11 
<211> 2703 
<212> DNA 

<213> Homo sapiens 
<400> 11 

ggcacgagat ttcctacagg tgaaacgcca tcattaggat tcactgtaac gttagtgcta 60 
ttaaactcac tagcattttt attaatggcc gttatctaca ctaagctata ctgcaacttg 120 
gaaaaagagg acctctcaga aaactcacaa tctagcatga ttaagcatgt cgcttggcta 180 
atcttcacca attgcatctt tttctgccct gtggcgtttt tttcatttgc accattgatc 240 
actgcaatct ctatcagccc cgaaataatg aagtctgtta ctctgatatt ttttccattg 300 
cctgcttgcc tgaatccagt cctgtatgtt ttcttcaacc caaagtttaa agaagactgg 360 
aagttactga agcgacgtgt taccaagaaa agtggatcag tttcagtttc catcagtagc 420 
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caaggtggtt gtctggaaca ggatttctac tacgactgtg gcatgtactc acatttgcag 480 

ggcaacctga ctgtttgcga ctgctgcgaa tcgtttcttt taacaaagcc agtatcatgc 540 

aaacacttga taaaatcaca cagctgtcct gcattggcag tggcttcttg ccaaagacct 600 

gagggctact ggtccgactg tggcacacag tcggcccact ctgattatgc agatgaagaa 660 

gattcctttg tctcagacag ttctgaccag gtgcaggcct gtggacgagc ctgcttctac 720 

cagagtagag gattcccttt ggtgcgctat gcttacaatc taccaagagt taaagactga 780 

actactgtgt gtgtaaccgt ttcccccgtc aaccaaaatc agtgtttata gagtgaaccc 840 

tattctcatc tttcatctgg gaagcacttc tgtaatcact gcctggtgtc acttagaaga 900 

aggagaggtg gcagtttatt tctcaaacca gtcattttca aagaacaggt gcctaaatta 960 

taaattggtg aaaaatgcaa tgtccaagca atgtatgatc tgtttgaaac aaatatatga 1020 

cttgaaaagg atcttaggtg tagtagagca atataatgtt agttttttct gatccataag 1080 

aagcaaattt atacctattt gtgtattaag cacaagataa agaacagctg ttaatatttt 1140 

ttaaaaatct attttaaaat gtgattttct ataactgaag aaaatatctt gctaatttta 1200 

cctaatgttt catccttaat ctcaggacaa cttactgcag ggccaaaaaa gggactgtcc 1260 

cagctagaac tgtgagagta tacataggca ttactttatt atgttttcac ttgccatcct 1320 

tgacataaga gaactataaa ttttgtttaa gcaatttata aatctaaaac ctgaagatgt 1380 

ttttaaaaca atattaacag ctgttaggtt aaaaaaatag ctggacattt gttttcagtc 1440 

attatacatt gctttggtcc aatcagtaat tttttcttaa gtgttttgtg attacactac 1500 

tagaaaaaaa gtaaaaggct aattgctgtg tgggtttagt cgatttggct aaactactaa 1560 

ctaatgtggg ggtttaatag tatctgaggg atttggtggc ttcatgtaat gttctcatta 1620 

atgaatactt cctaatatcg ttggctctac taatattttc caatttgctg ggatgtcacc 1680 

tagcaatagc ttggattata tagaaagtaa actgtggtca atacttgcat ttaattagac 1740 

gaaacgggga gtaattatga cacgaagtac ttatgtttat ttcttagtga gctggattat 1800 

cttgaacctg tgctattaaa tggaaatttc catacatctt ccccatacta ttttttataa 1860 

aagagcctat tcaatagctc agaggttgaa ctctggttaa acaagataat atgttattaa 1920 

taaaaataga agaagaaaga ataaagctta gtcctgtgtc tttaaaaatt aaaaatttta 1980 

cttgattccc atctatgggc tttagaccta ttactgggtg gagtcttaaa gttataattg 2040 

ttcaatatgt tttttgaaca gtgtgctaaa tcaatagcaa acccactgcc atattagtta 2100 

ttctgaatat actaaaaaaa tccagctaga ttgcagttta ataattaaac tgtacatact 2160 

gtgcatataa tgaattttta tcttatgtaa attattttta gaacacaagt tgggaaatgt 2220 

ggcttctgtt catttcgttt aattaaagct acctcctaaa ctatagtggc tgccagtagc 2280 

agactgttaa attgtggttt atatactttt tgcattgtaa atagtctttg ttgtacattg 2340 

tcagtgtaat aaaaacagaa tctttgtata tcaaaatcat gtagtttgta taaaatgtgg 2400 

gaaggattta tttacagtgt gttgtaattt tgtaaggcca actatttaca agttttaaaa 2460 

attgctatca tgtatattta cacatctgat aaatattaaa tcataacttg gtaagaaact 2520 

cctaattaaa aggttttttc caaaattcag gttattgaaa atttttcatt ttattcattt 2580 

aaaaactaga ataacagata tataaaagtg ttaatctttg tgctatatgg tatgaaatac 2640 

aatattgtac tcagtgtttt gaattattaa agtttctaga aagcaaaaaa aaaaaaaaaa 2700 

aaa 2703 



<210> 12 
<211> 760 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<222> (13) . . (13) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 

<222> (300) . . (300) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (425) . . (425) 
<223> n equals a,t,g, or c 



<40O> 12 
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cggacgcgtg ggncggacgc gtggggaaaa aataacaaaa caaaaaacaa gaaaaaaaaa 60 

acacaaaacc ccgtaaaatc acaaagaaaa tccaacacca aaggcgcaga agccggctgg 120 

ccgtggtggg ggcagcgtag gcgtasatcc ctctcctctc acttagcctg ttgactcttg 180 

ttattatcat gatattcaca aaacgccgca tgtttaaaaa gtcatagatg tcatcttctc 240 

tctgccccca gggaggaaag ccaccttctc ttgccccttg gcccctttgt caggggccan 300 

gggtctgccg ggtgggggtg ccaacaggcc tggccctttc ctcccctgca tccagccatg 360 

ggggcctctg cgattgccgg aaggttgcat ggctggtccc agggccagca caggcccgag 420 

gccgngctgc ctggttttat ttttatttaa ctttattttc tgttttatga gtgtgtgtcc 480 

gcccaccccc acccccttca gtgttaagtg gggagccctg ggggagtctc tcctgcctcc 540 

cagcctctcc caagacctcc cccctcgtca ccagccatcc ctctggacca ggcagagggc 600 

ggaccgggtg ggcaggggcc tgagggtggc tcgggccagc ccaccagcca atggacccct 660 

cctcaggccg ccagtgtcgc cctgcccctt tttaaaacaa aatgccctcg tttgtaaacc 720 

cttagacgct tgagaataaa ccccttcctt ttcttccaaa 760 

<210> 13 

<211> 1445 

<212> DNA 

<213> Homo sapiens 

<400> 13 • 

ggcacgaggg atttgaacaa gatcattaga attcaaaaaa caccagaaat gaaagatctt 60 

tcctgaagct gtttaggaat attcatgata tacccttaac tgttctagag aacaaaatgc 120 

gtctgtgctc cttcacaaaa gtccctatga atttgtttct caatgtgatc cttcttaagt 180 

tctataactt tttgttttca ttaattttag gaaaatcctg ccttgcttcg ttgggcctat 240 

gcaagaacaa taaatgtcta tcctaatttc agacccactc ctaaaaactc actcatggga 300 

gctctgtgtg gatttgggcc cctcatcttc atttattata ttatcaaaac tgagagggta 360 

agtattcaga ccagatgttt agtatttgag tgataggttc actttctagg gaccagctgc 420 

agctccttct cttgaagatt gccaccagtg cccctcccac cttggggctg tcctctgcct 480 

tcccttcctc tcttctttta tctttattcc tttccagcag gagttaaaac agaaagtttt 540 

cagtcacctt tgtctatttt tgttagttca tttgtttttt aaaaagatga tgtttattgg 600 

gttaagtatt agcagaatac ataaatcatt tagtacgttt cctgtttgcg tgaattctat 660 

ttatgttggt cacattttgc aaattaatgt taaaacctat taatactcta cgggacagag 720 

aagcacaagc tgcctgtgtg gggaatagct gccgtcagca gcctgggtat atgattggag 780 

agaaagtcaa gctgatcttt ggcaccaaac cattccacat ctggtactaa accctgagct 840 

gcagccccca ggcttgtgtt gccactggag cccactcgtc tagctttgtc tttaactggc 900 

ccatctgcat tcccattaga gttcgtgtat tttgattatc tggtgaatga tctacttaac 960 

agaaaggtag tccacatttt cccagaaagt gtttgcattt tgctttcaat atatggtttt 1020 

atgggataat atatttctaa tgactaaaat gtgagtaaga tgtttttgaa taggagcatt 1080 

ttcttactgt gtctttagtt cctcggatta ctgtttcttc gcacactccc tgggctttag 1140 

acagtgggat tgcaattagg tttggagtgt ttcattctgt ttgtcagttg tacggtgggt 1200 

tgtgccaaaa tgcagttttt cttacctttt ttatttattt atttttatct aatatagcca 1260 

actggcagaa tatattgtct ttaatgtact ttttttctgt ctttacagga taggaaagaa 1320 

aaacttatcc aggaaggaaa attggatcga acatttcacc tctcatatta agtctggcaa 1380 

tgatgactat atgtattcct gcctaaataa atcatctatt aatcattaaa aaaaaaaaaa 1440 

aaaaa 1445 

<210> 14 

<211> 1333 

<212> DKA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 
<222> (411) . . (411) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (1264) . . (1264) 
<223> n equals a,t,g, or c 
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<220> 

<221> misc__f eature 
<222> (1319) . . (1319) 
<223> n equals a,t,g, or c 

<400> 14 

cctggaacac ttcaacaacc agtatccagc cgcagaggtg gtgaactttg gcacctggtt 60 

cctcttcagc ttccccatat ccctcatcat gctggtggtc agctggttct ggatgcactg 120 

gctgttcctg ggctgcaatt ttaaagagac ctgctctctg agcaagaaga agaagaccaa 180 

aagggaacag ttgtcagaga agagsmtcca agaagaatat gaaaaactgg gagacattag 240 

ctacccagaa atggtgactg gwtttttctt catcctgatg accgtactgt ggtttamccg 300 

ggagcctggc tttgtccctg gctgggattc tttctttgaa aagaaaggct accgtactga 360 

tgccacagtc tctgtcttcc ttggcttcct cctcttcctc attccagcga nagaagccct 420 

gctttgggaa aaagaatgat ggagagaacc aggagcactc actkgggacc gagcccatca 480 

tcacgtggaa ggacttccag aagaccatgc cctgggagat tgtcattctg gttgggggag 540 

gctatgctct ggcttctggt agcaagagct ctggcctctc tacatggatt gggaaccaga 600 

tgttgtccct gagcagcctc ccaccgtggg ctgtcaccct gctggcatgc atcctcgtgt 660 

ccattgtcac tgagtttgtg agcaacccag caaccatcac catcttcctg cccatcctgt 720 

gcagcctgtc tgaaacgctg cacattaacc ccctctacac cctgatccca gtcaccatgt 780 

gcatctcctt tgcagtgatg ctgcctgtgg gcaatccccc taatgccatc gtcttcagct 840 

atgggcactg ccagatcaaa gatatggtga aagctggcct gggagtcaac gttattggac 900 

tggtgatagt aatggtggcc atcaacacct ggggagttag cctcttccac ctggacactt 960 

acccagcatg ggcgagggtc agcaacatca ctgatcaagc ctaacgccaa gtgtacaaac 1020 

tggcccaacc acaggagctg ccagtatcca gcagtatctg gaccacaggc aaagaaaacc 1080 

actaggacca ccaggagcac acaaccccag acccacgccg gagggcatcc ctccaccaga 1140 

agattccgcc acctcaagtg aactgcagga atcctccaac aaccacaaac acat^cttcg 1200 

ctgttagtgt cttcttcctg ccctcagcac cacagctcaa gaaaacctaa agtttcaata 1260 

caanccatag gctcacaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaana 1320 

aaaaaaaaaa aaa 1333 

<210> 15 
<211> 751 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> (1) . . (1) 

<223> n equals a,t,g, or c 

<400> 15 

natcattttc tgtcccctcc tatcttaggc tgaccggttc cctgatgtgt tacctgcttc 60 

tgctactgat ccaaactgca gaacttctca ttcatcccca aggcctccag gcagtatcca 120 

atggggaatc agctctaaaa ggaaccagac caacgttttc cagccccttc attctggtga 180 

ctgaggggag gaaagaatgg gagggggtat tcttgtctag tggatggaaa ggaaacacac 240 

tgtcaaatta ctatatctcc ttggttttct attacagtag aattctccag ccatattttt 300 

attgtctatg ggggaagttg gagatggtga ccttgattag aagtgtctgg agggggataa 360 

atggagggga taagattcag ttggttttgg aaaatgttaa agtcttaaaa taatgcgtcc 420 

atctgaagaa ttttttctaa aaccagagtt tataaaaata tcactgatac agcctgcccc 480 

ctcatttccc tgccacagga gatgtcttgg actagagaca cttgtttaat aatagcttgt 540 

ctctgatatt cccagtagct tccctctgtg tgaggaaagg atagaaatgt tcaggacatc 600 

atcatacagg ctcctcatct acaaagttcc agtagcagtg acgcctacac ggaagacttg 660 

gaactgcaaa caggctgggg tcacctcagt gacatctgac gctgtccaac cagaagttcg 720 

atttttgttc tgggggtgaa ggaggaaaca g 751 

<210> 16 
<211> 2849 
<212> DNA 

<213> Homo sapiens 
<220> 



6 



ft 

♦ 



<221> misc_f eature 

<222> (1) . . (1) 

<223> n equals a,t,g, or c 

<400> 16 

ngggctgcaa ggacctgagc tcagcttccg ccccagccag ggaagcggca ggggaaagca 60 

ccggctccag gccagcgtgg gccgctctct cgctcggtgc ccgccgccat gtgggccgtc 120 

ctgaggttag ccctgcggcc gtgtgcccgc gcctctcccg ccgggccgcg cgcctatcac 180 

ggggactcgg tggcctcgct gggcacccag ccggacttgg gctctgccct ctaccaggag 240 

aactacaagc agatgaaagc actagtaaat cagctccatg aacgagtgga gcatataaaa 300 

ctaggaggtg gtgagaaagc ccgagcactt cacatatcaa gaggaaaact attgcccaga 360 

gaaagaattg acaatctcat agacccaggg tctccatttc tggaattatc ccagtttgca 420 

ggttaccagt tatatgacaa tgaggaggtg ccaggaggtg gcattattac aggcattgga 480 

agagtatcag gagtagaatg catgattatt gccaatgatg ccaccgtcaa aggaggtgcc 540 

tactacccag tgactgtgaa aaaacaatta cgggcccaag aaattgccat gcaaaacagg 600 

ctcccctgca tctacttagt tgattcggga ggagcatact tacctcgaca agcagatgtg 660 

tttccagatc gagaccactt tggccgtaca ttctataatc aggcaattat gtcttctaaa 720 

aatattgcac agatcgcagt ggtcatgggc tcctgcaccg caggaggagc ctatgtgcct 780 

gccatggctg atgaaaacat cattgtacgc aagcagggta ccattttctt ggcaggaccc 840 

cccttggtta aagcggcaac tggggaagaa gtatctgctg aggatcttgg aggtgctgat 900 

cttcattgca gaaagtctgg agtaagtgac cactgggctt tggatgatca tcatgccctt 960 

cacttaacta ggaaggttgt gaggaatcta aattatcaga agaaattgga tgtcaccatt 1020 

gaaccttctg aagagccttt atttcctgct gatgaattgt atggaatagt tggtgctaac 1080 

cttaagagga gctttgatgt ccgagaggtc attgctagaa tcgtggatgg aagcagattc 1140 

actgagttca aagcctttta tggagacaca ttagttacag gatttgctcg aatatttggg 1200 

tacccagtag gtatcgttgg aaacaacgga gttctctttt ctgaatctgc aaaaaagggt 1260 

actcactttg tccagttatg ctgccaaaga aatattcctc tgctgttcct tcaaaacatt 1320 

actggattta tggttggtag agagtatgaa gctgaaggaa ttgccaagga tggtgccaag 1380 

atggtggccg ctgtggcctg tgcccaagtg cctaagataa ccctcatcat tgggggctcc 1440 

tatggagccg gaaactatgg gatgtgtggc agagcgtata gcccaagatt tctctacatt 1500 

tggccaaatg ctcgtatctc agtgatggga ggagagcagg cagccaatgt gttggccacg 1560 

ataacaaagg accaaagagc ccgggaagga aagcagttct ccagtgctga tgaagcggct 1620 

ttaaaagagc ccatcattaa gaagtttgaa gaggaaggaa acccttacta ttccagcgca 1680 

agggtatggg atgatgggat cattgatcca gcagacacca gactggtctt gggtctcagt 1740 

tttagtgcag ccctcaacgc accaatagag aagactgact tcggtatctt caggatgtaa 1800 

ctggaataaa ggatgttttc tgttggacat gtactgaaaa ttaacacatg tagtagcctt 1860 

aaaattttag acttctcgaa catgaggctg ttacagtaat ttttttaaca ctgtgcattg 1920 

tacttttcta ccttaaaaaa atcagtgagg atatttattt aatgaacatc aattcctttt 1980 

aaattttctt agagaaattt ctctgtggct cagttttacc acccataaag cggagacagt 2040 

aatttatggt tatcctttct gacccacaaa gtatgaaaag ttctgtaatc tgtaaactca 2100 

gttctgtaat ctgtattatt gagatgatta atataaagtt gtattttcac tgaaactgat 2160 

tgtcattgct gagttatgct atggtgatac tttcacgcgg gctataattt tatgacaagg 2220 

catctgttac ttcagctggc cataaagtgc cctcaacact gctgtgcaga ccatcaccac 2280 

cattcatctc cagaattggt actcagtacc aaactgcaaa ctatagcgat tcaaacaaca 2340 

gtgctcccca atttcaccac taccacgagc ccatgacatc tactatatat catactgagt 2400 

ctgagaaagc gatgtaacca caatagacac aaaccagaca atacgaaaga ataaatacac 2460 

gactccaccg aagcaacaca acacccagaa gatcaaacaa gactatgcgc caccacacca 2520 

cccaaacagc aggaacaaag agaacctaag aaaaacacca caaacaagaa tctacacaac 2580 

aaacgcaaca tcaacaagat agccaaggat agactacaca aaaagccata gaggaacctc 2640 

ggactggtca ccaagagcag aacaaccgac aacacacaat acagtacaaa caagacaaac 2700 

gctggacaag cgagtaaaaa gatgaagaaa aaacgaagaa tcaacaacag gagaagtata 2760 

tgaaaacaac accaacgcaa gaacacaaca gtctgaacga cagagacaaa tactaaaacg 2820 

acaaacgtca gctactatta accgaaaaa 2849 

<210> 17 
<211> 755 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> raise feature 
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<222> (1)..(1) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (733) . . (734) 
<223> n equals a,t,g, or c 

<400> 17 

natttcccgt tcagttattc cggtgacact atagaaggta cgcctgcagg taccggtccg 60 

gaattcccgg gtcgacccac gcgtccgaac tcctgaaaca gtgaggacat ctcacagacc 120 

agacaggagc tggggctctg catctcacag cggtgcctgt cagacaggaa gaagtcccgc 180 

agaagtggcg tgtgggtcag ggcctgcacg atgcagttca tgaagcatgt gttcccaagg 240 

ttgatcagcc cacgcagacc tatggtgcag ttcgaggtga tctttctcct tttcgggttg 300 

tgcttcagca gttcaagctc ccgtttggtt ggttcccaag ttgaaaactt ctctccaacg 360 

ccttgcattt tccaagcttt tcgctgctcc tccttggcga ttatttccat gtctttgtca 420 

tagatgtagt cctggcacag aaaacagtag atgcctccgt acatcagatc aatggccagg 480 

ttgtgccgct tcgccttcgc atgctcgtga atatgcttct ttgtgaaaca gccgaagaag 540 

acacagtaga ggcaggaatg cagcctgttg aggtggacgc cacagacatg gcagatacag 600 

gacttggcct tgcgcttgcg ggcctcagcc gtgccgctcc acacgaagca ctggtagatg 660 

gcccgcaggt tctgcttcca gttgtccacc ttgaagctgc ccaggtgcga gcagcccggc 720 

ggcgctaccg ccnnctcggc gtccatggcc tcgcc 755 

<210> 18 

<211> 4129 

<212> DNA 

<213> Homo sapiens 

<400> 18 

ccacgcgtcc gctttttctc aggatgaata ttttcctggc cgactcattg atccttggta 60 

caaataaact tctggaagac ccagagagag gaaaacacag gagaaattga gcgatgtacg 120 

tacatcaaat accactactc ctcagcaacc atccccagga acctcacttt caatatcacg 180 

aagaccatcc gtcaggatga gtggcatgcc ctacacctgc gcagaatgac ggctggcttc 240 

atgggcatgg cggtggccat catcctcttt ggctggatca tcggcgtgct gggctgctgc 300 

tgggaccgag gccttatgca gtacgtggca ggctgctctt cctcatggga gggaaaacag 360 

tggaattaaa gagtgtctgc cccagcccgg cagggtgaag taggatgggg aaaacgttct 420 

caccagaccc tgggacttct atgctgcagc atcgtgacct gaggggtgga tgcagttgcc 480 

acagctcttt gaggcaaagg ccccgatgct ctgtggacag cctcaggctt gggatggatt 540 

tggcagtgag gaacttattg taacagaaga aagtcatcca agatgcctga ggaaagaaac 600 

cttcaattga gccagccggc tggaaaatgt ggccaagaaa accgcagaga ccaatgttcg 660 

gaggagaaaa ccagaaagag gggcctgcct ggcccctttg atcctttatg gccgattccg 720 

tggacattgc tgctcctcac gccggcagcc tctcttgagt acctcaattg cagtctccag 780 

accctcaccc cgcaggcatt cctgggtcgg tgtcccagtc ggtcacagtc atggatcctc 840 

tgcagagcag tagaaagtcg ggaggggccc gtgcccatgg tcaggaaagg agcggcagga 900 

ggaaagagga gcatgagaac tcagaagaaa ttgtacctac tcagatgtgg agtgaggata 960 

gacgttccca gattcaaagg catcatgaag tgtcatgaca agatagaaaa gactttgggc 1020 

tggccaagaa ggaactggat aaaattatga gtgaggtaca gcaggtggga acagtgtcac 1080 

tgaaccctat caacagcaga gcatgagaac gtgaattcct gctgctgggg aggcaatgaa 1140 

atgatatggg ccttcagatg tctatgaatc ctgacccacc gtgggtgcca gttttcaaga 1200 

gggcttccca tcaaatattg tgcgcaaagg atggatggat gaaaggaaga gtgagccaat 1260 

aaacgaggga acgccgggaa aggcagcctc aagccggtgg gccctggcac ccccaccgtc 1320 

cctgagcatc gagccggttc ccgccccggc ccgaactggc ccgcgcgcgc tcgcagcccc 1380 

gcggcggaac ccgagggcgg cggcagcggt tccttgaacg agccggggaa tctggaggga 1440 

gcacacagga aaggcagagc cgcgagctgg accagccgtg caaatctcta gaagatgacg 1500 

gtgttcttta aaacgcttcg aaatcactgg aagaaaacta cagctgggct ctgcctgctg 1560 

acctggggag gccattggct ctatggaaaa cactgtgata acctcctaag gagagcagcc 1620 

tgtcaagaag ctcaggtgtt tggcaatcaa ctcattcctc ccaatgcaca agtgaagaag 1680 

gccactgttt tctcaatcct gcagcttgca aaggaaaagc caggactcta tttgaaaaaa 1740 

atgctgcccg attttacatt tatctggcat ggatgtgact attgtaagac agattatgag 1800 

ggacaagcca agaaactcct ggaactgatg gaaaacacgg atgtgatcat tgttgcagga 1860 

ggagatggga cactgcagga ggttgttact ggtgttcttc gacgaacaga tgaggctacc 1920 
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ttcagtaaga ttcccattgg atttatccca ctgggagaga ccagtagttt gagtcatacc 1980 

ctctttgccg aaagtggaaa caaagtccaa catattactg atgccacact tgccattgtg 2040 

aaaggagaga cagttccact tgatgtcttg cagatcaagg gtgaaaagga acagcctgta 2100 

tttgcaatga ccggccttcg atggggatct ttcagagatg ctggcgtcaa agttagcaag 2160 

tactggtatc ttgggcctct aaaaatcaaa gcagcccact ttttcagcac tcttaaggag 2220 

tggcctcaga ctcatcaagc ctctatctca tacacgggac ctacagagag acctcccaat 2280 

gaaccagagg agacccctgt acaaaggcct tctttgtaca ggagaatatt acgaaggctt 2340 

gcgtcctact gggcacaacc acaggatgcc ctttcccaag aggtgagccc ggaggtctgg 2400 

aaagatgtgc agctgtccac cattgaactg tccatcacaa cacggaataa tcagcttgac 2460 

ccgacaagca aagaagattt tctgaatatc tgcattgaac ctgacaccat . cagcaaagga 2520 

gactttataa ctataggaag tcgaaaggtg agaaacccca agctgcacgt ggagggcacg 2580 

gagtgtctcc aagccagcca gtgcactttg cttatcccgg agggagcagg gggctctttt 2640 

agcattgaca gtgaggagta tgaagcgatg cctgtggagg tgaaactgct ccccaggaag 2700 

ctgcagttct tctgtgatcc taggaagaga gaacagatgc tcacaagccc cacccagtga 2760 

gcagcagaag acaagcactc tgagaccaca ctttaggcca ccggtgggac caaaagggaa 2820 

caggtgcctc agccatccca acagtgtcgt cagagggtcc ccagggcatt ttcatggcaa 2880 

gtacccctct gcccccactc cagcagtgct tcccaaagtg tgctctgtca cctgctttgc 2940 

aatcggcttc cattagcgca tgttttattt tggtgtgacg gttggccctc ctaaacacgg 3000 

actttcctca ggctggttca agacggaaaa ggactttctt ctgttttctt ccaaagtgca 3060 

accacagtgg agagcccacg gtgggcttag cctgcctagg cccttccatt tctcttcttt 3120 

gaccgtgcta ggaattccag gaaagtgcat tcctgccctg gtgacctttt cctatgtcta 3180 

ggctcctcca caggtgctgc tattttgtga gctccggctc ctgtttagct tttatttcag 3240 

ttctaacctc agtccagaaa catatgtgag gttgtttccc tcttcagcca cggctacaat 3300 

accggaaaat gctagttttt atttattttt ttaagtagtg cttcctaaat ggtttgcatg 3360 

agagccacct ggggtacatg ttgaaaactt atttggggtc taccccaaac ctaataaccc 3420 

aaatttgggg atggggccca ggaatatgca tttttaaaaa gtcatctgcc cttcccaggt 3480 

gattctgtaa gttgtccctc aactgtactt ggagaaatcg tgttttaaag cagtagtcca 3540 

caaagtattc tgctcatgtg cccccaaaag tattttgaaa aatcatgtat accctcaccc 3600 

atctaagttg atatctaaaa ttttatctaa gttggtatct aaaatttttc atgggaagtt 3660 

aaatagttga caaagtatgt atttgctggt gtcgtgtaaa tattggtatt ttaaaataaa 3720 

aactgttaca tcactatttt aaacatatcc agtacaattt aaatatcaca acaatttgac 3780 

acccttcatt catttataaa aataaatgag ctagttcttt agtagttaaa catttcaaat 3840 

tggcttttct ccttctgtat ttccatacca cttttcagcc aagaatccta tcataatgta 3900 

• atctattatg cccgacatct ttttaatcaa ttcaccccat tacttcttgt caacaaaaaa 3960 

tataaatgga aatttttttt ttagctcttg ctttaagtgt ttgtttgtta tctcagtcca 4020 

gaaccaatat tatcgtaatt aattattggt atataatgaa aacggtatta attcttggat 4080 

gattaaaagt ttttttatta gaatgttaaa aaaaaaaaaa aaaaaaaaa 4129 

<210> 19 
<211> 1674 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (1649) . . (1649) 
<223> n equals a,t,g, or c 

<220> 

<221> misc — feature 
<222> (1663) . . (1663) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (1665) . . (1665) 
<223> n equals a,t,g, or c 

<400> 19 

caagttggta cgcctgcagg taccggtccg gaattcccgg gtcgacccac gcgtccggtc 60 

gaagataggt tcgagacggt ggatgttgca gctgatcatg cagttgggtt cggtgctgct 120 
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cacacgctgc cccttttggg gctgcttcag ccagctcatg ctgtacgctg agagggctga 180 

ggcacgccgg aagcccgaca tcccagtgcc ttacctgtat ttcgacatgg gggcagccgt 240 

gctgtgcgct agtttcatgt cctttggcgt gaagcggcgc tggttcgcgc tgggggccgc 300 

actccaattg gccattagca cctacgccgc ctacatcggg ggctacgtcc actacgggga 360 

ctggctgaag gtccgtatgt actcgcgcac agttgccatc atcggcggct ttcttgtgtt 420 

ggccagcggt gctggggagc tgtaccgccg gaaacctcgc agccgctccc tgcagtccac 480 

cggccaggtg ttcctgggta tctacctcat ctgtgtggcc tactcactgc agcacagcaa 540 

ggaggaccgg ctggcgtatc tgaaccatct cccaggaggg gagctgatga tccagctgtt 600 

cttcgtgctg tatggcatcc tggccctggc ctttctgtca ggctactacg tgaccctcgc 660 

tgcccagatc ctggctgtac tgctgccccc tgtcatgctg ctcattgatg gcaatgfctgc 720 

ttactggcac aacacgcggc gtgttgagtt ctggaaccag atgaagctcc ttggagagag 780 

tgtgggcatc ttcggaactg ctgtcatcct ggccactgat ggctgagttt tatggcaaga 840 

ggctgagatg ggcacaggga gccactgagg gtcaccctgc cttcctcctt gctggcccag 900 

ctgctgttta tttatgcttt ttggtctgtt tgtttgatct tttgcttttt taaaattgtt 960 

ttttgcagtt aagaggcagc tcatttgtcc aaatttctgg gctcagcgct tgggagggca 1020 

ggagccctgg cactaatgct gtacaggttt ttttcctgtt aggagagctg aggccagctg 1080 

cccactgagt ctcctgtccc tgagaaggga gtatggcagg gctgggatgc ggctactgag 1140 

agtgggagag tgggagacag aggaaggaag atggagattg gaagtgagca aatgtgaaaa 1200 

attcctcttt gaacctggca gatgcagcta ggctctgcag tgctgtttgg agactgtgag 1260 

agggagtgtg tgtgttgaca catgtggatc aggcccagga agggcacagg ggctgagcac 1320 

tacagaagtc acatgggttc tcagggtatg ccaggggcag aaacagtacc ggctctctgt 1380 

cactcacctt gagagtagag cagaccctgt tctgctctgg gctgtgaagg ggtggagcag 1440 

gcagtggcca gctttgccct tcctgctgtc tctgtttcta gctccatggt tggcctggtg 1500 

ggggtggagt tccctcccaa acaccagacc acacagtcct ccaaaaataa acattttata 1560 

tagamaaaaa aaaaaaaaaa aagggcggcc gctctagagg atccctcgag gggcccaagc 1620 

ttacgcgtgc atgcgacgtc atagctctnt ccctatagaa gtngnaaagg gttc 1674 

<210> 20 
<211> 2005 
<212> DNA 

<213> Homo sapiens 
<400> 20 

ggttgctggc ccaggtgagc gggcgcgctg gtccaggtga gcgggcgcgt ccccgcgacg 60 

gcgctgcctg cccgaggcgg ttcacgtaaa gacagcgaga tcctgagggc cagccgggaa 120 

ggaggcgtgg atatggagct ggctgctgcc aagtccgggg cccgcgccgc tgcctagcgc 180 

gtcctgggga ctctgtgggg acgcgccccg cgccgcggct cggggacccg tagagcccgg 240 

cgctgcgcgc atggccctgc tctcgcgccc cgcgctcacc ctcctgctcc tcctcatggc 300 

cgctgttgtc aggtgccagg agqaggccca gaccaccgac tggagagcca ccctgaagac 360 

catccggaac ggcgttcata agatagacac gtacctgaac gccgccttgg acctcctggg 420 

aggcgaggac ggtctctgcc agtataaatg cagtgacgga tctaagcctt tcccacgtta 480 

tggttataaa ccctccccac cgaatggatg tggctctcca ctgtttggtg ktcatcttaa 540 

cattggtatc ccttccctga caaagtgttg caaccaacac gacaggtgct atgaracctg 600 

tggcaaaagc aagaatgact gtgatgaaga attccagtat tgcctctcca agatctgccg 660 

agatgtacag aaaacactag gactaactca gcatgttcag gcatgtgaaa caacagtgga 720 

gctcttgttt gacagtgtta tacatttagg ttgtaaacca tatctggaca gccaacgagc 780 

cgcatgcagg tgtcattatg aagaaaaaac tgatctttaa aggagatgcc gacagctagt 840 

gacagatgaa gatggaagaa cataaccttt gacaaataac taatgttttt acaacataaa 900 

actgtcttat ttttgtgaaa ggattatttt gagaccttaa aataatttat atcttgatgt 960 

taaaacctca aagcaaaaaa agtgagggag atagtgaggg gagggcacgc ttgtcttctc 1020 

aggtatcttc cccagcattg ctcccttact tagtatgcca aatgtcttga ccaatatcaa 1080 

aaacaagtgc ttgtttagcg gagaattttg aaaagaggaa tatataactc aattttcaca 1140 

accacattta ccaaaaaaag agatcaaata taaaattcat cataatgtct gttcaacatt 1200 

atcttatttg gaaaatgggg aaattatcac ttacaagtat ttgtttacta tgaaatttta 1260 

aatacacatt tatgcctaga aggaacggac tttttttttc tattttaatt acacataata 1320 

tgtaattaaa gtmcaacata atatgttgtt tctctgtagc ccgttgagca tatgagtaag 1380 

tcacatttct attaggacta cttmcaagga caaggtttcc atttttccag ttgtaaaatt 1440 

ggaaccatca gctgataacc tcgtagggag caaccccagg atagctaagt gttatgtaat 1500 

atgcctagaa ggtgatgtga atgcgattca gaagcatagc cactcccatt ttatgagcta 1560 

ctcacatgac aaatgtcatc ttttgctata acctttgcca agttagagaa aagatggatt 1620 

taatgagata aatgaaaaga tatttamcct aatatatcaa ggcactattt gctgttatgc 1680 
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tttgttattt atttcccagc acttgttcct tattgtagat tttttaaaga ctgtaacctt 1740 

ttactaactg tggtcttact aaaatttgtg cttgatactg cttttcaaaa agcctttaat 1800 

tagagccaaa aggatggaaa aggcaagata taaatgcctt ttatagatct cttatttaca 1860 

ttgaaaatta ttaccatatg tttagagcaa atccaagaaa acttcaacag cttctgaaga 1920 

tgtctatgaa tgttgaaaac ttttcaatst cttggratgc tcakttaatt cgcagaccgg 1980 

cttaacggat taaacgcccc ccccc 2005 

<210> 21 
<211> 812 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> (17) . . (17) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (108) . . (108) 
<223> n equals a,t,g, or c 

<400> 21 

gaccattttt agccaanctt ggaattaacc ctcacttaag ggaacaaaag ctggagcttc 60 

caccgcgttg gcggccgctc tagaactagt ggatcccccg ggctgcanga attcggccac 120 

gagaggactt ccccacctca tgcagctatt tgggccgtgg cgtctgaaat ttattatttc 180 

agagtcaccc ctttratgac cttggcagtg ractgcagtc atctgtttag gcctttccat 240 

ggcccacgtc aatgccgtta tttctgtttg ttgcacattt gatttccttg ttgttggcat 300 

ttagaaggcc ccctgcttcc cagatcacac cacgggcatg gaccacagag attgcatctt 360 

gtgagtctgt agaaatggtc aaggccttgt cctctcttag gtccagagct caggtgaatg 420 

cagattttcc cggccatctg tgctgaagtc cctgtgggga ggctcctggc tggtttcctg 480 

taggtagaca gctacacgtc ctgcccttca ttggcttctt ttcatgaagc tcctgccatc 540 

tacaaaacat gtctcccttc ttgaatcaca tctctgttat tgaagctctg gaagtcaacc 600 

gggcgtggtg gctatgccta taatcccagc attttgggat gccggggcgg gtggatcacc 660 

tgaggtcagg agttcgggac cagcctggcc aacatggcga aaccccgtct ctaatacaag 720 

tgcaaaaatt ggccaggcgt ggtggtcact gtgctccagc ctgggtgaca gagcgagctc 780 

cgtctcaaaa aaaaaaaaaa aaaaaactcg ag 812 

<210> 22 
<211> 910 
<212> DNA 

<213> Homo sapiens 
<400> 22 

ggcagagctg gccttcgact cgctatgtcc actaacaata tgtcggaccc acggaggccg 60 

aacaaagtgc tgaggtgagg accccagcgt cgtgggcacg ggttcgggtt gtgggtgtgg 120 

atcggggccc tgggaagcgc ctgtctatcc cgggggcagg acctgagcgc ccctgaccct 180 

cgagcctgtc gcaggtacaa gcccccgccg agcgaatgta acccggcctt ggacgacccg 240 

acgccggact acatgaacct gctgggcatg atcttcagca tgtgcggcct catgcttaag 300 

ctgaagtggt gtgcttgggt cgctgtctac tgctccttca tcagctttgc caactctcgg 360 

agctcggagg acacgaagca aatgatgagt agcttcatgt gagacttgcc ctacagaaca 420 

agtgactctt gagtaagggg tggggggacc ccagcctggc catcctagac tgacacctct 480 

ctcctgtctt catgctgtcc atctctgccg tggtgatgtc ctatctgcag aatcctcagc 540 

ccatgacgcc cccatggtga taccagccta gaagggtcac attttggacc ctgtctatcc 600 

actaggcctg ggctttggct gctaaacctg ctgccttcag ctgccatcct ggacttccct 660 

gaatgaggcc gtctcggtgc ccccagctgg atagagggaa cctggccctt tcctagggaa 720 

caccctaggc ttacccctcc tgcctccctt cccctgcctg ctgctggggg agatgctgtc 780 

catgtttcta ggggtattca tttgctttct cgttgaaacc tgttgttaat aaagtttttc 840 

actctgaaaa aaaaaaaaaa aaaaaaaaac tygrgggggg gcccggaacc caattcsccg 900 

gatagtgagt 910 
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<210> 23 
<211> 821 . 
<212> DNA 

<213> Homo sapiens 
<400> 23 

gttttgagtg tgtgaattac atatatgaac atctgaraaa atcctataag cagtttaatc 60 

aactgttcca ctccactcca agtgagtcca taggcagaat tgagttatgg ggagagcggc 120 

ctagtaataa ttggtttgcg taatacaaag ttctactggg tagtgatgtt gtagaagttc 180 

atatagaatc agctgagctt tcagaaatgg tgaaagggtg gtaatagtca taacttagat 240 

tgtaattttt ttcccatagg cttttaaaaa atattcatga ggttcttttt ttatttcaat 300 

agtttttggg gaacaggtgg tttttggtta catgataagt tcttcagtgg tgatttctga 360 

gattttggtg cacctgtcat gtgagcagta tgaactctac tttatgtgta gtcttatccc 420 

tcatgtgtat gaactccacc ttatgtgtag tcttatccct cacccactcc tgcccttccc 480 

cacaagtccc caaagtccat tatatgatct ttatgccttt acatcttcac agtttagctc 540 

tcacacaact tattataatt tataagtaag ccagcattgg atatagttgt attccattat 600 

taatfctaaga aaccttatgc aagtaattat tagtcatcat cccaaaaaaa agggagaaca 660 

gggttagatt cagaatactt tgataagagc taaatactat catgagtgct gtcagtctgt 720 

agtaactttc cattggtatt ctatgtcttt taggcttaca gatacttttt acactcttac 780 

aaaatgtgca caagaagaag ctgcagctca gagctcgtgc c 821 

<210> 24 
<211> 981 
<212> DNA 

<213> Homo sapiens 
<400> 24 

acggaagcgc agagcacgga ccccgccccc tcgcggcccc gctcgtgacg tcgcgggggg 60 

cgccggcctc cgcccggccc cgagggcagg ctctccccgg aggctcagcc ccctctgctc 120 

cccatgggca actgccaggc agggcacaac ctgcacctgt gtctggccca ccacccacct 180 

ctggtctgtg ccactttgat cctgctgctc cttggcctct ctggcctggg ccttggcagc 240 

ttcctcctca cccacaggac tggcctgcgc agccctgaca tcccccagga ctgggtctct 300 

tttttgagat cttttggcca gctgaccctg tgtcccagga atgggacagt cacagggaag 360 

tggcgagggt ctcacgtcgt gggcttgctg accaccttga acttcggaga cggtccagac 420 

aggaacaaga cccggacatt ccaggccaca gtcctgggaa gtcagatggg attgaaagga 480 

tcttctgcag gacaactggt ccttatcaca gccagggtga ccacagaaag gactgcagga 540 

acctgcctat attttagtgc tgttccagga atcctaccct ccagccagcc acccatatcc 600 

tgctcagagg agggggctgg aaatgccacc ctgagcccta gaatgggtga ggaatgtgtt 660 

agtgtctgga gccatgaagg ccttgtgctg accaagctgc tcacctcgga ggagctggct 720 

ctgtgtggct ccaggctgct ggtcttgggc tccttcctgc ttctcttctg tggccttctc 780 

tgctgtgtca ctgctatgtg cttccacccg cgccgggagt cccactggtc tagaacccgg 840 

ctctgagggc actggcctag ttcccgactt gtttctcagg tgtgaatcaa cttcttgggc 900 

cttggctctg agttggaaaa ggttttagaa aaagtgaaga gctggaatgt gggggaaaat 960 

aaaaagcttt tttgcccaaa a 981 

<210> 25 
<211> 1038 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> (2) . . (2) 

<223> n equals a,t,g, or c 

<400> 25 

gnaattcggc acgagttaat gtataaaata tttctataat gaattttaat gggaattaga 60 

gcatcataga aaaaatgctc ttactgttga aaacattatt tgttacattt tggtcaacta 120 

atctttcaat aacttttagt aactataatg ttaagttgta ccagtggcag tcttatatag 180 

taaatggcag ctgacagcat gaaaataaca tatctaatat tttgtgacta tcttattagg 240 

aaaatcagag aatttcaaaa ccttgttagt ttttagggta tagtcacatt ttataaatgt 300 
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gcggtatatt tatacatgat ttgacgtttg tgwaaatatt ttccctggac ttttatttta 360 

gatgagatct acagtgtagg caaacttata taatctgtca actccattag tgtcatagtc 420 

agactcatcc ccatgctaaa attatagttg tkaaaatacg cttttgtaaa tagttgtgtt 480 

aggtcattat caccaagtct tcaaggkatt acattataaa aaccttggkt tttattcttg 540 

tgaatamccg ttttttccat gcaaagttaa aattcttcag cctttaattt ttttattaat 600 

atataaggat gtgatgagta tgactacaaa acaggaaaaa ataaacagat ttcgtttgtg 660 

gcttttgcta aattgttacc tgacaaaatc ttagccagtt cttcattttc gttttgagat 720 

gaagatactt agttttagtc caggggctgg gcgcgatagc tgatgcctgt ggtcccagtg 780 

ctttgcgggg ccgaggcagg tggatcactt aaggtcagga gtttgagacc agcctgccca 840 

acatggtgaa acgttgtctc tactaaaaat acaaaaatta gacaggcgtg gtggcacaca 900 

tctgtaattc cagctactca ggaggctaac acaggaaaat tccttgaacc tgggaggcag 960 

aggttgcagt gagccattgc actccagcct gggcaacaca gtgagactct tgtctcaaaa 1020 

aaaaaaaaaa aaactcga 1038 

<210> 26 
<211> 843 
<212> DNA 

<213> Homo sapiens 
<400> 26 

ggcagctgtc caccgatccc ggccaccgcc cccggccacc cccaccccgc gagcccatgg 60 

aggctccggg accccgcgcc ttgcggactg cgctctgtgg cggctgttgc tgcctcctcc 120 

tatgtgccca gctggctgtg gctggtaaag gagctcgagg ctttgggagg ggagccctga 180 

tccgcctgaa tatctggccg gcggtccaag gggcctgcaa acagctggag gtctgtgagc 240 

actgcgtgga gggagacaga gcgcgcaatc tctccagctg catgtgggag cagtgccggc 300 

cagaggagcc aggacactgt gtggcccaat ctgaggtggt caaggaaggt tgctccatct 360 

acaaccgctc agaggcatgt ccagctgctc accaccaccc cacctatgaa ccgaagacag 420 

tcacaacagg gagcccccca gtccctgagg cccacagccc tggatttgac ggggccagct 480 

ttatcggagg tgtcgtgctg gtgttgagcc tacaggcggt ggctttcttt gtgctgcact 540 

tcctcaaggc caaggacagc acctaccaga cgctaatctg acccctttgg gcctggactc 600 

catcctgagg ggaaaggagg atgcagaggg tggcctctgg gcacccttgt gggtaagcgg 660 

ggggcggggg cgggaaaaac tctggccgcc agtttttggc tcctgcgggc accaagcagg 720 

ccaagtgttt aatgcctgac atctcctcct gtcctgggcc tggaacctgc agctgagaaa 780 

atccctcaac cacctcgtct cctccatcgc ccctgctggg ccccccagcc tgacagtggg 840 

ttg 843 

<210> 27 
<211> 601 
<212> DNA 

<213> Homo sapiens 
<400> 27 

gctgccagga attccggcac ggggaacagt gtaatattga agcaaatgct gtataacaac 60 

cacctggaag cccctcatgt atctcttttt gaaaacactc ctctctttct ccactctaat 120 

gatgaccacc gccttgtctt ttatggtaat cactgttctt tgggttttat tactgcattt 180 

attggctaat atatgcatcc ctagaaaatg tagttttgcc tgcttttata taaatggaat 240 

attactgcat gcagtctttt gatttgtgat tgttttgctc taaggcttgt aagggtcatc 300 

catgttttgc atatagtttg tttattgtca ttgccataga gtaaatcatt gtatgaatat 360 

actgcagttt atttactgtt gacatatgtt tcagttgttt ttaactacta ggaaatgcta 420 

ctctgtacat tcttgtatat gtaccttggt gcacatatgt atgtttttct agagtatata 480 

cagtggcatg ggattgctga attaaaaggt ttgtatatct tatactagaa gataataaaa 540 

acttttcctg atggattctg ccaattcaaa aaaaaaaaaa aaaaaaaaaa aaaaaactcg 600 

a 601 

<210> 28 

<211> 1276 

<212> DNA 

<213> Homo sapiens 

<400> 28 

gtgagtgtgt ggcactggtg gcctggagcc aaatttagct tgggtgagag ttgacaatgg 60 
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tagttttcct tcctcaagcc cctctgtgcc cctagagcac cctggctgtg gctgcctcct 120 

tcatccaaga gcagagtcca tgttgggcca ggagacttca gatccatgtc ctggtgctgc 180 

ctctggcttt gtctttcctc agtgggcagg actgggtctg ctggtccatc tttacccttc 240 

tctgagctat gcagccttgg cctgctgcgt ctccggcctg tattctctcc ccttcactca 300 

ggccctggga aaccagccca gtttctkgca ggagaggcag aggaggtcaa tgcctttgct 360 

ctgggcttcc tgagcaccag cagtggtgtc tctggagaag atgaagtaga gcccttacac 420 

gatggagttg aagaggcaga gaaaaagatg gaagaagaag gtgtgagtgt gagtgaaatg 480 

gaggcaacag gagcacaagg acccagcagg gtagaagagg ctgagggaca cacagaggtg 540 

acagaagcag agggatccca ggggactgct gaggctgacg ggccaggagc atcttcaggg 600 

gatgaggatg cctctggcag ggcagcaagt ccagagtcgg cctccagcac ccctgagtct 660 

ctccaggcca ggcgacatca tcagtttctt gagccagccc cagcgcctgg tgctgcagtc 720 

ttatcttcag agcctgcaga gcctctgttg gtcaggcatc cccctaggcc ccggaccacc 780 

ggccccaggc cccggcaaga tccccacaag gctggactga gccactatgt gaaactcttt 840 

agcttctatg ccaagatgcc catggagagg aaggctcttg agatggtgga gaagtgccta 900 

gataaatatt tccagcatct ttgtgatgat ctggaggtat ttgctgctca tgctggccgc 960 

aagactgtga agccagagga cc tggagctg ctgatgcggc ggcagggcct ggtcactgac 1020 

caagtctcac tgcacgtgct agtggagcgg cacctgcccc tggagtaccg gcagctgctc 1080 

atcccctgtg catacagtgg caactctgtc ttccctgccc agtagtggcc aggcttcaac 1140 

actttccctg tcccacctgg ggactcttgc ccccacatat ttctccaggt ctcctcccca 1200 

cccccccagc atcaataaag tgtcataaac agaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1260 

attggggggg ggcccc 1276 

<210> 29 

<211> 2084 

<212> DNA. 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (2075) . . (2075) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (2083) . . (2083) 
<223> n equals a,t,g, or c 

<400> 29 

ggcacgagga gttgtgcaga tacctggctg agagctggct caccttccag attcacctgc 60 

aggagctgct gcagtacaag aggcagaatc cagctcagtt ctgcgttcga gtctgctctg 120 

gctgtgctgt gttggctgtg ttgggacact atgttccagg gattatgatt tcctacattg 180 

tcttgttgag tatcctgctg tggcccctgg tggtttatca tgagctgatc cagaggatgt 240 

acactcgcct ggagcccctg ctcatgcagc tggactacag catgaaggca gaagccaatg 300 

ccctgcatca caaacacgac aagaggaagc gtcaggggaa gaatgcaccc ccaggaggtg 360 

atgagccact ggcagagaca gagagtgaaa gcgaggcaga gctggctggc ttctccccag 420 

tggtggatgt gaagaaaaca gcattggcct tggccattac agactcagag ctgtcagatg 480 

aggaggcttc tatcttggag agtggtggct tctccgtatc ccgggccaca actccgcagc 540 

tgactgatgt ctccgaggat ttggaccagc agagcctgcc aagtgaacca gaggagaccc 600 

taagccggga cctaggggag ggagaggagg gagagctggc ccctcccgaa gacctactag 660 

gccgtcctca agctctgtca aggcaagccc tggactcgga ggaagaggaa gaggatgtgg 720 

cagctaagga aaccttgttg cggctctcat cccccctcca ctttgtgaac acgcacttca 780 

atggggcagg gtccccccaa gatggagtga aatgctcccc tggaggacca gtggagacac 840 

tgagccccga gacagtgagt ggtggcctca ctgctctgcc cggcaccctg tcacctccac 900 

tttgccttgt tggaagtgac ccagccccct ccccttccat tctcccacct gttccccagg 960 

actcacccca gcccctgcct gcccctgagg aagaagaggc actcaccact gaggactttg 1020 

agttgctgga tcagggggag ctggagcagc tgaatgcaga gctgggcttg gagccagaga 1080 

caccgccaaa accccctgat gctccacccc tggggcccga catccattct ctggtacagt 1140 

cagaccaaga agctcaggcc gtggcagagc catgagccag ccgttgagga aggagctgca 1200 

ggcacagtag ggcttcttgg ctaggagtgt tgctgtttcc tcctttgcct accactctgg 1260 

ggtggggcag tgtgtgggga agctggctgt cggatggtag ctattccacc ctctgcctgc 1320 

ctgcctgcct gctgtcctgg gcatggtgca gtacctgtgc ctaggattgg ttttaaattt 1380 
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gtaaataatt ttccatttgg gttagtggat gtgaacaggg ctagggaagt ccttcccaca 1440 

gcctgcgctt gcctccctgc ctcatctcta ttctcattcc actatgcccc aagccctggt 1500 

ggtctggccc tttctttttc ctcctatcct cagggacctg tgctgctctg ccctcatgtc 1560 

ccacttggtt gtttagttga ggcactttat aatttttctc ttgtcttgtg ttcctfctctg 1620 

ctttatttcc ctgctgtgtc ctgtccttag cagctcaacc ccatcctttg ccagctcctc 1680 

ctatcccgtg ggcactggcc aagctttagg gaggctcctg gtctgggaag taaagagtaa 1740 

acctggggca gtgggtcagg ccagtagtta cactcttagg tcactgtagt ctgtgtaacc 1800 

ttcactgcat ccttgcccca ttcagcccgg cctttcatga tgcaggagag cagggatccc 1860 

gcagtacatg gcgccagcac tggagttggt gagcatgtgc tctctcttga gattaggagc 1920 

ttccttactg ctcctctggg tgatccaagt gtagtgggac cccctactag ggtyaggaag 1980 

tggacactaa catctgtgca ggtgttgact tgaaaaataa agtgttgatt ggctagaaaa 2040 

aaaaaaaaaa aaaaaaaaaa actcgagggg gggcnccggt acne 2084 

<210> 30 
<211> 1765 
<212> DNA 

<213> Homo sapiens 
<400> 30 

ggcacgagat ttctgggagt ectgeagagt etagttgeca agtggaacat tcttaaaaag 60 

ategttcaga agtttaccag aattaaaaga tgctgtcttg gaccagtatt caatgtgggg 120 

aaataaattt ggagtattgc tttttctgta ttctgtatta ctgacaaagg gcattgaaaa 180 

cataaaaaac gaaattgaag atgcaagtga acccttgata gatcctgtat atggacatgg 240 

cagecaaagt ttaattaatc tcctgctgac gggacatget gtttctaatg tatgggatgg 300 

tgatagagag tgctcaggaa tgaaacttct tggtatacat gaacaagcag cagtaggatt 360 

tttaacacta atggaagctt taagatactg taaggttggt tcttacttga aatctccaaa 420 

attccctatt tggattgttg gcagtgagac tcacctcacc gtattttttg ccaaggatat 480 

ggctttagtt gcccctgaag ctccttcaga acaagecaga agagtttttc aaacctacga 540 

cccagaagat aatggattca tacccgattc acttctggaa gatgtgatga aagcattgga 600 

ccttgtttca gatcctgaat atataaatct catgaagaat aaattagatc cagaaggatt 660 

aggaatcata ttattgggcc catttcttca agaatttttt cctgatcagg gctccagtgg 720 

tccagaatct tttactgtct accactacaa tggattgaag cagtcaaatt ataatgaaaa 780 

ggtcatgtac gtagaaggga ctgcagttgt gatgggtttt gaagatccca tgetacagae 840 

agatgacact cctattaaac getgtctgea aaccaaatgg ccatacattg agttactctg 900 

gaccacagat cgctctcctt cactaaatta atttgtctaa gtatttataa ggaagatctt 960 

aataacagat gttgaaagaa ggagtcaaga ctggcaattg gctggattaa gctaaacact 1020 

ggtatcactg attaactgta aataacaatt aaaaacacat tttcagtgtt tatgatatgt 1080 

ttaaattatt tgtcctaaag ctttatgtta aagattatcc tattttaccc cttcgtgtga 1140 

aatttactag caaaattaag ctttcatcaa agttcatcac ttttgeatte agatacttgg 1200 

tcatttactt accaaattac aaaegcaata ctacagcatt tgtatattaa gtatcacagt 1260 

tactattgat aaactacttt tgggttttat ttcattgagg cacttttttt attgtttgaa 1320 

tgattcegge ttgtaatata tcagcctcta caatgaaatg cagaagagtt catttttcta 1380 

agatctgttt ttcattagaa atattgacaa ataacacatt gtcaacctgg atcctttgac 1440 

aatttactta actctggcat gttcacaaaa agtagaaact ctaagagacc attaccattt 1500 

attcacagat gtatagggga tgtattctaa aaactgacag aaaagagaat ctgatagtca 1560 

acactgttaa cttttactgt gtaattgeca aatacacttt tccaaatttg tcccaacagc 1620 

cctgtaagcc agctttcttc tatatttata aacacgataa atgeatgaga agatctgtta 1680 

ttacattagt atattaegtt atttattatg atcctagttg atggcctaaa taaacacctt 1740 

tttctttaaa aaaaaaaaaa aaaaa 1765 

<210> 31 
<211> 2494 
<212> DNA. 

<213> Homo sapiens 
<400> 31 

ggcacgagga gatgtttaag gattacccgc cagecataaa accatcctac gatgtgctgc 60 

tgetgetget getgetagtg ctcctgctgc aggccggcct caacacgggc accgccatcc 120 

agtgcgtgcg cttcaaggtc agtgcaaggc tgcagggtgc atcctgggac acccagaacg 180 

gcccgcagga gcgcctggct ggggaggtgg ccaggagccc cctgaaggag ttcgacaagg 240 

agaaagectg gagagcegtc gtggtgcaaa tggcccagtg acccccagac gcggaaaccg 300 
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ggtggcagcg cccagcctgg ccccaagcat ggaaacgcac aacccctaat cgccctgagc 360 

tactgcttct aacacctctt ttcccttgtg tgagggcaaa ccaggctgca ggtggggttt 420 

tcacttccta gggtagttta attttaaaat aggccaatgt tggctagtct gtgcctcagt 480 

gagatcagtc agctccgagt ggctcccgtg tcgtaacagc aggagcatgg ccgcaacttc 540 

ccaggccgag gaagggcccc cggctcggcc tcttgagagc cccacccctg aactggcccc 600 

agctcctctt cctgcctctc tcatggcttg ggctggagtg ggctctctgg acctgaccag 660 

actgtgggtc cctgcgtctc ctgcccactc tgaccgggct tcctccctcc acgcttaggg 720 

tctgtcccgg gtactcagtc agcccagtgg gatcttaccc acttccctgc aaggtgcacc 780 

tgccccaggc tcaggctgcc cagcggctct tcctggacag tgagagcagg gctgggcgcc 840 

tctgtcctgg cccgggagcc gcaggggccc ctcctccaga gcctgggcgc aagcgacaca 900 

ggctgccgct gctctcccag gtgaaatcca caccagtcca cgccgggtcg cctgccctgt 960 

ctccctactt agacccagtc attctagagg gatccaccgc cacactggcc ggcccacgtc 1020 

ctgggtgctg tcatgcccag cttggagtgc cacgtggccg ctgcccacgt cccgggcact 1080 

gtcatgccca gcttggagtg ccacatggcc gctgcccacg tcccgggcac tgtcacgccc 1140 

agcttggagt gccacgtggc cgctgctgtg acaggcagtg ttcttggggg tggggctgca 1200 

tccaaggctt tgtaaaccgg ctggaccacg tctccctggc cccagtgacc gggggaagct 1260 

gagcccctcc ctcctgtgtt tgctcccatt actcaaaatg caggacagat caggtcagag 1320 

cccaggaatt ctcacaggtt cacccagcgc cctctacctc ctagcaagta ctttgtcttg 1380 

atcctcactg agaaggcccc agggcagcgg tcttctccat ctccgctgtt ttggggtctt 1440 

agggtacagc ccaggcggtc actgcccacc tgccaggctg cagggacagt tgggtgtgag 1500 

aataacactg gctttgggta gtgccatggc caggagtggg tttccctgcg tctcctcgtc 1560 

ccgagggcgc ctgggtcctc ccagctgacg gcagtaaatc cacagtgagt tggggcgact 1620 

gtgaaactgg aatgctgtta ctttgataat tactttccag caggtgtttt ccttcacaat 1680 

ggttttgttt ctttccttct gatctgagaa gacatgaacg ttttctcttc accgccgtgg 1740 

ggtgtattga ctggtccccc atgggctgct ggaaaggccc ggagatgcat ctgtggcctg 1800 

gggccatcaa gatcaaagaa ccaggaggcc tgggagatgc agctggatgg ggcggcctgc 1860 

agaccctgcc agggggtttg aggaccctcc caggtttccc actgcggaac aggagtgact 1920 

ctggctgcca agataccttc atggtgttca tgacaagtgg aatcattatt ttcaaccatt 1980 

gaagggggat gcaggcaaga caccttccca gctgctccta gaggggacaa gccaggccct 2040 

ctctgcagtc ctcggcagct ccggaaggac acagtcaggg gccgggcaaa cactttggcc 2100 

acagccccaa acaagcgcca ccgtgggaga ggagaggctg ctgtcactgg taccggatgc 2160 

agaccccacc ctgtctgcag gccaccccca cctccctgca gctttgaggc tggcggggtc 2220 

tgctcctggg aatggggtgg gagccacagg gacgacccgg ggcgggctga tgtcttcttg 2280 

ggggcagacc agagagctca agtttcagag tcagaattag gcacttggag cgtttttgct 2340 

ggcttgcact ttcttatttt cttattttag agcgcttaaa aaaatccgga aaaatggggt 2400 

ttaaaagaac tgtctctttc agtctacatt tttgtttaat acgcttgagc aataaacgct 2460 

tacttgcaaa aaaaaaaaaa aaaaaaaaaa aaaa 2494 



<210> 32 

<211> 885 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<222> (233).. (233) 
<223> n equals a,t,g, or c 

<400> 32 

aattcggcac gagagggctg catccttgcg ttctgtgagc tctgcccgtt gggagcatcc 60 

atgctgatgt gcaggggccg tgcagcactg cattcttctt gccttctctg ttctgtttag 120 

tacaaccacc ccagcaggtc tccagttcct gccaggttag tgtggatggc ccagcaccat 180 

ctcctctcca tcttgttggc tatcctctct tgttcctcac aaccccgcca ggntcgcggc 240 

tcaggagctc tgccgtgtga agtgtgctca gcagttctcc tcacatgtct acgcaaaatc 300 

tctggctccc tgtgtgtctg agcccaacag acacactgag cacaggagtt ggctctcagc 360 

tcctcccagc ttgccgtgac tgagccytgc cgtcctgtgg camcgccasg gagaccacag 420 

tgtccaactg tccaaccttt acgtaattgg catcccagga ggagaagcaa gagtgaatgg 480 

ggcaggaaaa gatcattaaa gaaatcgtgg ctgacataaa aaaggatgag ttcatgtcct 540 

ttgtagggac gcgtggatga agctggaaac catcattctg agcaaactat cgcaaggaca 600 

gaaaaccaaa caccatgtgt tctcactcat aggtgggaat tgaacaatga gatcacttgg 660 

acacagggtg gggaacatca cacaccgggg cctgtcgtgg ggtgaggggg atggggcagg 720 
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gatagcatta ggagatatac ctaatgtaaa tgacgagtta .atgggtgtca gcacaccaac 780 
atggcacatg tatacatatg taacaaacct gcatgttgtg cacatgtacc ccagaactta 840 
aagtataata aattaaaatt aaaaaaaaaa aaaaaaaact cgtag 885 

<210> 33 

<211> 790 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> (37) . . (37) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 

<222> (55) . . (55) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 

<222> (76) . . (76) 

<223> n equals a,t,g, or c 

<220> 

<221> mis cofeature 
<222> (112) . . (112) 
<223> n equals a,t,g, or c 

<220> 

<221> mis cofeature 
<222> (120).. (120) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 

<222> (137) . . (137) 

<223> n equals a,t,g, or c 

<400> 33 

tcaactgggt gaaaaggaaa acccaccctt ggcgccnaat acgcaaaccg ccttntcccc 60 

ggcgcgttgg ccgatncatt aatgcagctg gcacgacagt tttcccgact gnaaagcggn 120 

cagtgagcgc aacgcantta aatgtgagtt agctcactca ttagcacccc aggctttaca 180 

ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 240 

aaacagctat gaccatgatt acgccaagct ctaatacgac tcactatagg gaaagctggt 300 

acgcctgcag gtaccggtcc ggaattcccg ggtcgaccca cgcgtccggt tgaatgcact 360 

gagtcccttg gtgtagtagc aataaggaaa aatgaaatta ctttcctgtg cacacagtcc 420 

agcctaattg gtatgtgatg ttgcacttag cagccatgtg gtgggcatgt gtgactactc 480 

tggttttcac tttagtttct aaacttttta tccctctcaa gtccagcatg gatggggaaa 540 

tgtctctgga tccccacagc tgtgtacttg tttgcatttg tttccctttg agatttgtgt 600 

ttgtgtcctg ctttgagctg taccttgtcc agtccattgt gaaattatcc cagcagctgt 660 

aatgtacagt tccttctgaa gcaagcaaca tcagcagcag cagcagcagc agcacaattc 720 

tgtgttttat aaagacaaca gtggcttcta v/waaaaaaaa aaaaaaaaaa aaaaaaaaaa 780 

aaaaaaaaaa 790 

<210> 34 

<211> 1343 

<212> DNA 

<213> Homo sapiens 

<400> 34 
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ggcacgaggt caaggcaaaa atgggtcagg tttggagagt tcccccactc cttttgagtg 60 

ttcaggtttt ccttaccatg gctcatgctt tccatcaagc accagagttg cagtggcttg 120 

gcctctggtt ctgggtgagg ttatttgcag gtggagacgg ggggctgcac ctgaacattt 180 

ctagtgtcac cctccctctc cttcatggga aacagctctc cagggaagta ccttcctgcc 240 

aggggaagcc aaggctgggc cggccgccct acaaggagcc acaggattgc agccatgggt 300 

gccacctttc atggaagggg agatttatgg gctttcctgg aacccccagg ctgtcctggc 360 

caagaggaaa gaggtggtta cttcaggagt ttgaccttag ttagataact aaaagaatac 420 

atttcccctc ccttttcttt atttcctcaa taaaaatgta caaagtatca cccttctcca 480 

tgccccaatc tgtgttaaag tcacaatcta tgggtgtagt tctgggattc tgtcaaattc 540 

tccttcctgc tctccaaaat ggacaattgt cgtagggacc acatgccccc agaatacaat 600 

ggcctctgtg ttctactggg gtcaagcctg ctagaactca gcattcatga caggggctaa 660 

gtgtgcatga agtgacactg actacagcta gaaagccagg cgcacaaatg ccccttcccc 720 

ccagggccgc tctttccagc gcagtcatcc agaaaggccc acgtgcagag cccctgtgtc 780 

tcagatgctg cttcagttgc ccgtcctgtc ctcagaggcc actgtgctgg ccctctatca 840 

tttgacctga ctttagaacc tgacctcaag gatatggcag cgctagcctt tagctcccac 900 

agcacggatg ggggtgatgc cagttagaag tgggtagtga acgtttgctg agctgttcac 960 

tgtttctctc ttctctttgg aagcacctct ccgagccatg tgagccccct gatgccaccg 1020 

agcaggggca gcttcatgac cgatgtctgg ctgaggctgt ggcggacact ctcggggttg 1080 

tctgcaggag agcaagccag gaggacatgg gcctggacga cacggcctcg cagcaaagtg 1140 

tgtcagacga gcagtgacgg gcgtgcggcc gggcggggag gctggctccc ccacacctcc 1200 

cacctgcatt gctctccctc gtgctcccca aatcaccaca accaaccaat accgcaatcc 1260 

atgagggact cctcctgtgg aaaaggagag ctgttccaga acacagaact gatctcaggt 1320 

ttttgaaaaa aaaaaaaaaa aaa 1343 

<210> 35 

<211> 1089 

<212> BNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (353) . . (353) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (528) . . (528) 
<223> n equals a,t,g, or c 

<400> 35 

cggcacgaga aacgcggtgc ttgctcctcc cggagtggcc ttggcagggt gttggagccc 60 

tcggtctgcc ccgtccggtc tctggggcca aggctgggtt tccctcatgt atggcaagag 120 

ctctactcgt gcggtgcttc ttctccttgg catacagctc acagctcttt ggcctatagc 180 

agctgtggaa atttatacct cccgggtgct ggaggctgtt aatgggacag atgctcggtt 240 

aaaatgcact ttctccagct ttgcccctgt gggtgatgct ctaacagtga cctggaattt 300 

tcgtcctcta gacgggggac ctgagcagtt tgtattctac taccacatag atnccttcca 360 

acccatgagt gggcggttta aagaccgggt gtcttgggat gggaatcctg agcggtacga 420 

tgcctccatc cttctctgga aactgcagtt cgacgacaat gggacataca cctgccaggt 480 

gaagaaccca cctgatgttg atggggtgat aggggacatc cggctcancg tcgtgcacac 540 

tgtacgcttc tctgagatcc acttcctggc tctggccatt ggctctgcct gtgcactgat 600 

gatcataata gtaattgtag tggtcctctt ccagcattac cggaaaaagc gatgggccga 660 

aagagctcat aaagtggtgg agataaaatc aaaagaagag gaaaggctca accaagagaa 720 

aaaggtctct gtttatttag aagacacaga ctaacaattt tagatggtaa ggttcacaaa 780 

taggttgatt tctttcttca gctttctgac atgtccagcc catctctaat gaggactccc 840 

agatcatcac tttatggctg ttaggtgttt cccatatgaa attagaggag ctgggtcagg 900 

gagacaaaag tcttctatta gtcttatgga tagctcctcc ttgagtgtat tttgtgcaaa 960 

agattaagaa gctggactct actgccatta aagctgagag aatcctaagg ttaaaaaaaa 1020 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1080 

aaaaaaaaa 1089 



<210> 36 
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<211> 875 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> (66) . . (66) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (872) . . (872) . 

<223> n equals a,t,g, or c 

<400> 36 

ggcacagcgc gaggctgggt cccggcccag gagaaggaag tcgctgaagg cagtggccat 60 

gctggncgtg gaaatgggag gcggttgcag rgggtctatg gggcccggtc ctggatactc 120 

ggcaggaagc cgtgtctgca gaggctcctc cctgcctcag gtggccccgt tcaaccccag 180 

ccgtgcccat ctcctgccac cgcctgtcgg tgggggttta aattcggtgt ggctttctgg 240 

ggtgcagctc agcacccccc cttatgcaga ctgggagggg gtcgggcagt cccctcagcc 300 

acgaggaccc tggatgggtt ctagttcact tgggaccgtg gggcctggct gcgtactgag 360 

tgggtgcccc acagtcaagg ccaacggggg ctccccctgc tctgagatgt tgggagaaag 420 

gcggcttctg gaaccttccg tgggacccgt aagtggctgt ccagaaaggc gggagggtgg 480 

gcacggggca cggggggcag ctggggtcgt cgttaagggt cacgcatccg tacagttgaa 540 

tttcctttct cttatcatgt tttacccacc ttgtcccttt tttccccaat tgtgcttttg 600 

catttttttc cttggcaaat gtaaactcag cctttcattc atgacgtgtg aaatttcagt 660 
ttctctggag tttgtcagac ggcgtgggaa ccacgcctga aactcaggta ataggaggaa . 720 

aaaaaaaaaa cttaaaaaaa tttttaaaaa acataaaact actctctacc tctgctggsc 780 

cagcctgtct cgccctggcc gcggcagggt ggcctgtaac aatttcagtt ttcgcagaac 840 

attcaggtat taaaaggaaa aaaaaaaaaa anggg 875 

<210> 37 
<211> 320 
<212> DNA 

<213> Homo sapiens 
<400> 37 

agggcgcacg gccaattgat gggcatgatc cttgtgctgg cgagcttcct ggcgcacccg 60 
gtcgaggcgc tcgcgcaagc tgtcgcgctg ggccagcagc aactcgcgct gctcggtgts 120 
carrgccatg ctgtcgaggg cttcctgcaa ttgcagrcgt gcttcgccgr cttgttcgtg 180 
ttcgarggcg cgttgctcgc ccatctcggc cacttcttcg tcgagccggg tgcggcgcag 240 
ggtcagttgc tcgaccttgg ccttgytcgc cgagagctgg gctttcaatt cgccctgctg 300 
gcgcgcttcg tcctgcaaca 320 

<210> 38 
<211> 710 
<212> DNA 

<213> Homo sapiens 
<400> 38 

ggcacgagct gggcctccag gttcttcacc tgtcacatga tcattttaca tattgtggtc 60 

tgtttattta ccatcagcat catagaagag caaaaagaag aaatactgtg ctccactaaa 120 

agccaggctg agaaaacagt tactcacatt gagcagtgag tgaccactag gtgggcattt 180 

gttcatagct gcatggagaa caagtgccca tatacatctt tctgctgatg cagcctctaa 240 

attttgaatg catcagtttt ttaaactgca ttgagcaata ttccgtgggt gtgatccata 300 

atagcgtaac tatttacgcc tgtgacagag aggaaaactg tatggatatc agatatcttt 360 

aagagctttt taatctttaa tcaagttagt acttcttaag gatgattaag gccaggcagt 420 

ggctcacacc tgtaatccca gcattttggg aggccaagat gggtggatcc cttaaggtca 480 

agagttcaag gccatcctgg ccaacatggt gaaaccccat ctctactaaa aatacaaaaa 540 

ttagctgggg tgtggtggca ggcgcctgta accccagcta ctcaagaggc tgagacaaga 600 

gaatcgcttg aagccaggag ttggagattg cagtgagcca agatcatgcc acttcactcc 660 
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agcctggaca gcagagtggg acttcttctt aaaaaaaaaa aaaaaaaaaa 710 

<210> 39 

<211> 1421 

<212> DNA 

<213> Homo sapiens 

<400> 39 

ggcagaggga gcggagagcg tgctaaccaa tgacttgagg gagtaggggg ccgggtttgg 60 

gccctcagtt gctaagggct acccgagtgg gaagcggttc aagagatggg gtgaagggtg 120 

gttcaccggt tcttcaagtc ctcagccttc tggcccgmgg aagttaagca accaagaggc 180 

gggcctaaga ccggaagcag gaaggagggc gcaggaagca gggcgccgca gcctgtcgta 240 

cggtccttct gtgggtctgt cggtgccgag ggcaggatgg agaagctgcg gctcctgggc 300 

ctccgctacc aggagtacgt gactcgtcac ccggccgcca cggcccagct ggagacagca 360 

gtgcggggct tcagttacct gctggcaggt cgattcgccg attcgcacga gctgtcagag 420 

ctggtgtact ctgcctctaa cctgcttgtg ctgctcaatg acgggatcct acggaaggag 480 

cttcggaaaa agttgcctgt gtcgctgtcc cagcagaagc tgctgacatg gctgagcgtg 540 

ctggagtgcg tggaggtgtt catggagatg ggagctgcca aggtgtgggg tgaagtgggc 600 

cgctggcttg tcatcgccct catccagctg gccaaggctg tactgcggat gctcctgctg 660 

ctctggttca aggctggcct ccagacttca ccccctatcg ttccactgga cagagagacc 720 

aggcacagcc cccggatggt gaccacagcc ywggyaacca tgagcagtcc tacgtgggga 780 

agcggtcaaa ccgggtggtg cgaaccctcc agaacacgcc gtccctgcac tccaggcact 840 

ggggagctcc ccagcagcgg gagggacggc agcagcagca tcacgaggag ctgagtgcga 900 

cccccacccc cctggggctt gcaggagacc atcgcagagt ttttgtacat tgcccggccg 960 

ctgctgcact tgctcagcct gggcctktgg ggtcarargt cgtggaaacc ctggctcttg 1020 

gctggtgttg tggacgtgac cagcctgagc ctcctgagtg acagaaaggg cctgacccgg 1080 

arggagcggc gggagctgcg gcgccggamc atcctgctgc tctactacct gctgcgctct 1140 

. cctttctacg accgcttctc cgaggccagg atcctcttcc tgctccagtt gctggccgac 1200 

cacgtccctg gcgttggcct ggtcacaagg ccgctcatgg attacttgcc cacctggcag 1260 

aaaatctact tctacagttg gggctgacag actcccggaa ggagggtgtg gggaggggtg 1320 

ggcagggagc ccctcttccc taataaaact gactccggca gcaaaaaaaa aaaaaaaaaa 1380 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaagggcggc c 1421 

<210> 40 

<211> 3447 

<212> DNA 

<213> Homo sapiens 

<400> 40 

accaattccc ttcctgggag ttgcggcttc cctcgctcgg ccccactccc gtttaccctt 60 

tccccagctc ccgccttagc caggggcttc cccgcctgcc gctagggctc gggccgaagc 120 

gccgctcagc gccagcctgc cgctccccgg gctccacttt cactttcggt cctgggggaa 180 

ctaagccgga ggcagtggtg gtggcggcgg cgcaagggtg agggcggccc cagaacccca 240 

ggtaggtaga gcaagaagat ggtgtttctg cccctcaaat ggtcccttgc aaccatgtca 300 

tttctacttt cctcactgtt ggctctctta actgtgtcca ctccttcatg gtgtcagagc 360 

actgaagcat ctccaaaacg tagtgatggg acaccatttc cttggaataa aatacgactt 420 

cctgagtacg tcatcccagt tcattatgat ctcttgatcc atgcaaacct taccacgctg 480 

accttctggg gaaccacgaa agtagaaatc acagccagtc agcccaccag caccatcatc 540 

ctgcatagtc accacctgca gatatctagg gccaccctca ggaagggagc tggagagagg 600 

ctatcggaag aacccctgca ggtcctggaa cacccccctc aggagcaaat tgcactgctg 660 

gctcccgagc ccctccttgt cgggctcccg tacacagttg tcattcacta tgctggcaat 720 

ctttcggaga ctttccacgg attttacaaa agcacctaca gaaccaagga aggggaactg 780 

aggatactag catcaacaca atttgaaccc actgcagcta gaatggcctt tccctgcttt 840 

gatgaacctg ccttcaaagc aagtttctca atcaaaatta gaagagagcc aaggcaccta 900 

gccatctcca atatgccatt ggtgaaatct gtgactgttg ctgaaggact catagaagac 960 

cattttgatg tcactgtgaa gatgagcacc tatctggtgg ccttcatcat ttcagatttt 1020 

gagtctgtca gcaagataac caagagtgga gtcaaggttt ctgtttatgc tgtgccagac 1080 

aagatgaatc aagcagatta tgcactggat gctgcggtga ctcttctaga attttatgag 1140 

gattatttca gcataccgta tcccctaccc aaacaagatc ttgctgctat tcccgacttt 1200 

cagtctggtg ctatggaaaa ctggggactg acaacatata gagaatctgc tctgttgttt 1260 

gatgcagaaa agtcttctgc atcaagtaag cttggcatca caatgactgt ggcccatgaa 1320 
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ctggcccacc agtggtttgg gaacctggtc actatggaat ggtggaatga tctttggcta 1380 

aatgaaggat ttgccaaatt tatggagttt gtgtctgtca gtgtgaccca tcctgaactg 1440 

aaagttggag attatttctt tggcaaatgt tttgacgcaa tggaggtaga tgctttaaat 1500 

tcctcacacc ctgtgtctac acctgtggaa aatcctgctc agatccggga gatgtttgat 1560 

gatgtttctt atgataaggg agcttgtatt ctgaatatgc taagggagta tcttagcgct 1620 

gacgcattta aaagtggtat tgtacagtat ctccagaagc atagctataa aaatacaaaa 1680 

aacgaggacc tgtgggatag tatggcaagt atttgcccta cagatggtgt aaaagggatg 1740 

gatggctttt gctctagaag tcaacattca tcttcatcct cacattggca tcaggaaggg 1800 

gtggatgtga aaaccatgat gaacacttgg acactgcaga ggggttttcc cctaataacc 1860 

atcacagtga gggggaggaa tgtacacatg aagcaagagc actacatgaa gggctctgac 1920 

ggcgccccgg acactgggta cctgtggcat gttccattga cattcatcac cagcaaatcc 1980 

gacatggtcc atcgattttt gctaaaaaca aaaacagatg tgctcatcct cccagaagag 2040 

gtggaatgga tcaaatttaa tgtgggcatg aatggctatt acattgtgca ttacgaggat 2100 

gatggatggg actctttgac tggcctttta aaaggaacac acacagcagt cagcagtaat 2160 

gatcgggcaa gtctcattaa caatgcattt cagctcgtca gcattgggaa gctgtccatt 2220 

gaaaaggcct tggatttatc cctgtacttg aaacatgaaa ctgaaattat gcccgtgttt 2280 

caaggtttga atgagctgat tcctatgtat aagttaatgg agaaaagaga tatgaatgaa 2340 

gtggaaactc aattcaaggc cttcctcatc aggctgctaa gggacctcat tgataagcag 2400 

acatggacag acgagggctc agtctcagag cgaatgctgc ggagtgaact actactcctc 2460 

gcctgtgtgc acaactatca gccgtgcgta cagagggcag aaggctattt cagaaagtgg 2520 

aaggaatcca atggaaactt gagcctgcct gtcgacgtga ccttggcagt gtttgctgtg 2580 

ggggcccaga gcacagaagg ctgggatttt ctttatagta aatatcagtt ttctttgtcc 2640 

agtactgaga aaagccaaat tgaatttgcc ctctgcagaa cccaaaataa ggaaaagctt 2700 

caatggctac tagatgaaag ctttaaggga gataaaataa aaactcagga gtttccacaa 2760 

attcttacac tcattggcag gaacccagta ggatacccac tggcctggca atttctgagg 2820 

aaaaactgga acaaacttgt acaaaagttt gaacttggct catcttccat agcccacatg 2880 

gtaatgggta caacaaatca attctccaca agaacacggc ttgaagaggt aaaaggattc 2940 

ttcagctctt tgaaagaaaa tggttctcag ctccgttgtg tccaacagac aattgaaacc 3000 

attgaagaaa acatcggttg gatggataag aattttgata aaatcagagt gtggctgcaa 3060 

agtgaaaagc ttgaacgtat gtaaaaattc ctcccttgcc aggttcctgt tatctctaat 3120 

caccaacatt ttgttgagtg tattttcaaa ctagagatgg ctgttttggc tccaactgga 3180 

gatacttttt tcccttcaac tcattttttg actatccctg tgaaaagaat agctgttagt 3240 

ttttcatgaa tgggctatcg ctaccatgtg ttttgttcat cacaggtgtt gccctgcaac 3300 

gtaaacccaa gtgttgggtt ccctgccaca gaagaataaa gtaccttatt cttctcattt 3360 

tatagtttat gcttaagcac ccgtgtccaa aaccctgtac cccatgttta fccattcataa 3420 

actgtttcat cagtctcaaa aaaaaaa 3447 

<210> 41 

<211> 3037 

<212> DNA 

<213> Homo sapiens 

<400> 41 

aattcggcag agcctaggag gagaaagttc catcatgtcg gagatcagag gaaaacccat 60 

tgagtccagc tgtatgtatg gcacctgctg cctctgggga aagacttatt ccatcggatt 120 

tctgaggttc tgcaaacagg ccaccctgca gttctgtgtg gtgaagccac tcatggcggt 180 

cagcactgtg gtcctccagg ccttcggcaa gtaccgggat ggggactttg acgtcaccag 240 

tggctacctc tacgtgacca tcatctacaa catctccgtc agcctggccc tctacgccct 300 

cttcctcttc tacttcgcca cccgggagct gctcagcccc tacagccccg tcctcaagtt 360 

cttcatggtc aagtccgtca tctttctttc cttctggcaa ggcatgctcc tggccatcct 420 

ggagaagtgt ggggccatcc ccaaaatcca ctcggcccgc gtgtcggtgg gcgagggcac 480 

cgtggctgcc ggctaccagg acttcatcat ctgtgtggag atgttctttg cagccctggc 540 

cctgcggcam gccttcamct acaaggtcta tgctgacaag aggctggacg cacaaggccg 600 

ctgtgccccc atgaagagca tctccagcag cctcaaggag accatgaacc cgcacgacat 660 

cgtgcaggac gccatccaca acttctcacc tgcctaccag cagtacacgc agcagtccac 720 

cctggagcct gggcccacct ggcgtggtgg cgcccacggc ctctcccgct cccacagcct 780 

cagtggcgcc cgcgacaacg agaagactct cc tgctcagc tctgatgatg aattctaggt 840 

gcgggctgca gtggcggaag tgctggcgcc atagccacgg tcaggctgtg ccccacctcc 900 

agcctcacca ccaggccagg aggcagctgg cacagtgctc acgccgcctt tatttattgg 960 

accagaaaca ctcacatgtc gcttccagag gaacggggga cagccaggct cgcccatggg 1020 

ccttcaggaa tatttataca tggcccagcc tgcactgccc gggcgagggc agaggacact 1080 



21 



t 



WO 027076488 PCT/US02/08276 

gggagcaagg cttatgcccc tgctgcccgt cctgtgctgg gggcatgctg ggaccagccg 1140 

cacccaggcc ccaatgcttg tgtgtggacc agcggctgca gccttctagc ccctcctccc 1200 

cgcgagactc tcaggctgag gtcggcaagc cgtggctccc ccacacaccg tgcaataccc 1260 

tgtctgacct gggctcttcc cgcctgcatc cctyccctgt ccacctttgt ccagtgctag 1320 

attcacctca ccccgggcag gagtggggat gtgggcgctc tgtggtcctc ccctcctgac 1380 

ccaggcctct gtggcatgct gcaaggatca gagccagaca ccaggagtca caggccccac 1440 

ccaggaaggg cattcagggc ccctgggcac cgcttctgtt gaagcagggg cttctgggcc 1500 

cctgggtatc cccacctgtc gtggccacac ctctgcctgc ctcatgcccc ttccccctgg 1560 

cctaccaagg acagcccaca gcccgcactg ccggctcact tgggtccttc ctcgatagct 1620 

ttgggcagag cccttgcttc ctggctgctt cagggctcag gggctcccag ccctccttcc 1680 

caggctgatg ctgggtcctc tctctctttg gggcttctcc ctcccgtttc aggggaaagg 1740 

tctgagtctc cacgtttcag accagcttct gggggaaggc agtccggcag ggagaccggg 1800 

aggggtggcc acacagtggg gagctgggag gtggggggaa tggtcccaga ctcctctcgg 1860 

ggcccctatc cacacagggc ctggtgttct accccatctg gcccctggcc catctcttct 1920 

gtgccttagt cacatatgaa agcgcccctc cctggctccc catctgtccc acacgctccc 1980 

tggggctctt agttcagctg ctggcactcg caggatcctg cagtgctggg cccagagccc 2040 

ttggacaggc ctcaggagtg gtcaggacca ccaagcccct cctctccccc tccacacctc 2100 

tagacctggg gcctccggaa cccccagcag gctgggctta tactagctcc tgacttagga 2160 

agagcctcgt gtcacaacac gtgtccctac aggcaaagtg tcctggcatt taaaacccag 2220 

attatccctg ggtttgggct gcagtcacct ggagaagctg gtagggtaag ggagagggac 2280 

cctgccggtg ttcactgggg attctttctt ttggtccttc ctggaatgaa caggttccct 2340 

ccctgccacc tgtgaggaga gttggggccc agccgtcttc ctggcctcct tcctttcctc 2400 

gtggcagagg cctgcatgtg ggtgccagag gccagctctc cccctccatc ttgggggggc 2460 

ggagcagttg ggcccaagct gcccgggagg gtgggtgcag acacaggctg aggaccagcc 2520 

ctggccctgc cccgccatct gctttcacca agctgtctct ccaccgtggc ttcccttctc 2580 

cctccaggcc aaagtgctgc tgattcccac tcccttggtt ttcgcctgcc cagcgttgct 2640 

gtttgcgtgg agggtggggg gagctcagtg gcagggaatc agcggtccgt ggggtcgtgg 2700 

ggacgggaac atgtgcccga ccgctccatc ccctcctcct ccttaggatg cataacctac 2760 

cttgtctttt tttttttaaa ttttctttcc aggtagagta gctctttgta cataaagaat 2820 

acttgaaaaa ttaattgtat gatgtatgag aagacagagt ctcctagttt tgtatcttgt 2880 

tgtatgactg ccatgagttc caccagaaag ccactctatt ttggtctctg tgacatttta 2940 

aatgcgtgac agaagtgagc aaataaagtg aggaagaaat ctaaaaaaaa aaaaaaaaaa 3000 

aaaaaaaaaa aaaaaaaaaa aaaaaaaggg cggccgc 3037 

<210> 42 

<211> 767 

<212> DNA 

<213> Homo sapiens 

<400> 42 

catgaaaaca cattctctta tagtttttaa attcatcatc caagagttcc tgctctttga 60 

tgatgagaca tacctggtag actccaaaac agagagcaga cgcctagtat ctttgttctg 120 

gggtgtgcat taagagtaca ttgacctgtc tgtctccagt cttgactctt ttggaagaga 180 

gatgctagta ctgatgacaa cctgcattct ggctgcggtg tgygtccaca ctgcacagtg 240 

tgcaccagac tctcgtatgg acaatgactg tccctcacat caggcgcaga tccattttag 300 

agcctcagaa gtcaggagag ggtggacttt caaccacgac tgaaaacact gtctttctta 360 

ggacatgctg tgtgtatgac acacttacag atgtctgtgc tcactgatgc ttgttgatgt 420 

gtcatcgcac atcagtgaca aacatttgtc atgtttttgc ctttggtgga acttctttat 480 

tatactcact ttcctcccaa accatttttc tcaacttcat catgaagcaa atgtcatgtg 540 

gtcattctgt gatggggctc agggctaggt taggtgatga tttctgaaag ctcagagacg 600 

tgaaggaaaa aggacatcag tgcttggatc ttagctctta taagcctcac gtgcaacaat 660 

aaacccgagt tcaagaatca gattcttaga tagattggtt tggtagcaaa tgacaaaaaa 720 

ccaacgtaaa tatgcttcgg caaaaaaaaa aaaaaaaaag ggcggcc 767 

<210> 43 
<211> 1057 
<212> DNA 

<213> Homo sapiens 
<400> 43 

tcgacccacg cgtccgctga gattacaggt gtgagccacc aggctcagcc ccctaagatt 60 
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tgaaacactt taaatggccc atggtagggt tcctgctagg ataaaacatt aagcggctgt 120 

taaaagaaat aaaaggagga cacgtctctg tgcactggtg tggacaaatc tccaagtcac 180 

tgcaaaatgg aaaaagtata agatgctctt tccctgaacc tcaagggtcc cgcccctctc 240 

actttcaggt ctctggacct ctgactgaca ctgtgcctgc ccaggtccct gtatgcactg 300 

ccacagtgcc ctgggcccca tgtccacccc tgtcctgccc ttctctggga tagggctggc 360 

cttcctctgc ctctgcctgg ctgcatccat ggtcgatctc aagtgccttg gcatgaactc 420 

cactctcctg cagccttcaa tcaaggaatg atggggatgt gtacataccc caccccaccc 480 

cttggcaggg tgatgctgag gtgtggattt ttaacagttc ccagactttc ccaggaggct 540 

tgggtttggg tgcccacagt gggagctggt gtgatatcat accttcgccg gccgcctttc 600 

cttcctgttc tctgtgcccc tactcccact ctagagctgc cccgtttctc tgttttcgtg 660 

aaagagctga ccctgtgctg cctcccactc tcccaatgcc cctgccactc ctgtgagcct 720 

gctgctggtg aggtcggtgc tgacctctgt gttgctggat aatgagtcat ctatctctgg 780 

aggagaagaa aggcaggtcc tccacagccc tgataaaatc tccaagtctc ccagtttcgg 840 

gtccctctcc tgggatgcag acccactgcc tgcccagctg gtacgatcca catgccctct 900 

tcttgggaat aggggcatgg gaaagtgact aaagatactg ttctggctgc tgtgttcact 960 

gtgagtaata aactgtccat ttctccgaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1020 

aaaaaaaaaa aaaaaaaaaa aaaaaaaggg cggccgc 1057 

<210> 44 
<211> 2687 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (1614) . . (1614) 
<223> n equals a,t,g, or c 

<400> 44 

gtacaccatg ggcctccacc tccgccccta ccgtgtgggg ctgctcccgg atggcctcct 60 

gttcctcttg ctgctgctaa tgctgctcgc ggacccagcg ctcccggccg gacgtcaccc 120 

cccagtggtg ctggtccctg gtgatttggg taaccaactg gaagccaagc tggacaagcc 180 

gacagtggtg cactacctct gctccaagaa gaccgaaagc tacttcacaa tctggctgaa 240 

cctggaactg ctgctgcctg tgcatcattg actgctggat tgacaatatc aggctggttt 300 

acaacaaaac atccagggcc acccagtttc ctgatggtgt ggatgtacgt gtccctggct 360 

ttgggaagac cttctcactg gagttcctgg accccagcaa aagcagcgtg ggttcctatt 420 

tccacaccat ggtggagagc cttgtgggct ggggctacac acggggtgag gatgtccgag 480 

gggctcccta tgactggcgc cgagccccaa atgaaaacgg gccctacttc ctggccctcc 540 

gcgagatgat cgaggagatg taccagctgt atgggggccc cgtggtgctg gttgcccaca 600 

gtatgggcaa catgtacacg etc tact ttc tgeageggea gccgcaggcc tggaargaca 660 

agtatatccg ggccttcgtg tcactgggtg cgccctgggg gggcgtggcc aagaccctgc 720 

gcgtcctggc ttcaggagac aacaacegga tcccagtcat cgggcccctg aagatccggg 780 

ageagcageg gtcagctgtc tccaccagct ggctgctgcc ctacaactac acatggtcac 840 

ctgagaaggt gttcgtgcag acacccacaa tcaactacac actgegggae tacegcaagt 900 

tcttccagga categgcttt gaagatggct ggctcatgcg gcaggacaca gaagggctgg 960 

tggaagecac gatgccacct ggcgtgcagc tgcactgcct ctatggtact ggcgtcccca 1020 

caccagactc cttctactat gagagcttcc ctgaccgtga ccctaaaatc tgctttggtg 1080 

aeggegatgg tactgtgaac ttgaagagtg ccctgcagtg ccaggcctgg cagagccgcc 1140 

aggagcacca agtgttgctg caggagctgc caggcagega gcacatcgag atgctggcca 1200 

acgccaccac cctggcctat ctgaaacgtg tgctccttgg gccctgactc ctgtgccaca 1260 

ggactcctgt ggctcggccg tggacctget gttggcctct ggggctgtca tggcccacgc 1320 

gttttgcaaa gtttgtgact caccattcaa ggccccgagt cttggactgt gaagcatctg 1380 

ccatggggaa gtgctgtttg ttatcctttc tctgtggcag tgaagaagga agaaatgaga 1440 

gtctagactc aagggacact ggatggcaag aatgctgctg atggtggaac tgctgtracc 1500 

ttaggactgg ctccacaggg tggactggct gggccctggt cccagtccct gcctggggcc 1560 

atgtgtcccc cctattcctg tgggcttttc atacttgect actgggccct ggcncsgcag 1620 

ccttcctatg agggatgtta ctgggctgtg gtcctgtacc cagaggtccc agggategge 1680 

tcctggcccc tegggtgace cttcccacac accagccaca gataggectg ccactggtca 1740 

tgggtagcta gagctgetgg cttccctgtg gcttagctgg tggccagcct gactggcttc 1800 

ctgggcgagc ctagtagctc ctgcaggcag gggcagtttg ttgcgttctt cgtggttccc 1860 

aggccctggg acatctcact ccactcctac ctcccttacc accaggagca ttcaagctct 1920 
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ggattgggca gcagatgtgc ccccagtccc gcagctgtgt tccaggggcc ctgatttcct 1980 

cggatgtgct attggcccca ggactgaagc tgcctccctt caccctggga ctgtggttcc 2040 

aaggatgaga gcaggggttg gagccatggc cttctgggaa cctatggaga aagggaatcc 2100 

aaggaagcag ccaaggctgc tcgcagcttc cctgagctgc acctcttgct aaccccacca 2160 

tcacactgcc accctgccct agggtctcac tagtaccaag tgggtcagca cagggctgag 2220 

gatggggctc ctatccaccc tggccagcac ccagcttagt gctgggacta gcccagaaac 2280 

ttgaatggga ccctgagaga gccaggggtc ccctgaggcc cccctagggg ctttctgtct 2340 

gccccagggt gctccatgga tctccctgtg gcagcaggca tggagagtca gggctgcctt 2400 

catggcagta ggctctaagt gggtgactgg ccacaggccg agaaaagggt acagcctcta 2460 

ggtggggttc ccaaagacgc cttcasgctg gactgagctg ctctcccaca gggtttctgt 2520 

gcagctggat tttctctgtt gcatacatgc ctggcatctg tctccccttg ttcctgagtg 2580 

gccccacatg gggctctgag caggctgtat ctggattctg gcaataaaag tactctggat 2640 

gctgtaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaag ggcggcc 2687 

<210> 45 
<211> 728 
<212> DNA 

<213> Homo sapiens 
<400> 45 

aaatgattta gtgacctata caagtagcct gcagtaccgg atccgaattc ccggtcgacc 60 

cacgcgtccg gtgaaaacag cagagtgcta ctccatacca ctgggatctt gtccagtaaa 120 

catccagaga gtgaggttag gaaataaaaa gtatataaat attagatgcc tagaaatgca 180 

agtcacttta aagattttat gtgaaataga aaaaaaagag aggagaggga ctcattgtct 240 

tgtaatgggt ccttcccaga gagaggtgac tgtccagtgg caccgggccc ttttcctcct 300 

tcccctttta ctcttatcaa ctaggacaga aactaagaat tttggcttca agtggctaaa 360 

agactgatgg gggaaaaaag aaaatagaaa aaaataacag agagactgac gctctaggca 420 

gttacaagtc caagaaaaaa gacagaaact tttaagtatt gagccaaaac caggtctagc 480 

aamcataatg ctggccctag attatttatt aatttatgaa gaaacttcta gatatggggg 540 

tgacaaaagg aaattaaatc cattatatat gcatatattt taatgtaaat atataataga 600 

taaattatgt atacataata tataaccaaa ttgaaacagt tttacaattt ggtttgactg 660 

gaaattcaaa atccatatat taatttttgt agtaaaagtt tatgtaaaaa aaaaaaaaaa 720 

gggcggcc 728 

<210> 46 
<211> 1635 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<222> (85) . . (85) 

<223> n equals a,t,g, or c 

<400> 46 

tcgacccacg cgtccgcagc tctccgtgga caatgggctg tggcgtgtga ccctgtgcat 60 

gctggccttc ccgctgctcc tcacnggcct catctccttc agggagaaga ggctgcagga 120 

tgtgggcacc cccgcggccc gcgcccgtgc cttcttcacc gcacccgtgg tggtcttcca 180 

cctgaacatc ctctcctact tcgccttcct ctgcctgttc gcctacgtgc tcatggtgga 240 

cttccagcct gtgccctcct ggtgcgagtg tgccatctac ctctggctct tctccttggt 300 

gtgcgaggag atgcggcagc tcttctatga ccctgacgag tgcgggctga tgaagaaggc 360 

agccttgtac ttcagtgact tctggaataa gctggacgtc ggcgcaatct tgctcttcgt 420 

ggcagggctg acctgcaggc tcatcccggc gacgctgtac cccgggcgcg tcatcctctc 480 

tctggacttc atcctgttct gcctccggct catgcacatt tttaccatca gtaagacgct 540 

ggggcccaag atcatcattg tgaagcggat gatgaaggac gtcttcttct tcctcttcct 600 

gctggctgtg tgggtggtgt ccttcggggt ggccaagcag gccatcctca tccacaacga 660 

gcgccgggtg gactggctgt tccgargggc cgtctaccac tcctacctca ccatcttcgg 720 

gcagatcccg ggctacatcg acggtgtgaa cttcaacccg gagcactgca gccccaatgg 780 

caccgacccc tacaagccta agtgccccga gagcgacgcg acgcagcaga ggccggcctt 840 

ccctgagtgg ctgacggtcc tcctactctg cctctacctg ctcttcacca acatcctgct 900 

gctcaacctc ctcatcgcca tgttcaacta caccttccag caggtgcagg agcacacgga 960 
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ccagatttgg aagttccagc gccatgacct gatcgaggar taccacggcc gccccgccgc 1020 

gccgcccccc ttcatcctcc tcagccacct gcagctcttc atcaagaggg tggtcctgaa 1080 

gactccggcc aagaggcaca agcagctcaa gaacaagctg gagaagaacg aggaggcggc 1140 

cctgctatcc tgggagatct acctgaagga gaactacctc cagaaccgac agttccagca 1200 

aaagcagcgg cccgagcaga agatcgagga catcagcaat aaggttgacg ccatggtgga 1260 

cctgctggac ctggacccac tgaagaggtc gggctccatg gagcagaggt tggcctccct 1320 

ggaggagcag gtggcccaga cagcccgagc cctgcactgg atcgtgagga cgctgcgggc 1380 

cagcggcttc agctcggagg cggacgtccc cactctggcc tcccagaagg ccgcggagga 1440 

gccggatgct gagccgggag gcaggaagaa gacggaggag ccgggcgaca gctaccacgt 1500 

gaatgcccgg cacctcctct accccaactg ccctgtcacg cgcttccccg tgcccaacga 1560 

gaaggtgccc tgggagacgg agttcctgat ctatgaccca cccttttaca cggcagagag 1620 

gaaggacgcg gccgc 1635 

<210> 47 

<211> 4893 

<212> DNA 

<213> Homo sapiens 

<400> 47 

ccacgcgtcc gtgagaagat aatcctgaga ggctgcatcc tgagaaatac cagctggtgt 60 

tttggaatgg ttatttttgc aggtcctgac actaaactaa tgcagaatag tggtaagaca 120 

aagtttaaaa ggacaagcat tgatagattg atgaatactc tagtactatg gatttttggg 180 

tttctgatat gcttgggaat tattcttgca ataggaaatt caatctggga gagtcaaact 240 

ggggaccaat tcagaacttt cctcttttgg aatgaaggag agaagagctc tgtgttctcc 300 

ggattcttaa cattctggtc atatattatt attctcaata cagttgtacc catttcctta 360 

tatgtgagtg tggaagtaat tcgtctagga cacagttatt ttataaactg ggaccggaag 420 

atgtattatt ctcgaaaagc aatacctgca gtggctcgaa cgaccacgct caatgaggaa 480 

ctggggcaga ttgagtacat tttctccgac aaaacgggta ccctcactca aaacatcatg 540 

acctttaaaa gatgttccat taatgggaga atctatggtg aagtacatga tgacctggat 600 

cagaagacag aaataactca ggaaaaagag cctgtggatt tctcagtcaa atctcaagcg 660 

gatagagaat ttcagttctt tgaccacaat ctgatggaat ccattaaaat gggtgatccc 720 

aaagttcatg aattccttag gttacttgct ctctgccaca ctgtaatgtc agaagagaat 780 

agcgcaggag agctgattta ccaagttcag tcacctgatg aaggggctct agtgactgcc 840 

gctagaaatt ttgggttcat ttttaaatcc cggaccccag agaccataac aatagaagaa 900 

ttgggaacac tagttactta tcaattactt gcctttttgg atttcaacaa caccagaaaa 960 

aggatgtctg tcatagttcg aaacccagaa ggacagataa agctttattc caaaggagca 1020 

gatactattc tgtttgagaa acttcatcct tccaatgaag tccttttgtc tttgacgtca 1080 

gaccacctca gtgaatttgc aggggaaggc cttcggacct tggccatcgc atacagagac 1140 

ctggatgaca agtactttaa agagtggcat aagatgcttg aagatgcgaa tgttgccaca 1200 

gaagagaggg atgaacgaat agctgggcta tatgaagaaa ttgaaagaga tttgatgcta 1260 

ctaggtgcca ctgctgtaga agataagtta caggagggtg ttattgaaac agttacaagt 1320 

ttatcactag ccaatattaa gatctgggtc ctaacaggag acaaacaaga aactgccatc 1380 

aacatcggtt atgcctgcaa catgctgact gacgacatga atgatgtgtt tgtgatagca 1440 

gggaataatg ctgtggaagt gagagaagaa ctcaggaaag caaaacaaaa tttgtttgga 1500 

caaaacagaa atttttccaa tggccatgta gtttgtgaaa aaaagcagca gctggagttg 1560 

gattctattg tagaagaaac cataacagga gattatgcct taatcataaa tggccacagt 1620 

ttggctcatg ccctagaaag tgatgtcaag aatgatctcc tagaacttgc ttgcatgtgt 1680 

aagactgtaa tttgctgcag ggtcactcca ctccagaaag cccaagtggt agagctggtg 1740 

aagaagtaca gaaatgctgt tactttggcc attggtgatg gagccaatga tgtcagcatg 1800 

attaaaagtg ctcacattgg . tgttggcatc agcggccagg aaggattgca agcagtctta 1860 

gccagcgact attcatttgc acagtttaga tatctccaaa ggcttctcct tgttcatgga 1920 

aggtggtctt atttccgaat gtgcaaattc ttatgctatt tcttctataa gaattttgca 1980 

tttacacttg tgcatttctg gtttggtttc ttctgtggtt tctcagccca gactgtttat 2040 

gaccagtggt tcatcaccct ttttaacatt gtttacacat cactgcctgt tttagccatg 2100 

gggatttttg accaggatgt gagtgaccag aacagcgtgg actgtcccca gctctacaaa 2160 

ccaggacagc tgaatctgct ttttaacaag cgtaaatttt tcatttgcgt gatgcatgga 2220 

atctacacct cattagtcct tttcttcatc ccctatgggg ccttttacaa cgtggctgga 2280 

gaagatgggc aacatattgc tgactaccag tcctttgcag ttaccatggc cacatctttg 2340 

gtcattgtgg tcagtgtgca gatagccttg gataccagtt actggacttt cattaatcac 2400 

gtcttcatct gggggagcat tgccatttat ttctccattt tatttacaat gcacagtaat 2460 

ggcatctttg gcatcttccc aaaccagttt ccatttgttg gtaatgcacg acattccctg 2520 
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acccagaagt gcatctggct tgtaattctc ttaacaacag tggcttcagt tatgccagtg 2580 

gtggcattca gatttttgaa ggtggattta tacccaaccc tgagtgatca gatccgccgg 2640 

tggcagaagg ctcaaaagaa ggcaaggcct ccaagtagcc gaaggcctcg gacccgcagg 2700 

tcaagctcaa gaaggtctgg atatgctttt gctcaccaag aaggctatgg agagcttatc 2760 

acatctggaa aaaatatgcg agctaaaaat ccacccccaa catcagggct ggaaaagaca 2820 

cattataata gcactagctg gattgaaaat ttatgtaaga aaaccacaga caccgtgagc 2880 

agctttagcc aggataaaac agtgaaactg tgagtcaata tgaatttaaa ccacgtagtt 2940 

atcttttcac ttcaggtgga gctgaaattc tgctggctcc agagtttgag atttgaggca 3000 

agaggtgggg caggcagatt gcctcactta acttaaatct gcggcagaca actgccagtg 3060 

cccatcaaac aggagtgtgc gctatggaaa accaggccag agggtcactg tctggtttgt 3120 

gatttggtgg acaaaacact cgctgttaca agtacagatt tttttttttt ttaaatcaac 3180 

ctagatacca attgacctga actttagaat cttatttatg gagaaaaact tgtaaagctg 3240 

catattcact gaatggatcc tcaggcggat aaaagggtgc attttaaagg tatatatcca 3300 

agctgaaaag catgcctatt gacagataaa catgtatctg taagatcagc ctttcccaag 3360 

gtatactttt aaaatttaaa gcgtgtactg tgttgctttc agactgagtt gcatgtcact 3420 

ctttagtctt gatatctacc tgtctgttca gccaggacaa caaatggctt ccaagcctga 3480 

agaatacaaa agtgtgcttg tgtttctcat ttttatacca gtctagggac aaaggagact 3540 

gaacatcttt gcagcaggat aggctggtaa tttgatcaaa tttattcaaa aagctctcag 3600 

tctgtgtcat gtaaggacat gcttatgaaa tgtgagagag gctcgccact aagtattcta 3 660 

aatacttttc aatggctttt ctaacaacct cagtagtaat ttgctgagca tcatccagac 3720 

cattaataga atcagcaaag cactggaatt tcacacttta atgataatat tccacatagt 3780 

ctatgggcaa atattttcaa catttccaat ttttaaagct tcagaattga agccaaacaa 3840 

attaataaat aattgtttta attactattt aaaaactcag gtttagattg tttaaaatta 3900 

gttgcttttg atactcagct gtcatgttta taattcaaac atgtagtaaa catatgtagg 3960 

taaggttgtt tttttggaga tgttgcagct caaatttcag tccacatatg aatcatcagt 4020 

gtattttcca taaagtgatt cgggcatatt tgtgtgaaaa cctcagttct gtcacttctt 4080 

acctctataa acttggacga taatgtgcct tctctgagac tcagtttctt cctctgtaaa 4140 

atgaggacat actacctacc tcacgtggtt ggttgatgat tgtctgtcaa agcacaaact 4200 

ctgaaattat taaaaacata attatttcat aaacagatga gttaagttcc agttaactca 4260 

acatcagtat aacagagcaa ttggaagaga atatgaaaaa actggaatct aaatagtcag 4320 

tgaggaaggc tttgataaaa tgaaattgcc agaaagatat aaaactggtt agggtcctac 4380 

agggaaataa aattataacc gtggaggtac atttctctac cagaaagcaa aaataaagca 4440 

tcatgtctta atggttttct acaaatcaac ttctaattct acagagtcct taatctggtc 4500 

cctattaaat tcttggtcag acaaagttac atttcccaag agagtcaggt gacacttgag 4560 

tgagtttgat ggataatgag ctaatgtgat atctataggt cacaattttt taaaaccaaa 4620 

attttcaagt ctgggataat ctttcctaaa tgggatcaaa tgaaataata tgtgtaaaag 4680 

agtcaaatgc agtcctttac catagtaact gcctatggac gttgtctttc ccttacatgc 4740 

ctgcctacac ttaaccagat gttggttttc aatgtctaat ttgtcattag tttcaccaca 4800 

tttgctcact ttttgtaaca tttttgcaag atttgaaaac tttcagtaaa tgttttggca 4860 

ctattggtaa aaaaaaaaaa aaaaaaaaaa aaa 4893 

<210> 48 
<211> 1655 
<212> DNA 

<213> Homo sapiens 
<400> 48 

ccacgcgtcc gggcaaagaa ttaaacctgg tgtttggact tcaacttagc atggctagaa 60 

ttggaagtac agtaaacatg aacctcatgg gatggctgta ttctaagatt gaagctttgt 120 

taggttctgc tggtcacaca accctcggga tcacacttat gattgggggt ataacgtgta 180 

ttctttcact aatctgtgcc ttggctcttg cctacttgga tcagagagca gagagaatcc 240 

ttcataaaga acaaggaaaa acaggtgaag ttattaaatt aactgatgta aaggacttct 300 

ccttacccct gtggcttata tttatcatct gtgtctgcta ttatgttgct gtgttccctt 360 

ttattggact tgggaaagtt ttctttacag agaaatttgg attttcttcc caggcagcaa 420 

gtgcaattaa cagtgttgta tatgtcatat cagctcccat gtccccggtg tttgggctcc 480 

tggtggataa aacagggaag aacatcatct gggttctttg cgcatagcag ccactcttgt 540 

gtcccacatg atgctggcct ttacgatgtg gaacccttgg attgctatgt gtcttctggg 600 

actctcctac tcattgcttg cctgtgcatt gtggccaatg gtggcatttg tagttcctga 660 

acatcagctg ggaactgcat atggcttcat gcagtccatt cagaatcttg ggttggccat 720 

catttccatc attgctggta tgatactgga ttctcggggg tatttgtttt tggaagtgtt 780 

cttcattgcc tgtgtttctt tgtcactttt atctgtggtc ttactctatt ggtgaatcgt 840 
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gcccagggtg ggaacctaaa ttattctgca agacaaagga agaaataaaa tttcccatac 900 

tgaatgagaa gttaaaatga atgtgtcaga gaatgggctt aacacatcgt tggtttgaaa 960 

acttccattt taaaaattta gagtttagtc attagaaaaa ataatggact ggaaagttat 1020 

atttatatcc aaatatacct atttcaaagt gtatttgtga ggcctgtttt agcctgtgtc 1080 

ttttgtattg tgtgttgcta aagaattcta cttttagtag gctaatcaac aatgaaaggg 1140 

ttagaaaatt gctgtggaac atccaggtga acttcaggaa agacagtgaa aaatggaaaa 1200 

cgttggagct tctgttgaga taatcttcat taggtatata tcttagggat acagcctttt 1260 

ctttatctta tagcaggaaa aaaaaacttt tgagggaaat agaagggctg cgttacacaa 1320 

aataaacaat ggcattgtca taggccttcc ttttactagt agggcataat gctagggaat 1380 

atgtgaagat gtttttttga agtctctttc tgatcacgaa caatagcttg cgctctactc 1440 

tgtagttatg tggattgccg agcaatgacc cttttcaatt tcttatttct gtgttactga 1500 

ggaccctaat cacttaggga tgtaatttta tagtataaac tttctgtaca gtttttctta 1560 

tagtctaata agtaaaaagt gtccttcaaa ttatgataat tgcctatgta catggataaa 1620 

ttaaaacact gcacacggaa aaaaaaaaaa aaaaa 1655 

<210> 49 
<211> 6297 
<212> DNA 

<213> Homo sapiens 
<400> 49 

ccacgcgtcc ggtcagcttt catctcgtcc tatctttgtt caggcaaact tctctagttc 60 

tgttttaata ggcatatttg ttaggtctgt tttttgaaat cctctttttt acattgttta 120 

aagataatgc cttggctaaa aagcctgctt cacttttccc tgtttttagt tgttttctcc 180 

acattggcag taaagagcct tggcgtccca gtagcagcag gttctccttt ttgtattgtg 240 

gatgttttgc atttcatact gttgtgaaga gtggctttga tcatacatgt tgttggtata 300 

tttgcctttt tgctgggggt gtgagaagaa ccagagatga gcagaggtac acccagtaga 360 

cttcccagcc tgcagagcct cccgggaaga gcttccgtgt tcaggtgctt ggggccccac 420 

cctaggagcc tgactcacag tcagagcagg gtcccggctt gtgttcagga ttttgaaaca 480 

tttgtaaggt gattttgttg tttctacacc tttctcctca tctttttttt tttgtagtta 540 

atcgttacta ataacagaaa agacattttt ggcatggtaa ttggcacaaa gtgaataatt 600 

gttgaataga tgacttttga ggctttcaaa attcgagtgt ccataaaatc catccagagc 660 

cacctggttc ctttttttga accacttaac gtaattctgg aaaaccttga ctgtgggtct 720 

taagtttggt ggattgctgc ttctcactgg ctgacctttg gaggtcgcat atttcaggat 780 

gtgattccac ttaggctcca tttcacctga cactgcaatt ctgtgccttc agagggattt 840 

gttattgcga atgatgtgga caacaagcgc tgctacctgc tcgtccatca agccaagagg 900 

ctgagcagcc cctgcatcat ggtggtcaac catgatgcct ccagcatacc caggctccag 960 

atagatgtgg acggcaggaa agagatcctc ttctatgatc gaattttatg tgatgtccct 1020 

tgcagtggag acggcactat gagaaaaaac attgatgttt ggaaaaagtg gaccacctta 1080 

aatagcttgc agctacatgg cttacagctg cggattgcaa cacgcggggc tgaacagctg 1140 

gctgaaggtg gaaggatggt gtattccacg tgttcactaa accctattga ggatgaagca 1200 

gtcatagcat ctttactgga aaaaagtgaa ggtgctttgg agcttgctga tgtgtctaat 1260 

gaactgccag ggctgaagtg gatgcctgga atcacacagt ggaaggtaac ctttcctcga 1320 

gaactttcat tctaaagagt aggtgcagca tcactgaagt agagtcaagt ttcaaagcat 1380 

tcacgtgtga gtaacttgaa taaatactac atctggttat gccaattaga atcaattfccg 1440 

gagtgttatt tcatgacaca tttcatgaca agtggcatgt ttattcctgg - cagtggaaaa 1500 

gttttttttt ctccacgtac agaaataaac tcttttactc tcatccctgt aagggtagct 1560 

ttgctttttt tttttttttt taaattgggc cgggattcaa gccttgtttc caatatgaag 1620 

taattcatta caattttagg ccagaaacag cctgaggctt gtttaaaaag aaaaaaacta 1680 

gatggaaaat gttattttat aatgcttgtc ctggttttta gaataaatgt atttcatctt 1740 

tgtttttaac agaatttatg tattaataat ttggggattt tctgtaaaga ttgttttgtt 1800 

ttgtcttggc aaataatctt cctatctttg gagtgaatga gaatcaccat ttgtcacctt 1860 

tgagagaatg gatactcctg ccctgtgatt tttgttggta ttggatagtg ctagtaatct 1920 

ggaatgtacc ctgtggttct gcaggtaatg acgaaagatg ggcagtggtt tacagactgg 1980 

gacgctgttc ctcacagcag acacacccag atccgaccta ccatgttccc tccgaaggac 2040 

ccagaaaagc tgcaggccat gcacctggag cgatgcctta ggatattacc ccatcatcag 2100 

aatactggag ggttttttgt ggcagtattg gtgaaaaaat cttcaatgcc gtggaataaa 2160 

cgtcagccaa aggtgagttt ttctttttcc aaaatgacat aacatttgat cttgtacatt 2220 

taagacaaaa actaaccgga gtttagtaga agtacagagg aaagaggagc ttcttgctgt 2280 

gggcagcagg agagctgacc ctgaatgagg gggaatttca tttaaatatc aagttttcca 2340 

aaaagcagaa atttctcata ggtgataggt aagtggagaa gtcagtgttt gggggaatgt 2400 
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attcctgcca cactgtaaat ccagttattt aataaaaatt gaaaagacat gaaagattgc 2460 

tctgtggctc tcaattggaa gccccaggtt tctgtgctct agttccttgt gagtggttca 2520 

tttcaccaat tacagatagc agagctctgc tgaccccaag ccagcccggg ttcaccttgg 2580 

ctgcaaggaa tgatgacggc cttgtccaga cctggctaga aagatgcagc ccggcctgtt 2640 

tgctatggat ctaaactgcc tgctggttcc tttccaaggc aggccaggaa acagtggtga 2700 

aggagtgttg ccctcatcct aacacgcagt cctttgtaat gcgtgctgtc tcacctgtat 2760 

cacgccagca ttatttatta gttcataaat cagccttcca tgatgaaaga acctggcctg 2820 

gaatcaaagt ctggaagtct gtatttcttt aagatccatg cttgaaaatt aggacaaaaa 2880 

acgcttagct ttggaggaac aaaaaggaaa cagttccgca aagagctcca gcctttttct 2940 

ggggcacggt ttgtgcagtt taacgttgga acgtacagcc tcagacgggc aaagggggcg 3000 

actgcacttc tgccgccacc agggtttttc tgtcaggtta gaaagtattt cactttgagg 3060 

ctaaaagtct cacaaggtat cttaacgctg atggaatgtt attttcatgg aatcagtata 3120 

agaaattata ttgtaaagta ttagatactt tgcattcatt catactaggt ttcagtagct 3180 

tgtgttttag actttgcgct tgtcacattt taagtggtca gtgaccacag gcttgtggct 3240 

gcccagctgc agagcacagt gcagtcacag aggagcctgt cttagagacg cgtgctttag 3300 

gttggcctgc attagggctt acattgatgt ttctgacgtg ttaatactta catagaaagg 3360 

ttttgacatt ttttcaatta gccccttatg tatagtctta ctttttagaa caacttattg 3420 

tcattttctc gtttaaataa tatgaatact tctctctttc tgttactatg tcagtttctt 3480 

attcacttag ttcaacagat ctagtgctag catggcctgt gctgctgttg gtcctactgg 3540 

gaacgcaggt agagtctcca tggtcaggat gccgtgttct gttgtggtgc tgaggcctgt 3600 

aagcacttgg ctagaagtta ggccagggaa gctcacactg accttggtat ttgaaggtcc 3660 

cagaatgagg ttgttgaggt agaacagaga tgggagaaca tgcccttgga gctggactca 3720 

acaagaaggc ccctctgggg gaagctgagg ttggacagga agggcgtgtg ccctcactga 3780 

cttgctcaca ggcttggggt tcatctgttt tggtttttgc ttttttacat tatattaaca 3840 

tggaaataaa aggtgttccc tgggatgctc ccggcttctc tgctcagtag ctttgtggct 3900 

ctgagtaaaa tgaacttgcc tgtgttgaaa tatcctaatt tttaaactta cttcataggg 3960 

actgaggaat tacgactttt atcaattttg tgacctgtta aaatgttaaa aaggacatgt 4020 

attttttaaa gatctttaag taaaacattt tgctcattcc caaagccaaa tttaaattat 4080 

accatggccc taattcagaa gttcattctt tggcaggtgt ttccttggtg cctggggcac 4140 

tctctctttc tcccagttcc tgtggcagtt tgtccaggtg cccaagaaca attcataccc 4200 

tcctttctcg ttatttatat acttgtcttt ttgcccctgc cgggtatttt agaaatcgtg 4260 

cttggtgcac tttgtgtcct tcagtgtgcc ttacccagag cagatgcacg ataagcattt 4320 

ttacacgaga acaagctggt ggtgtaggcc tctgctaagg aacaggctgt atatgctctt 4380 

tgtgggatta aggtagaatc agctttaact ccaaagaaac tgtccatgaa ttttgtttat 4440 

aatagcaagt agatttaaaa tgacactttg aaaaaattct gttgtctttc tctacatata 4500 

atgctgtaga aaaatataac tgatggattt tgtgaacgtg gtatttttaa cttttcgtag 4560 

cagcacaaag aatgctcctc tttgtgtgtc aatgaaattg ttctttatga cagcttcagg 4620 

gtaaatctgc agagaccaga gaaagcacac agctgagccc tgcagatctc acagaaggga 4680 

aacccacaga tccctctaag ctggaaagtc cgtcattcac aggaactggt gacacagaaa 4740 

tagctcatgc aactgaggat ttagagaata atggcagtaa gaaagatggc gtgtgtgggt 4800 

aagaaaagtg gttatgtctt gatctaatac gctggtgtct tcacagtcct tttggattaa 4860 

atgggatccc agagccactt cttggtcggt ttgagggggc agtacatgtg tggtatcagg 4920 

cacatgcagt gtgagggctg gactctgtgg aagccggaag gtttcaacat ctgccgcaaa 4980 

ccgttccatg ttgcacagaa ctgacagaaa ggaagaatgt gcttttggta ttgaaggtta 5040 

ccacacatta cagattgatt tgtctcatcc tgattccctt tttagtcctc ctccatcaaa 5100 

gaaaatgaag ttatttggat ttaaagaaga tccatttgta tttattcctg aagatgaccc 5160 

attatttcca cctattgagt aaggattcag cctttttaat tattcattta aagaaattta 5220 

ctatagagta tcaaatgtac aactgatcac atgtaaccat tgttttgtat gtagttctgt 5280 

ctagcttttt tttttttttt aaccttttta actgcatatt agagcaggat gaaactttag 5340 

aggttactca atcttttaat ttaaggagaa agtaaacttt tactttgtga acatgataga 5400 

taaaaaaaaa ctggaccggg cgcggtggct cacggctgta atcccagcac tttgggaggc 5460 

cgagacgggc ggagcacgag gtcaggagat tgagaccatc ctggctgaca cggtgaaacc 5520 

ccgtctctac taaaaataca aaaaaaatta gccgggcgtg gtggcgggca cctgtagtcc 5580 

cagctactct ggaggctgag gcaggagaat ggcgtgaacc tgggaggtgg agcttgcagt 5640 

gagccgagat cgcgccgctg cactccagcc tgggcgacag agcgagactc cgtctcaaaa 5700 

aaaacaaaag aaaactggac tgtgattatg aatctaaatt agttgtgatc ctgaacctaa 5760 

attactcaat tgagtatata gacgagtagc caaggttgtc tagttcagtt tctgtagaag 5820 

aaatgaagag cagcatgggt gagggctaat tagggatgac atagacagaa tatagaggaa 5880 

aagggtttag gacaagtctg acattcatct gtttcttatt cattgaattt tagaatctat 5940 

ttaccagggc ggtcacttac tgtctttttt aataactggt cctttgcata catttgtaaa 6000 

agtctattaa aaaataagtc tcagccgggc acatcccagc acttggggag gctgaggcag 6060 



28 



WO 02/076488 PCT/US02/08276 



gcagatcatg aggttaggag tttgagacca gtctgaccaa tgtggtgaaa ccccgtcttt 6120 

actaaaaata caaaaaaaaa ttaggtgtgg tggcgtgcac ctataatcac agctgctcag 6180 

gctgaggcag gagaaccgct tcaacccggg aggcagaggt tgcagtgagc tgagactgcg 6240 

ccactgcact ccagcttgga agacagagca agactccata tcaaaaaaaa aaaaaaa 6297 

<210> 50 
<211> 3408 
<212> DNA 

<213> Homo sapiens 
<400> 50 

ccacgcgtcc ggaggcagga ccttgtccta ttcattaatc ttgcccctca acagttattt 60 

tcagaggggc aagaagtgtt tcagggttct tggcccttgt ttgaccagtc gtcctaaccc 120 

tcatgtcttg ggtcattgtt gttataatct ggggttacct tttggaaggt catggggtac 180 

ccttttgcaa aagttatggg ccctctcctt ggaaactgca cacacaccat gcagcttaca 240 

attcagggag ttcacaggtc tacagaatcc tggaaactct catgtccggt tctactcatt 300 

gtagcttcag tggaaccttc tagcagtcct ttccagctcc tccccagctc ctcagctctg 360 

cttccctccg cccatcaagc cctcctcagc ccataaggtg ggccaggtgg cctgtgggga 420 

taaatcagag tgcccacaag tgcaggggcc caaagacatc ccagagcaaa cccaagaatc 480 

cctctacaag ccccagccca ctcagaagga tgcattttgc cccctctgtt tatttgtttg 540 

tttttaatta tgaaagtagg gcatggtcat tgttgagaat gtgagaaatg cagagaagtt 600 

aaaatgatgc tttttgttta gggcgctgct gcttctggct taagatccta aatcaaagca 660 

gctgccagat ctggacctaa gacttgcttc ccatcacctt acataaaaga aagagcactg 720 

gactaggaat caaagatctg aattcccatc aaatctctgc tattactagc ttttatctct 780 

atttatttat t:tatctgtct atctatctag taccttttgt gaatatgagt gtttctcacc 840 

gaggccctcc catctctctt gcccccgatc ctggaatgga ccaaatacct tgtttactga 900 

aggatacaga tggcatgtga ctgttgagaa tcactcaccc tcttagagcc atggtttcct 960 

cttctataaa atagggatgt tcgtgcctat ttgctaatca tgagtgacaa tgacataagg 1020 

tacataaagc tgtactgcat gatgcaaaca taacatcata tttgtagggt tgttgctaat 1080 

aatactatcc atcagcaaag cagtcattca tttactcagt caaatactga tgcactggta 1140 

cattcagttt cctcttttgt aaaatgggga taaaaatagg acttagctca tagggtattt 1200 

aagattcagt gagttaatat atataaaata cttagagcag tccttggaac atattaagaa 1260 

ctcaatacat attagctagt tgagcaggct gtagtatttg ttccagccaa gaaaagactg 1320 

ttctctaaca gcacaggaaa taaagatggg gttaggcacg acacggcagc aggacttacc 1380 

ttctgtctaa ttcagctggc agtcaaagaa agaattatta gaagcctatg agcttggtct 1440 

cccaaagatc tactgagtac agggggatat ttaaagaata aaaatcccta gacccactta 1500 

cagtacagga gagacaagaa gctgttcaca caataacaag tgctaaattc cttgattttt 1560 

atactgacag ctgaagtttt agagaagaga aggatcaata aagaccggaa tattaaagca 1620 

gacaggccta aaagaggatt ttagatttga taaagaattc ctccagttct cagagcaggg 1680 

actttggagg gtaacaactt ggatttcagt ccttgcctgc cacttactga ttgtatgacc 1740 

ttggaaaagt tacttcactt ctgtgagcaa tgattctctc atctggaaaa caaaacaaaa 1800 

caaaaaaact aacaagggtg ataataatac ctacctccct acctcatagg gctgatgaga 1860 

agattaaaaa gtacctacat aaagcccttc tctgcgtgcc tggagcatgg caagggctcc 1920 

atgtgaacca ctattttttt tttttttttt aatgaaaagt catgggcagc accaaagctc 1980 

agaattttgg cagtaaggaa ttatgattct acattgaaat ttgccagaag gggagctgac 2040 

tgcctcatga gacatttttg aatgaggcca aaaaaggaaa caagtgtatc ctgggatttt 2100 

acgagatgct aggtatgtcc ttagcactta atcctcatga ctaccctatg atgtaagtac 2160 

tatctgttgt ccctatttta cagttggcca aggtcacata gctgaaacgg acttctgtgt 2220 

gtcttctaac ttttgctttc aggtctggaa aaatgcaatg taaacctaga ccctctttga 2280 

aatactgaag atggtaatct tatcccttcc tcactttctg ttttctaaaa taacagtcct 2340 

tgttccttac aatgtctgtt ttccagattc ttagaagact ttttgcttat tttccataac 2400 

tctttacttg tgatccctga atgacaccgg gggtatagca gagaatgtcc atttcctcaa 2460 

agttcaaagg tcctacaaaa aatagttgct agcctggcat gatggtgtgt gcctgtggcc 2520 

tcagctacct gggaggctga ggcaggagga ttgcttgagc tgaggagttt gaggcgcagc 2580 

gtgatcgcgg ctcaccgcag cctcccgggc tcaagcattc ctctcgcctc agcctcctga 2640 

gtagctggga ccacagctcc actaattttg aagttttttt ggtagacatg aagtctccct 2700 

gtgttgcccg ggctggtctc aaactcctga cctcaagcag tcctcctgtc ttggcctctg 2760 

gaaatgctgg gattacaggc gtgagccact gtgctggcct cttttttctt tttctttttt 2820 

tttaaggttt ttatttgtta aatgggaagt ctgtgccatc aactgagcat tgtattttct 2880 

ccttagtaag agcctgggtg ggccactggg agagaactat acattaaatg taagtagcct 2940 

ctgggtagag agcccctggc tggtttcctt tcctttctct ccttttctct actttggtgt 3000 
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ctggaggcat ttcccagact ccagtttctt accaccctca cggattttgc tattgtatta 3060 

tcacctcctt tatcattccc aaaattgact ttatggagac tcattaaaag aaagaatcat 3120 

cggccgggag cggtggctca cgccacgaag gcgggcgaat cacctgaggt gcggagttcg 3180 

tgaccagcct gaccaaaaca gagaaacccc atctctacta aacaatacaa aattagctgg 3240 

gcgtggtggt gcacgcctgt aatcccagct actggggagg ctgggacggg agaatcactt 3300 

gaacccggga ggcagaggtt gcagtgacca aagatcgcac tattgcactc cagcctgggc 3360 

aacaagagca aaactctatc tcaaaaaaaa aaaaaaaaaa aaaaaaaa 3408 

<210> 51 

<211> 1663 

<212> DNA 

<213> Homo sapiens 

<400> 51 

tcgacccacg cgtccgggca gtggggtgag ggcacacaag cagttcaggg tcccagcagg 60 

aagtggggct gcagggccgg ggtgggtcct gggcctggcc atcaggcagc ctagcaggtt 120 

gttctgggca tggagggggc ctggtgtggc tgagggcatg cccagggctc cctggaggat 180 

cccgctctgt gccctgccca ccctgtgcct ggggagccct ctgccctcac agcccaccca 240 

ccccattttc tatgaccaca gagctccgac ctggaagatg gctcacccag gaggtcccag 300 

gagctctcac tcccccagga cctggaggac acccagctct cagacaaagg ctgccttgcc 360 

ggcgggggga gcccgaaaca gccctttgca gctctgcacc aggagcaggt tttgcggaac 420 

ccccatgtaa ggcttccccg gggtggggtc ctcccagccg tgggcctcag ggtgaccgat 480 

cacagggaga gtggctccct gccctgggca ccccctgcgg tggccccgac gacagctgag 540 

gagtgaccac aaggtctctg cccacagtgc tcggggtgcg gtgtctgggc tgcgaagtgg 600 

atccccctcc tttcttgggc actgcagcag cttggggggc tttttggacg tggatgtgcc 660 

tggtcctggt ttcccgaggg cctttacagt ggatgaggag gtgaacacag gagtcctgag 720 

agcaagcacc acctcgggct ttgttgtaga aacaatggcc cggaccccag gccggagccg 780 

tggcttggcc tcctgggtgt gtcttggcat ctgaaatgca ggctacccac accggctcac 840 

ctccaggggt acaggcaggt cccacaggga gagcttggcg ctgagctgag gctgtctggg 900 

ctcctcgcct cccaaccagt ctgcagttac aggggccagt ggggggcggg tgagaaggac 960 

gggttccctc aggggagccg gccggagccc gagccttccc ccttctccag gacgcaggcc 1020 

tgagcagcgg ggagccgccc gagaaggagc ggcggcgcct caaagagagt tttgagaact 1080 

accgcaggaa gcgcgccctc aggaagatgc agaaaggatg gcggcagggg gaggaggacc 1140 

gggagaacac cacgggcagc gacaacaccg acactgaggg ctcctagccg cagcagcgca 1200 

ggccccgacc agggcacacc caccggcccg gcctcctgcc acccgggggt gccgacgccc 1260 

tggggcgcag acttccccga gccgtcgctg acttggcctg gaacgaggaa tctggtgccc 1320 

tgaaaggccc agccggactg ccgggcattg gggccgtttg ttaagcggca ctcattttgc 1380 

ggaggccatg cgggtgctca ccacccccat gcacacgcca tctgtgtaac ttcaggatct 1440 

gttctgtttc accatgtaac acacaataca tgcatgcatt gtattagtgt tagaaaacac 1500 

agctgcgtaa ataaacagca cgggtgaccc gcaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1560 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1620 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 1663 

<210> 52 
<211> 2343 
<212> DNA 

<213> Homo sapiens 
<400> 52 

agtcaggact cccaggacag agagtgcaca aactacccag cacagccccc tccgccccct 60 

ctggaggctg aagagggatt ccagcccctg ccacccacag acacgggctg actggggtgt 120 

ctgcccccct tggggggggg cagcacaggg cctcaggcct gggtgccacc tggcacctag 180 

aagatgcctg tgccctggtt cttgctgtcc ttggcactgg gccgaagccc agtggtcctt 240 

tctctggaga ggcttgtggg gcctcaggac gctacccact gctctccggg cctctcctgc 300 

cgcctctggg acagtgacat actctgcctg cctggggaca tcgtgcctgc tccgggcccc 360 

gtgctggcgc ctacgcacct gcagacagag ctggtgctga ggtgccagaa ggagaccgac 420 

tgtgacctct gtctgcgtgt ggctgtccac ttggccgtgc atgggcactg ggaagagcct 480 

gaagatgagg aaaagtttgg aggagcagct gacttagggg tggaggagcc taggaatgcc 540 

tctctccagg cccaagtcgt gctctccttc caggcctacc ctactgcccg ctgcgtcctg 600 

ctggaggtgc aagtgcctgc tgcccttgtg cagtttggtc agtctgtggg ctctgtggta 660 

tatgactgct tcgaggctgc cctagggagt gaggtacgaa tctggtccta tactcagccc 720 
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aggtacgaga aggaactcaa ccacacacag cagctgcctg actgcagggg gctcgaagtc 780 

tggaacagca tcccgagctg ctgggccctg ccctggctca acgtgtcagc agatggtgac 840 

aacgtgcact tcggcctctc cctgtactgg aatcaggtcc agggcccccc aaaaccccgg 900 

tggcacaaaa acctgactgg accgcagatc attaccttga accacacaga cctggttccc 960 

tgcctctgta ttcaggtgtg gcctctggaa cctgactccg ttaggacgaa catctgcccc 1020 

ttcagggagg acccccgcgc acaccagaac ctctggcaag ccgcccgact gcgactgctg 1080 

accctgcaga gctggctgct ggacgcaccg tgctcgctgc ccgcagaagc ggcactgtgc 1140 

tggcgggctc cgggtgggga cccctgccag ccactggtcc caccgctttc ctgggagaac 1200 

gtcactgtgg acaaggttct cgagttccca ttgctgaaag gccaccctaa cctctgtgtt 1260 

caggtgaaca gctcggagaa gctgcagctg caggagtgct tgtgggctga ctccctgggg 1320 

cctctcaaag acgatgtgct actgttggag acacgaggcc cccaggacaa cagatccctc 1380 

tgtgccttgg aacccagtgg ctgtacttca ctacccagca aagcctccac gagggcagct 1440 

cgccttggag agtacttact acaagacctg cagtcaggcc agtgtctgca gctatgggac 1500 

gatgacttgg gagcgctatg ggcctgcccc atggacaaat acatccacaa gcgctgggcc 1560 

ctcgtgtggc tggcctgcct actctttgcc gctgcgcttt ccctcatcct ccttctcaaa 1620 

aaggatcacg cgaaagggtg gctgaggctc ttgaaacagg acgtccgctc gggggcggcc 1680 

gccaggggcc gcgcggctct gctcctctac tcagccgatg actcgggttt cgagcgcctg 1740 

gtgggcgccc tggcgtcggc cctgtgccag ctgccgctgc gcgtggccgt agacctgtgg 1800 

agccgtcgtg aactgagcgc gcaggggccc gtggcttggt ttcacgcgca gcggcgccag 1860 

accctgcagg agggcggcgt ggtggtcttg ctcttctctc ccggtgcggt ggcgctgtgc 1920 

agcgagtggc tacaggatgg ggtgtccggg cccggggcgc acggcccgca cgacgccttc 1980 

cgcgcctcgc tcagctgcgt gctgcccgac ttcttgcagg gccgggcgcc cggcagctac 2040 

gtgggggcct gcttcgacag gctgctccac ccggacgccg tacccgccct tttccgcacc 2100 

gtgcccgtct tcacactgcc ctcccaactg ccagacttcc tgggggccct gcagcagcct 2160 

cgcgccccgc gttccgggcg gctccaagag agagcggagc aagtgtcccg ggcccttcag 2220 

ccagccctgg atagctactt ccatcccccg gggactcccg cgccgggacg cggggtggga 2280 

ccaggggcgg gacctggggc gggggacggg acttaaataa aggcagacgc tgtttttcta 2340 

ccc 2343 

<210> 53 
<211> 3091 
<212> DNA 

<213> Homo sapiens 

* 

<400> 53 

aaaccggaaa gtttgtagga aaattgctgc acatggcctt tgcagaaaag agagccttca 60 

aaacctctta cattccagta gaaaactctc tctgcaagtc cttaactttg ttcactcatt 120 

ccaggaaggt gcttcaatat tggatattca cacagagccc agtttttcaa gtttgctttc 180 

acagtcatcg tatgctgaca tgggtgttcc acttcctgca aaaaacttaa tatttaaaga 240 

tggtgtctta tcagaakgga gtggacggtc accttcctca cttcttattg ctaatctcca 300 

tttgcaataa tttggttaca ccatttgttg ctcacacttt ctgccttttt tctttcttaa 360 

cgttagcttt atagtgtcag ccactaaaaa gcatcctgct gctgcagtgc aattcttgct 420 

taactaatat taaaagttgg ggaacatatt catgttttct gaagttttgc tcattattgc 480 

acatcttatt gcgacaaagt gctttttagc agccagcact gtatttttta ccttgagaca 540 

atctgcattt cttttataaa actaagtata tactttatag gctttatgat gactgttatg 600 

tttataagca gtcactatga aaattgcaat ggtaatttta tatgttagtt tatcaaacat 660 

aaatcttgtt taattttata ttttgttacc tatactttgg gggatcaagg gaagagatgg 720 

aactcttcct ctgaaaaggc ttcttggtac ttaaagtagt aaaactataa aacaataaac 780 

atccagtatt gagagatgat atgatagggc attatgaatt cctatgggtg tctgtaaatt 840 

atgtatgtca gttggacatt gtagaaggta tgtaaatcag catagttgtg tataacttaa 900 

ccttgattta taaggtctta agattatgac tattcattga catctcatga gaagctttag 960 

aagactttct atttttaaac accatttata tgtggacttc tgttgtcact gactttgggc 1020 

tttatatttt catagagtct ttatggaaaa aatagaattt attttccact cttgtagcta 1080 

tagctgctgc acactttcac cctgatttat ttttttgttt cttagctttg atgttttcaa 1140 

accaaggatt gtgattttag gttagaatta catattagaa gcattaagac tatgtctttg 1200 

gatcagaatg ctttagtgat aaacctactt tgaagacata ctcttaagca atctggatct 1260 

taaatttatg tgaatacttt tttagaaaat gataaagaaa aatggaatta cttcaaagtg 1320 

tttcttgagt cattgattct tttagcatct caaatgttaa ttagaataat tggaatcact 1380 

ttttagactt ttcaagttac cttccttggg aagtttgtgc agtgttatag tttagtttag 1440 

ctcctcttac agggtaatgg tttgctagtt taaaactgta accaaacgaa ctggtcagac 1500 

aacatatatc taaaacactt aaaatgttag gaagtttggg aatgttataa cctaaacgtt 1560 



31 



WO 02/076488 PCTAJS02/08276 



tttgctggta actttttgtt atttatagat atttgtgtat ttaacataca tacttcagga 1620 

aatatatgcc tttcctaaaa cttaaccatg cattcaatac catggcctat ctatagaatt 1680 

gaatattttg gaccatgtta tctgtggcac agtcagtgct gtgtttgagg taaatgcagt 1740 

aacggttagt tttctacttt gtcttataga aggtagaaac catgtgtatg ttatgtttgt 1800 

ctataaaaga aaaaatacta atattaaata atttcttacg actctgagtc actcacttat 1860 

ttttccaata attgatattg tacattccta gtgccattag gtatgtatgt atgtaacttt 1920 

tacagttttt cagctgaaag ttgtaagtat tttttttttt tgatcggggc tctttaatct 1980 

cattttaatt tcctttgttt gaactgtagt tatttattcc tatattaacc atctaaacca 2040 

actgtaatga catgtacact aatacagaat tgaacatttg tagttgttgg cagtgaaccc 2100 

agttgttggt gaatttaaag cttaaaatat gggaatgatt tgctgctata tttcctttga 2160 

gagagaaagg aggaagaaat agaacctaat agtgatcatg aattttaggg aaagtaccga 2220 

agaaccatgg ggtcccctct ggtttcttgt gttgaatgag gcaagggtaa tcatctgatt 2280 

ccgagctgaa gacctctggt cctcttaagg agggagagtg catttttaga gcttttagca 2340 

aaatgtgaaa agctgatgtt tgcgccttgc tttgtgaatt tggctttgtt ttacttatac 2400 

attaactcat gtaatctctt aaatcttaca agcattgatc catttcaaca aaaaggtaaa 2460 

tttaaaatgc agactttgtt atttgccaaa gaagattcat gaaaaattta cgtccaatta 2520 

ttttgcaaat agttaatttc atttggcttt ttaccatgtt ccttcctttc tttttcccgc 2580 

ttccttaatg taatttaaac cctggcaaac attctttaga aaccaagagg aaagaaagaa 2640 

caaatatcaa aaaagacata gaatttaata ttgatacaat ttcacctcta aaatggattt 2700 

gaagaaatgc aactttatat caaaaaatgt catctgattt cctttgtttc ttttttaaat 2760 

tatgtaatca gatgatttta tgtttttttt tcaggggagc ggaatattgg tttcttttac 2820 

ttgttgtttt cagttttctc tgccattcat gtttcttttt tgtgttcagt gtttcaaata 2880 

caatttgtat ttaaggattt taaaatacca aactgtaact gagtacagtg gatcgttttc 2940 

tgttaggatg ttaatattat acaatgaaat ctataaagtg ttgtcaattt gattattgac 3000 

acatataaca tgtttacaaa taaactgtgg tattgatcaa gttactatga aaaaaaaaaa 3060 

aaacccgggg ggggccccgg aacccaatcc c 3091 

<210> 54 
<211> 1748 
<212> DNA 

<213> Homo sapiens 
<400> 54 

agacgttccc tcgcggccct ggcacctcca accccagata tgctgctgct gctgctgctg 60 

cccctgctct gggggaggga gagggtggaa ggacagaaga gtaaccggaa ggattactcg 120 

ctgacgatgc agagttccgt gaccgtgcaa gagggcatgt gtgtccatgt gcgctgctcc 180 

ttctcctacc cagtggacag ccagactgac tctgacccag ttcatggcta ctggttccgg 240 

gcagggaatg atataagctg gaaggctcca gtggccacaa acaacccagc ttgggcagtg 300 

caggaggaaa ctcgggaccg attccacctc cttggggacc cacagaccaa aaattgcacc 360 

ctgagcatca gagatgccag aatgagtgat gcggggagat acttctttcg tatggagaaa 420 

ggaaatataa aatggaatta taaatatgac cagctctctg tgaacgtgac agccttgacc 480 

cacaggccca acatccttat ccccggtacc ctggagtctg gctgcttcca gaatctgacc 540 

tgctctgtgc cctgggcctg tgagcagggg acgcccccta tgatctcctg gatggggacc 600 

tctgtgtccc ccctgcaccc ctccaccacc cgctcctcag tgctcaccct catcccacag 660 

ccccagcacc acggcaccag cctcacctgt caggtgacct tgcctggggc cggcgtgacc 720 

acgaacagga ccatccaact caatgtgtcc taccctcctc agaacttgac tgtgactgtc 780 

ttccaaggag aaggcacagc atccacagct ctggggaaca gctcatctct ttcagtccta 840 

gagggccagt ctctgcgctt ggtctgtgct gttgacagca atccccctgc caggctgagc 900 

tggacctgga ggagtctgac cctgtacccc tcacagccct caaaccctct ggtactggag 960 

ctgcaagtgc acctggggga tgaaggggaa ttcacctgtc gagctcagaa ctctctgggt 1020 

tcccagcacg tttccctgaa cctctccctg caacaggagt acacaggcaa aatgaggcct 1080 

gtatcaggag tgttgctggg ggcggtcggg ggagctggag ccacagccct ggtcttcctc 1140 

tccttctgtg tcatcttcat tgtagtgagg tcctgcagga agaaatcggc aaggccagca 1200 

gcggacgtgg gagacatagg catgaaggat gcaaacacca tcaggggctc agcctctcag 1260 

ggtaacctga ctgagtcctg ggcagatgat aacccccgac accatggcct ggctgcccac 1320 

tcctcagggg aggaaagaga gatccagtat gcacccctca gctttcataa gggggagcct 1380 

caggacctat caggtcaaga agccaccaac aatgagtact cagagatcaa gatccccaag 1440 

taagaaaatg cagaggctcg ggcttgtttg agggttcacg acccctccag caaaggagtc 1500 

tgaggctgat tccagtagaa ttagcagccc tcaatgctgt gcaacaagac atcagaactt 1560 

attcctcttg tctaactgaa aatgcatgcc tgatgaccaa actctccctt tccccatcca 1620 

atcggtccac actccccgcc ctggcctctg gtacccacca ttctcctctg tacttctcta 1680 
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aggatgacta ctttagattc cgaatatagt gagattgtaa cgtgaaaaaa aaaaaaaaaa 1740 
aaaaaaaa 1748 

<210> 55 
<211> 766 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (670).. (670) 
<223> n equals a,t,g, or c 

<220> 

<221> mis cofeature 
<222> (713) . . (713) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (721).. (721) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (728) . . (728) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (731).. (731) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (756)- . (756) 
<223> n equals a,t,g, or c 

<400> 55 

ggcgagacgc gttcaccacc atatgcactg ttgtcaactc ccctggagat gcgcccaagc 60 

cccacaggaa gccttcctcc tctgcctcct cttcctcatc ctcgtcctcg ttctccttgg 120 

atgcagccgg ggcctccctg gccacactcc ctggaggctc catcctgcag ccgcggccct 180 

ccttgcccct ctcctccacg atgcacttgg ggcctgtggt ttccaaggcc ctgagtacct 240 

cttgccttgt ttgctgcctc tgccaaaacc cggccaactt caaggacctt ggggacctct 300 

gtgggcccta ctaccctgaa cactgcctcc ccaaaaagaa gccaaaactc aaggagaagg 360 

tgcggccaga aggcacctgt gaggaggcct cgctgccgct tgagagaaca ctcaaaggtc 420 

ccgagtgtgc agctgccgcc actgccggga agccccccag gcctgacggc ccagctgacc 480 

cggccaagca gggcccactg cgcaccagtg cccggggcct gtcccggagg ctgcagagct 540 

gctactgctg tgatggccgg gaggatgggg gcgaggaggc agccccagcc gacaagggtc 600 

gcaaacatga gtgcagcaag gaggctccgg cagagcccgg cggggaggcc caggaagcac 660 

tgggtgmatn aagcctgttm ccgtgttgga ccgggggggt tttaactggt ggncggggaa 720 

ntcthttngg nttgcaagag gccaattaaa gttggncctt ggaaaa 766 

<210> 56 
<211> 2803 
<212> DNA 

<213> Homo sapiens 
<400> 56 

cccacgcgtc cgcgacccac gcgtccgggg ggaggtaact gcagtaagtc ccgcttggcc 60 
ctggagtcca cgcggatttt cgaagctggg gctggcaaga ggccgctgga caccacgctc 120 
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cagtcgtcag cccacttcct agctgaacag cgcgaggcgg cggcagcgag ccgggtccca 180 

ccatggccgc gaattattcc agtaccagta cccggagaga acatgtcaaa gttaaaacca 240 

gctcccagcc aggcttcctg gaacggctga gcgagacctc gggtgggatg tttgtggggc 300 

tcatggcctt cctgctctcc ttctacctaa ttttcaccaa tgagggccgc gcattgaaga 360 

cggcaacctc attggctgag gggctctcgc ttgtggtgtc tcccgacagc atccacagtg 420 

tggctccgga gaatgaagga aggctggtgc acatcattgg cgccttacgg acatccaagc 480 

ttttgtctga tccaaactat ggggtccatc ttccggctgt gaaactgcgg aggcacgtgg 540 

agatgtacca atgggtagaa actgaggagt ccagggagta caccgaggat gggcaggtga 600 

agaaggagac gaggtattcc tacaacactg aatggaggtc agaaatcatc aacagcaaaa 660 1 

acttcgaccg agagattggc cacaaaaacc ccagtgccat ggcagtggag tcattcatgg 720 

caacagcccc ctttgtccaa attggcaggt ttttcctctc gtcaggcctc atcgacaaag 780 

tcgacaactt caagtccctg agcctatcca agctggagga ccctcatgtg gacatcattc 840 

gccgtggaga ctttttctac cacagcgaaa atcccaagta tccagaggtg ggagacttgc 900 

gtgtctcctt ttcctatgct ggactgagcg gcgatgaccc tgacctgggc ccagctcacg 960 

tggtcactgt gattgcccgg cagcggggtg accagctagt cccattctcc accaagtctg 1020 

gggatacctt actgctcctg caccacgggg acttctcagc agaggaggtg tttcatagag 1080 

aactaaggag caactccatg aagacctggg gcctgcgggc agctggctgg atggccatgt 1140 

tcatgggcct caaccttatg acacggatcc tctacacctt ggtggactgg tttcctgttt 1200 

tccgagacct ggtcaacatt ggcctgaaag cctttgcctt ctgtgtggcc acctcgctga 1260 

ccctgctgac cgtggcggct ggctggctct tctaccgacc cctgtgggcc ctcctcattg 1320 

ccggcctggc ccttgtgccc atccttgttg ctcggacacg ggtgccagcc aaaaagttgg 1380 

agtgaaaaga ccctggcacc cgcccgacac ctgcgtgagc cctaggatcc aggtcctctc 1440 

tcacctctga cccagctcca tgccagagca ggagccccgg tcaattttgg actctgcact 1500 

ccctctcctc ttcaggggcc agacttggca gcatgtgcac caggttggtg ttcaccagct 1560 

catgtcttcc ccacatctct tcttgccagt aagcagcttt ggtgggcagc agcagctcat 1620 

gaatggcaag ctgacagctt ctcctgctgt ttccttcctc tcttggactg agtgggtacg 1680 

gccagccact cagcccattg gcagctgaca acgcagacac gctctacgga ggcctgctga 1740 

taaagggctc agccttgccg tgtgctgctt ctcatcactg cacacaagtg ccatgctttg 1800 

ccaccaccac caagcacatc tgtgatcctg aagggcggcc gttagtcatt actgctgagt 1860 

cctgggtcac cagcagacac actgggcatg gacccctcaa agcaggcaca cccaaaacac 1920 

aagtctgtgg ctagaacctg atgtggtgtt taaaaga.gaa gaaacactga agatgtcctg 1980 

aggagaaaag ctggacatat actgggcttc acacttatct tatggcttgg cagaatcttt 2040 

gtagtgtgtg ggatctctga aggccctatt taagtttttc ttcgttactt tgctgcttca 2100 

tgtgtacttt cctaccccaa gaggaagttt tctgaaataa gatttaaaaa caaaacaaaa 2160 

aaaacactta atatttcaga ctgttacagg aaacaccctt tagtctgtca gttgaattca 2220 

gagcactgaa aggtgttaaa ttggggtatg tggtttgatt gataaaaagt tacctctcag 2280 

tattttgtgt cactgagaag ctttacaatg gatgcttttg aaacaagtat cagcaaaagg 2340 

atttgttttc actctgggag gagagggtgg agaaagcact tgctttcatc ctctggcatc 2400 

ggaaactccc ctatgcactt gaagatggtt taaaagatta aagaaacgat taagagaaaa 2460 

ggttggaagc tttatactaa atgggctcct tcatggtgac gccccgtcaa ccacaatcaa 2520 

gaactgaggc ctgaggctgg ttgtacaatg cccacgcctg cctggctgct ttcacctggg 2580 

agtgctttcg atgtgggcac ctgggcttcc tagggctgct tatgagtggt tctttcacgt 2640 

gttgtgtcca tagctttagt cttcctaaat aagatccacc cacacctaag tcacagaatt 2700 

tctaagttcc ccaactactc tcacaccctt ttaaagataa agtatgttgt aaccaaaaaa 2760 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 2803 

<210> 57 

<211> 2181 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (5) . . (5) 

<223> n equals a,t,g, or c 

<400> 57 

gtacnggatt cccgggtcga cccacgcrtc cggcaggcag aggggcagtg ggcgcgggga 60 

agatgattct gggcctcccc catctactgt cattaaccaa aatgaaacat ttgccaacat 120 

aatttttaaa cctactgtag tacaacaagc caggattgcc cagaatggaa ttttgggaga 180 

ctttatcatt agatatgacg tcaatagaga acagagcatt ggggacatcc aggttctaaa 240 
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tggctatttt gtgcactact ttgctcctaa agaccttcct cctttaccca agaatgtggt 300 

attcgtgctt gacagcagtg cttctatggt gggaaccaaa ctccggcaga ccaaggatgc 360 

cctcttcaca attctccatg acctccgacc ccaggaccgt ttcagtatca ttggattttc 420 

caaccggatc aaagtatgga aggaccactt gatatcagtc actccagaca gcatcaggga 480 

tgggaaagtg tacattcacc atatgtcacc cactggaggc acagacatca acggggtcct 540 

gcagagggcc atcaggctcc tcaacaagta cgtggcccac agtggcattg gagaccggag 600 

cgtgtccctc atcgtcttcc tgacggatgg gaagcccacg gtcggggaga cgcacaccct 660 

caagatcctc aacaacaccc gagaggccgc ccgaggccaa gtctgcatct tcaccattgg 720 

catcggcaac gacgtggact tcaggctgct ggagaaactg tcgctggaga actgcggcct 780 

cacacggcgc gtgcacgagg aggaggacgc aggctcgcag ctcatcgggt tctacgatga 840 

aatcaggacc ccgctcctct ctgacatccg catcgattat ccccccagct cagtggtgca 900 

ggccaccaag accctgttcc ccaactactt caacggctcg gagatcatca ttgcggggaa 960 

gctggtggac aggaagctgg atcacctgca cgtggaggtc accgccagca acagtaagaa 1020 

attcatcatc ctgaagacag atgtgcctgt gcggcctcag aaggcaggga aagatgtcac 1080 

aggaagcccc aggcctggag gcgatggaga gggggacmcc aaccacatcg agcgtctctg 1140 

gagctacctc accacaaagg agctgctgag ctcctggctg caaagtgacg atgaaccgga 1200 

gaaggagcgg ctgcggcagc gggcccaggc cctggctgtg agctaccgct tcctcactcc 1260 

cttcacctcc atgaagctga gggggccggt cccacgcatg gacggcctgg aggaggccca 1320 

cggcatgtcg gctgccatgg gacccgaacc ggtggtgcag agcgtgcgag gagctggcac 1380 

gcagccagga cctttgctca agaagccata ccagccaaga attaaaatct ctaaaacatc 1440 

agtggatggt gatccccact ttgttgtgga tttccccctg agcagactca ccgtgtgctt 1500 

caacattgat gggcagcccg gggacatcct caggctggtc tctgatcaca gggactctgg 1560 

tgtcacagtg aacggagagt taattggggc acccgcccct ccaaatggcc acaagaaaca 1620 

gcgcacttac ttgcgcacta tcaccatcct catcaacaag ccagagagat cttatctcga 1680 

gatcacaccg agcagagtca tcttggatgg tggggacaga ctggtgctcc cctgcaacca 1740 

gagtgtggtg gtggggagct ggggkctgga ggtgtccgtg tctgccaacg ccaatgtcac 1800 

cgtcaccatc cagggctcca tagcctttgt catcctcatc cacctctaca aaaagccggc 1860 

gcccttccag cgacaccacc tgggtttcta cattgccaac agcgagggcc tttccagcaa 1920 

ctgccacgga ctgctgggtc agttcctgaa tcaggatgcc agactcacag aagaccctgc 1980 

agggcccagc cagaacctca ctcaccctct gctccttcag gtgggagagg ggcctgaggc 2040 

cgtcctaaca gtgaaaggcc accaagtccc agtggtctgg aagcaaagga agatttacaa 2100 

cggggaagag cagwtagayt gytggtttgc caggaacatg ccgccaaact gattgacggg 2160 

gagtacagga ttacctggca t 2181 

<210> 58 

<211> 2207 

<212> DMA 

<213> Homo sapiens 

<400> 58 

ccacgcgtcc ggaaaaaggg aaaagatgcc gtgtaaaatc tcgttctgtg tctgaattgc 60 

cgtagggctc agatcttcat ttgaggttct gtgtctgaat tgccgtaggg ctcagatctt 120 

catttgaggt tatgttctat aagttaacgt tgatcttgtg tgagctttcg gtagctggag 180 

taacacaggc ggcctcacag cgacctctcc agcgccttcc aaggcacatc tgcagccagc 240 

gtagctcctc ctgggagatg cctcctcaag gccctgctcc agaccacgtg gggagggcct 300 

gacagccaat tcccaggctg tccccaccct tggagagtga ccctaaacgc tagacagatg 360 

gggaatggga aagaaaagaa agctgcagac ctcaagttaa aattccctca aaaacgtttt 420 

tatttatctg ctttttctga aaggataaag gctttttgaa aattattttc taacaaataa 480 

catgaacact tctagaaacc ctagaaaaac acaaagtatt caaaatagaa agaaaaatta 540 

cccattactc tttaagccag cattatccat tgcggtgctt ttggagttgg gtgaggccgt 600 

agcctctgcc aagtcaagga gcccggtggt ggctgtggca ttcctgcagg gttgtttttt 660 

tttctttgag atggagtctc actcttgtca ccccagctgg aatgtggtgg tgtaaacagc 720 

tcactgcagc cttgaccctg aggctcaagc gatccttctg ccttggcctc ctgagtagct 780 

gggatcccag gcgagagtca ccacaccctg tccatgttcc tgcaggtctt gatatgcgag 840 

gacgctgtgt cttccctgcc acattttctt cttctttctt gagacagacc cttgctccat 900 

cacccaggcc agagtgtggt ggtgcgaaca cggctcactg cagcctcgac cctcaggctc 960 

aagcgatcct cacgcctcgg acccccaaag tgctgggatc acaggcgaga gtcaccatgc 1020 

tggcctgaat cttcagggta ttttacggtt gaagtgtcac ttacttaacc atccctgttt 1080 

caagagtgta ggtggtcacc ctgtctctgc cgctgacctg gcctggaccc tcggctgtga 1140 

gagggagggg tgggctgggc tggaggaacc tgaagccctc gtgatgtcac aagcccatct 1200 

ggctgggcat cccctgctgt gtcctgagct gcacatgccc caggtggccc ccacagcaga 1260 
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ggcgagccac tggagggtgg agggcttcca cgggacggtc ttcaggggga gaaggaaggg 1320 

cccaggcccc caggagactc aggagaccag agcctggggt caggggctca gccaggggct 1380 

cagccagggc tggatgtccg gagccagccc cgcagccctg tgttctttgt tcttcgcact 1440 

cccaccgtcc gtgtgaacag ctccagcccc acctgcgcct ccctgtgctg ggctccatca 1500 

gggagcccag aagacgtgtg tgcttctgaa attgggtccc tacatgcctt tgtcccagtg 1560 

caccttgctc cttccattta ctatcgagat ttaaatgcct gttttctccc cagaggttga 1620 

cggatatatt cagacgttac gacacggatc aggacggctg gattcaggtg tcgtacgaac 1680 

agtacctgtc catggtcttc agtatcgtat gaccctggcc tctcgtgaag agcagcacaa 1740 

catggaaaga gccaaaatgt cacagttcct atctgtgagg gaatggagca caggtgcagt 1800 

tagatgctgt tcttccttta gattttgtca cgtggggacc cagctgtaca tatgtggata 1860 

agctgattaa tggttttgca actgtaatag tagctgtatc gttctaatgc agacattgga 1920 

tt'tggtgact gtctcattgt gccatgaggt aaatgtaatg tttcaggcat tctgcttgca 1980 

aaaaaatcta tcatgtgctt ttctagatgt ctctggttct atagtgcaaa tgctttttta 2040 

gccaatagga attttaaaat aacatggaac ttacacaaaa ggcttttcat gtgccttact 2100 

tttttaaaaa ggagtttatt gtattcattg gaatatgtga cgtaagcaat aaagggaatg 2160 

ttagacgtgt aaaaaaaaaa aaaaagggcg gccgctctag aggatcc 2207 

<210> 59 

<211> 3533 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<222> (44) . . (44) 

<223> n equals a,t,g, or c 

<400> 59 

ttttatttac ttcaaattaa ctgtacttta ctcaaataga aaangaataa ttttcacatt 60 

atgaagctac acaattccaa aatacacatg ctgaggctct ttttaagtcc gaattgtcta 120 

gtaattacaa aaaagtgaag agtttacaga tatacaagga aataaaggcg aattattgca 180 

aagaaaacaa gtttaatttc actttgaatg acaacgattt ttctggaaag cagatacttc 240 

actcctttaa gtttccaccc aagccacaat aatttcaaac ggtcttgcgg atgacccagc 300 

tggtcactct tgtttatgtg gggactggag gtaatgagag ccaaaaaaag tgctataaac 360 

ctaatttggc tagagcaagt tcacacgaca cgaccgtgct ttaaaaactt gctctccatt 420 

atgtacttcc ttccatcagg ttggggaaaa aaaaatggtg gggatggtga gtaaacacac 480 

cagtggtttc atcagagggg aactcactac tcaggaggtg acggtgacgt ggtgccggtc 540 

cctgaagtac gcgcacaagc tccggaggtt gcgggagctt ccgctgccgc ctggagggaa 600 

gccggagcga cgggggtcac ggcggcggtc agagggtaaa ggtcttgctc ccagcagcct 660 

ccgcggtgga tacgtcgcca tcttggatcc gcgggacaag aaaattcatg cgagggagac 720 

gtggtgggcg gtccttcctg tgacacgacc cttgagtgac agttctattt gattgcctcc 780 

ggtactgtga ggaaaggaca cgactctatg gtgaggactg atggacatac attatctgag 840 

aaaagaaact accaggtgac aaacagcatg tttggtgctt caagaaagaa gtttgtagag 900 

ggggtcgaca gtgactacca tgacgaaaac atgtactaca gccagtcttc tatgtttcca 960 

catcggtcag aaaaagatat gctggcatca ccatctacat caggtcagct gtctcagttt 1020 

ggggcaagtt tatacgggca acaaagtgca ctaggccttc caatgagggg gatgagcaac 1080 

aatacccctc agttaaatcg cagcttatca caaggcactc agttaccgag ccacgtcacg 1140 

ccaacaacag gggtaccaac aatgtcactt cacacgcctc catctccaag caggggtatt 1200 

ttgcctatga atcctargaa tatgatgaac cactcccagg ttggtcaggg cattggaatt 1260 

cctagcagga caaatagcat gagcagttca gggttaggta gccccaacag aagctcgcca 1320 

agcataatat gtatgccaaa gcagcagcct tctcgacagc cttttactgt gaacagtatg 1380 

tctggatttg gaatgaacag gaatcaggca tttggaatga ataactcctt atcaagtaac 1440 

atttttaatg gaacagacgg aagtgaaaat gtgacaggat tggacctttc agatttccca 1500 

gcattagcag accgaaacag gagggaagga agtggtaacc caactccatt aataaacccc 1560 

ttggctggaa gagctcctta tgttggaatg gtaacaaaac cagcaaatga acaatcccag 1620 

gacttctcaa tacacaatga agattttcca gcattaccag gctccagcta taaagatcca 1680 

acatcaagta atgatgacag taaatctaat ttgaatacat ctggcaagac aacttcaagt 1740 

acagatggac ccaaattccc tggagataaa agttcaacaa cacaaaataa taaccagcag 1800 

aaaaaaggga tccaggtgtt acctgatggt cgggttacta acattcctca agggatggtg 1860 

acggaccaat ttggaatgat tggcctgtta acatttatca gggcagcaga gacagaccca 1920 

ggaatggtac atcttgcatt aggaagtgac ttaacaacat taggcctcaa tctgaactct 1980 
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cctgaaaatc tctaccccaa atttgcgtca ccctgggcat cttcaccttg tcgacctcaa 2040 

gacatagact tccatgttcc atctgagtac ttaacgaaca ttcacattag ggataagctg 2100 

gctgcaataa aacttggccg atatggtgaa gaccttctct tctatctcta ttacatgaat 2160 

ggaggagacg tattacaact tttagctgca gtggagcttt ttaaccgtga ttggagatac 2220 

cacaaagaag aacgagtatg gattaccagg gcaccaggca tggagccaac aatgaaaacc 2280 

aatacctatg agaggggaac atattacttc tttgactgtc ttaactggag gaaagtagct 2340 

aaggagttcc atctggaata tgacaaatta gaagaacggc ctcacctgcc atccaccttc 2400 

aactacaacc ctgctcagca agccttctaa aaaaaaaaaa aaaaaaaaaa aaaaagactt 2460 

cccttttctt ggggtatggc tgtctcagca caatactcaa cataactgca gaactgatgt 2520 

ggctcaggca ccctggtttt aattccttga ggatctggca attggcttac gcaaaaggtc 2580 

accatttgag gtcctgcctt actaattatg tgctgcccaa caactaaatt tgtaatttgt 2640 

ttttctctag tttgagcagg gtctgaattt tttcatttat ttcctttttt gccagcagac 2700 

agacttgagt ctgtaaagac aagcaaatac actgacagaa gtttaccata gtttctaaaa 2760 

tgtaaaaaag aaaaccccca aaagactcaa gaaaattaga ccacaaattt tgcattgttc 2820 

attgtagcac tattggtaat aaaataacaa atgtttgtgc atttttatgt gaagatcctt 2880 

ctcgtatttc atttggaaag atgagcaaga ggtctgcttc cttcatttta cttccccttc 2940 

tgtttttgaa aggcagtttc gccaagctta atgcaagaat atctgactgt ttagaagaaa 3000 

gatattgcca caatctctgg atggttttcc agggttgtgt tattactgag cttcatcttt 3060 

ccagaatgag caaaacactg tccagtcttt gttacgattt tgtaataaat gtgtacattt 3120 

tttttaaatt tttggacatc acatgaataa aggtatgtat gtacgaatgt gtatatatta 3180 

tatatatgac atctattttg gaaaatgttt gccctgctgt acctcatttt taggaggtgt 3240 

gcatggatgc aatatatgaa aatgggacat tctggaactg ctggtcaggg gactttgtcg 3300 

ccctgtgcac taaaagggcc agattttcag cagccaagga catccatacc caagtgaatg 3360 

tgatgggact taaaagaagt gaactgagac aattcacbct ggctgtttga acagcagcgt 3420 

ttcataggaa gagaaaaaaa gatcaatctt gtattttctg accacataaa ggcttcttct 3480 

ctttgtaata aagtagaaaa gctctcctca aaaaaaaaaa aaaaaaactc gag 3533 

<210> 60 
<211> 867 
<212> DNA 

<213> Homo sapiens 
<400> 60 

ggcacgagca ggtactgggt gactgcctgg ctgaggaaaa gttaactaga cacttgggga 60 

aaggagatcc aagggagtaa gaggcaaaat gcctttgcat gcttttcttc ctatctcttt 120 

ttctttctct ccttctcact ctctcccttc cttcctttct tcctttctct ttcttttttt 180 

tttctctttt cccccacctc tctgcctgcc tccttccttc cctcccctcc cctcccttcc 240 

ccctccctcc ctccctccct tccttccttc cttccttcct tccttccttc cttccctccc 300 

tcctctctcc ctccttccct gccttctttc cttcgttctg ccaacttgcc agaaggagcc 360 

caagaaaaag cacccagatg cttcagtcaa cttcttagaa ttcttctttt ttttatgttc 420 

agaaaagatg gaaattcatt tctgctaaag agaaagaaaa aattggaaga cagggtgaag 480 

gtgaacaggc ccattataag aaagaaacaa aaatctatat tctgtctaca aggaagcgag 540 

agagagaaag agagagaaga aagaagttcc aggattctaa tgtaccaaag ggatctcctt 600 

tttcttgttt tgttctgaaa atttcaccaa aagagcacag gagaacatct tggctaattc 660 

attggcgatg atgtaagaaa actgagagaa atgaaagaaa tgaagaatta ctgctgcaga 720 

taatatacag ccttgaggaa agaaaggctt ttaagattat agatataaag gctattgctg 780 

tattctggga taaaagaaag tctgatgtca gggaaagggg aagttggaaa aactggaaaa 840 

agaaaaaaga aaaaaaaaaa aaaaaaa 867 

<210> 61 
<211> 1558 
<212> DNA 

<213> Homo sapiens 



<400> 61 



ggttttggag tatatatatt gtatgccatg aactatattt 
catttaattg ccatagcact tacatggggc aggtattcat 
gaaactgaat ttcagagatg tcaggtaacc tgcctacttc 
tttaattttg aactaagatc tatctggctt gaaagctctt 
aatatacttg gaacgtaggt gtgtttttgg cacagaacat 
aacacagact tgcccagatt caaacttacc aatcttctgt 



ttctgcttat ggctttgcct 
tttcctgctt agcaaataag 
acacactagg agttttgatg 
tgcattaaac aaccttgaac 
ggcatgtgtg tgagggattg 
ttcatgtgcc cagaagaaac 



60 
120 
180 
240 
300 
360 
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agcctgtttc tcagcctcaa acccaaactt ctagttgtct tgattggttc agcctgactg 420 

tccaactctg atttatagct gtgattgggg gagctgagat tacacagtgt aggcaggcag 480 

aagggcccca ggcctattga tatgggtgag gacaatactc acgcactccc ttcacttact 540 

cactcttcca aggtcttggc ttgaacccaa ttttttttga gagaataaac caggcttttt 600 

gttctccact tggcctgact ccatttctgg cattccagcc atgtatttag ctgttatcag 660 

ctttcagatt tagascaaag ccttgtttcc aataagcttg tttctctgaa gtaattgtta 720 

aaatataatt ttcagaaaaa ggttaaatca tgactcatac aaatataaaa atgaacatgt 780 

gctaaagatt tttatttcac tcatgtgata tgaagtaacc agacagaagt tataaccagt 840 

acatatggaa agtcaaaaag cacaaattca tatgtagtaa aggaattgga ttgcaaatga 900 

aggcaaaact gtttttycta cagggtggag ggaagataat caaaatgcta gaaccagaat 960 

ttscatgcct gtcacttagc ttcaatttac aaaagcccag aataactcaa aggcaaattc 1020 

tagccctgca aatatcagcc ctaaagctgt gctgtggcca gtgcatagtt ttctattgaa 1080 

gtacaatttt ttccccaaat acattatctc tcagagggag tccaaattgc ttccctttca 1140 

ctcagcagat ctgttcagtc aacagatgtt aaatagctac agcgtatcag gcacaaataa 1200 

ttctttataa aataaagtaa caaactatat gttgtttcaa agttccagtt aaggccagcc 1260 

gtggtagctc acccttataa tcccaacact gggaggccga ggcaggcgga tcacttgggc 1320 

taggagttcc ataccagcat ggccaacatg gtgaaaccct gctctactag aatgcaaaga 1380 

ttagccaggt gtggtggcgc atgccggtag tccaggctac tcaggtggct gaggcacagg 1440 

aatggcttga gcctgggagg cggaggttgc agtgagccga gattgcgwcc gctgcactcc 1500 

agcctgggca acactgtgag actcctgtct acaaaaaaaa aaaaaaaaaa aactcgta 1558 

<210> 62 
<211> 2199 
<212> DNA 

<213> Homo sapiens 
<400> 62 

ggcacgagct tttccatctt gagcttggca gcctgtctag ttgtggaagc tgtggtgtgg 60 

aaatcggtga ccaagaatcg gacttcttat atgcgccaca cctgcatagt gaatatcgct 120 

gcctcccttc tggtcgccaa cacctggttc attgtggtcg ctgccatcca ggacaatcgc 180 

tacatactct gcaagacagc ctgtgtggct gccaccttct tcatccactt cttctacctc 240 

agcgtcttct tctggatgct gacactgggg cctcatgctg ttctatcgcc tggttttcat 300 

tctgcatgaa acaagcaggt ccactcagaa agccattgcc ttctgtcttg gctatggctg 360 

cccacttgcc atctcggtca tcacgctggg agccacccag ccccgggaag tctatacgag 420 

gaagaatgtc tgttggctca actgggagga caccaaggcc ctgctggctt tcgccatccc 480 

agcactgatc attgtggtgg tgaacataac catcactatt gtggtcatca ccaagatcct 540 

gaggccttcc attggagaca agccatgcaa gcaggagaag agcagcctgt ttcagatcag 600 

caagagcatt ggggtcctca caccactctt gggcctcact tggggttttg gtctcaccac 660 

tgtgttccca gggaccaacc ttgtgttcca tatcatattt gccatcctca atgtcttcca 720 

gggattattc attttactct ttggatgcct ctgggatctg aaggtacagg aagctttgct 780 

gaataagttt tcattgtcga gatggtcttc acagcactca aagtcaacat ccctgggttc 840 

atccacacct gtgttttcta tgagttctcc aatatcaagg agatttaaca atttgtttgg 900 

taaaacagga acgtataatg tttccacccc agaagcaacc agctcatccc tggaaaactc 960 

atccagtgct tcttcgttgc tcaactaaga acaggataat ccaacctacg tgacctcccg 1020 

gggacagtgg ctgtgctttt aaaaagagat gcttgcaaag caatggggaa cgtgttctcg 1080 

gggcaggttt ccgggagcag atgccaaaaa gactttttca tagagaagag gctttctttt 1140 

gtaaagacag aataaaaata attgttatgt ttctgtttgt tccctccccc tcccccttgt 1200 

gtgataccac atgtgtatag tatttaagtg aaactcaagc cctcaaggcc caacttctct 1260 

gtctatattg taatatagaa tttcgaagag acattttcac tttttacaca ttgggcacaa 1320 

agataagctt tgattaaagt agtaagtaaa aggctaccta ggaaatactt cagtgaattc 1380 

taagaaggaa ggaaggaaga aaggaaggaa agaagggagg gaaacaggga gaaagggaaa 1440 

aagaagaaaa agagaaagat gaaaatagga acaaataaag acaaacaaca ttaagggcca 1500 

tattgtaaga tttccatgtt aatgatctaa tataatcact cagtgcaaca ttgagaattt 1560 

ttttttaatg gctcaaaaat ggaaactgaa agcaagtcat ggggaatgaa tactttgggc 1620 

agtatcttcc tcatgtcttc ttagctaaga ggaggaaaaa aaggctgaaa aaatagggag 1680 

gaaattcctt catcagaacg acttcaagtg gataacaata tttataagaa atgaatggaa 1740 

ggaaatatga tcctcctgag actaactttg tatgttaagg tttgaactaa gtgaatgtat 1800 

ctgcagagga agtattacaa agatatgtca ttagatccca agtgctgatt aaatttttat 1860 

agtttatcag aaaagcctta tattttagtt tgttccacat tttgaaagca aaaaatatat 1920 

atttgatata cccttcaatt gccaaatttg atatgttgca ctgaagacag accctgtcat 1980 

atatttaatg gcttcaagca ggtacttctc tgtgcattat agaatagatt ttaataatct 2040 
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tatagcattg tatattatta ttgctgttgt cactgttatt attattgtgg atactggccc 2100 
ttggtgtgtt gcatagctcc ctatgtattc tctgtttcca tctttaagtt cccagaccaa 2160 
tatacattaa gagttttgaa aaaaaaaaaa aaaaaaaaa 2199 



<210> 63 

<211> 832 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 
<222> (827).. (829) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (831).. (831) 
<223> n equals a,t,g, or c 



<400> 63 

gaattcggca cgagtatgaa actaacaaca tagaatgccc cccaaacaaa ttcctctaac 60 

ctcactgagt ttacttgccc tattactatt tttttttttt aagatcttct gtctcttgtt 120 

tttgttttat cccttacctg atgaaagtga acatttctag tggagaaaga agatcacagt 180 

tctctaatat gggcattaag agaggggtac agctagaggg gaggtgaaaa cctgcctcca 240 

ctggggtgaa aaacagtgtg ctgaggtttc agccagtgat tacactgggt aatcaaccag 300 

tcccatgttt cacaaaggag ttgtaatgat taacagttca ggtatgctty tgaggaaatc 360 

taattgagac ctttggaaaa tagcattgtt atgaatggtg tggtgttacg ccctggaggg 420 

gaaaaggcta ggaaaaacat tttaactttt caagtgtatt taaattaaca tccaaatgtt 480 

tcagtgtgct ttactggaga ctgcctgagt ttggaattca aatattgtaa ccaaattact 540 

ccaggtttct gaactaaaat gatctattga tgtttctcaa agtatagatc acagagtaag 600 

aaaagaggaa atcaagtctg gtttatgaca aacttttttc catgttaaca ttggacccaa 660 

agatgttamt aagagctttt tactactgtg agagraccag cgtgatgtga agacaacgaa 720' 

cattttaaga agtttgacta gtagacattt cgtttaagtc ttttggaggg tcttggttga 780 

caacccacaa ttttattgtg gctccccagg ctgggagaac gtggaannnc na 832 

<210> 64 

<211> 1336 

<212> DNA 

<213> Homo sapiens 

<400> 64 

ccacgcgtcc ggagttccac aaaatttgtt agtcatcaaa taatagagtt tttcttaagt 60 

gaccacttac atactcatct acaataaacc ccaactaact aatttttcac tttgtgtacc 120 

tcatgctaat atccgtggac agtaatgtgc ctgttgtgtt ccttttgctt ttcatccttg 180 

tgatcttatg tcacatggaa tgtaaaggcc acatatatat atgtgtgtgt gtgtgtgtgt 240 

atatgtatat ttttaagaat atttagtctg gatttcatga aattgacttc tgaaataatt 300 

tgcctcaatt ttgtttcctg gtggtttgag aagaaagttc ctgtggtgaa atgaaaaggg 360 

gataaaggga agtacttatt ttaaaacata agtaacttgt ggattgttga atactggaaa 420 

aagagtgtta cttccccgtt aacctacgcc tcgtgtaatc cttcaggttg gaagtcggat 480 

cgcagaccgt gtatatgaca tacccagaaa tttccccctt gctttggatc ttggttgtgg 540 

aagaggttac attgcacaat atttgaataa gcttcagtta ttccattgca ggaaactatt 600 

ggaaagtttt tccaagctga cattgcagaa aatgctttgt ttgcattggg tgaatgacct 660 

tcctagagca cttgagcaga ttcattatat tttaaaacca gatggagtgt ttatcggtgc 720 

aatgtttgga ggcgacacac tctatgaact tcggtgttcc ttacagttag cggaaacgga 780 

aagggaagga ggattttctc cacacatttc tcctttcact gctgtcaatg acctgggaca 840 

tctgcttggg agagctggct ttaatactct gactgtggac actgatgaaa ttcaagttaa 900 

ctatcctgga atgtttgaat tgatggaaga tttacaagaa caaaagtcca gaatgttgac 960 

ctaattttac aaaacaagct gcatatcagc tgatgaatgc atgagaaatt ttcaaggctt 1020 

tcacagtggt cttaaggtat gggtgagagt aactgtgctt ggaatagaaa agccctgctg 1080 

catcgagaca caatgctggc agctgcggca gtgtacagag aaatgtacag aaatgaagat 1140 

ggttcagtac ctgctacata ccagatctat tacatgatag gatggaaata tcatgagtca 1200 
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caggcaagac cagctgaaag aggttccgca actgtgtcat ttggagagct aggaaaaata 1260 

aacaacctta tgccaccggg gaaaaaatca caataaatat ttattcagtg ttaaaaaaaa 1320 

aaaaaaaaaa aaaaaa 1336 

<210> 65 

<211> 799 

<212> DNA 

<213> Homo sapiens 

<400> 65 

ggagacggtg ggtgaccaga gagtcctgtc tatcctagga ggagaacatt cagcccaaat 60 

cccagcccca tcatgcacag atcagagcca tttctgaaaa tgtcgctgct gattctgctt 120 

ttcctgggat tggcagaagc ctgtactcct cgtgaagtca acttgctgaa agggatcata 180 

ggtctcatga gcagactgtc accggatgag atcctaggct tgctgagcct ccaagtactg 240 

catgaagaaa caagtggctg caaggaggaa gttaaaccct tctcaggcac caccccatcc 300 

aggaaaccac tccccaagag gaagaacacg tggaacttcc tgaaatgcgc ctacatggtg 360 

atgacctacc tcttcgtatc ctacaacaaa ggggactggt tcactttttc ctcccaagtg 420 

ttactgccac tactgtaact tggaactgga catcagggat gatccctgct gttctttcta 480 

gtgagcctgc tccatctcag cttagccttc acaaggcctc catctcccag gcattctaac 540 

ctctgaagaa agctctctgt cccctggact gcctgtgtgg agggtaatga actgggtcct 600 

ttaaggaatg gcacctgggt gcccagaggc atggccagaa ggtgtctgtg ggggccatgc 660 

cttaggggga tgcacccagg gcggctgaga gagcaactgc aggagtttcc cctaaaatct 720 

ctcctccaga tcgttctcga actttcccca ctacttccat aataaaatgt atacttgttg 780 

aaaaaaaaaa aaaaaaaaa 799 



<210> 66 

<211> 1347 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (83) . . (83) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (334) . . (334) 
<223> n equals a,t,g, or c 



<400> 66 

gggcagccct caggccctcc ggcagcctgg ccgggcccga gtggccatgg cagcactggt 60 

gtggctgctg gcgggagcac atngtcaagc ctcaacaagt ggatcttcac agtgcacggc 120 

tttgggcggc ccctgctgct gtcggccctg cacatgctgg tggcagccct ggcatgccac 180 

cggggggcac ggcgccccat gccaggcggc actcgctgcc gagtcctact gctcagtctc 240 

acctttggca cgtccatggc ctgcggcaac gtgggcctaa ggctgtgccc ctggacctgg 300 

cacaactggt tactaccacc acacctctgt tcancctggc cctgtcggcg ctgctgctgg 360 

gccgccgcca ccacccactt cagttggccg ccatgggtcc gctctgcctg ggggccgcct 420 

gcagcctggc tggagagttc cggacacccc ctaccggctg tggcttcctg ctcgcagcca 480 

cctgcctccg cggactcaag tcggttcagc aaagtgccct gctgcaggag gagaggctgg 540 

acgcggtgac cctgctttac gccacctcgc tgcccagctt ctgcctgctg gcgggtgcag 600 

ccctggtgct ggaggctggc gttgccccac cgcccactgc tggcgactct cgcctctggg 660 

cctgcatcct gctcagctgc ctcctgtctg ttctctataa cctggccagc ttctccctgc 720 

tggccctcac ctctgccctc accgtccacg tcctgggcaa cctcaccgtg gtgggcaacc 780 

tcatcctgtc ccggctgttg tttggcagcc gcctcagtgc cctcagctac gtgggcatcg 840 

cactcactct ttcaggaatg ttcctttacc acaactgcga rttcgtggcc tcctgggctg 900 

cccgtcgggg gctgtggcgg agggaccagc ccagcaaggg tctttgagac ctgggggatc 960 

tcaggagcca cctgggatgg ccctggcctg aatccagcct ccgctgtggc catagaagga 1020 

atggagaaca gggctgggca tggtsgctca cgcctataat cccagcactt ccagagtccg 1080 

aggtgggtgg atcacctgag gccaggagtt cgagaccagc ctggctagca tggcaaaacc 1140 

tcatctctac taaaaataga aaaattagct gggcatggtg gcgcgtgcct atagtcccag 1200 
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ctacatggga ggctgaggtg ggaggatcac ttgagccctg gagatcgagg ctgcagtaag 1260 
ccaagatcgc atgctactgc actccagcct gggagacaga gcgagacgct gtctcaatta 1320 
aaaaaaaaaa aaaaaaccgg cacgtag 1347 



<210> 67 
<211> 642 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> miscjEeature 

<222> (41).. (41) 

<223> n equals a,t,g, or c 



<220> 

<221> misc_feature 

<222> (49) . . (49) 

<223> n equals a,t,g, or c 

<220> 

<221> misc__f eature 

<222> (64) . . (64) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (607) . . (607) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (621) . . (621) 
<223> n equals a,t,g, or c 



<400> 67 

taaaggggaa acaaaaaggc tgggaaggct tccaaccgcg ntttgcggnc cggcttctag 60 

gaantagtgg aatcccccgg gctggcagga attcggcacg agaaaatgac ttcagacaaa 120 

tatgatcaat ctctacagtc ccctgatgaa tttcacaggt tcccaccacc atcagttcta 180 

cctattcatc tcatccatgc tcattgttct gcctctttcc tgttcctatg gatgccctgg 240 

catgtttgtt tctttctttc tggcctccta tttccctccc ctcagacatc acttcagcat 300 

ctgtgccttc tcacttccct tatcctgggt gttaccattt cagcctatga gcatgccatt 360 

aatttgccat ctttacaaaa ttctctcttg acttcacatc cctctgtagc tgccctctcc 420 

cttctctcct cttctttaca acaaaactcc ttaaaagaac tgttggctgg gcacagtggt 480 

tcactcctat aatcccagca ctttcagaag ccaaggtggg aacatcactt gaggccaaga 540 

ggtcgagacc agcccaggca acacagtgag acctcatcac tacaaaaaaa aaaaaaaaaa 600 

actcganggg ggggccggta nccaattggc ctaaagtgag tc 642 



<210> 68 
<211> 802 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<222> (105) . . (105) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_ feature 
<222> (730) . . (730) 
<223> n equals a,t,g, or c 
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<220> 

<221> misc_f eature 
<222> (755) . . (755) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (757) . . (757) 
<223> n equals a,t,g, or c 

<220> 

<221> mis cofeature 
<222> (777) . . (777) 
<223> n equals a,t,g, or c 

<400> 68 

cgtgcctgta gtaagctcat ccctgccttt gagatggtga tgcgtgccaa ggacaatgtt 60 

taccacctgg actgctttgc atgtcagctt tgtaatcaga gattntgtgt tggagacaaa 120 

tttttcctaa agaataacwt gaycctttgc caracggact acgaggaagg tttaatgaaa 180 

gaaggttatg caccccmggt tcgctgatct atcaacatca ccccattaag aatacaaagc 240 

actacattct tttatctttt ttgctccaca tgtacataag aattgacaca ggaacctact 300 

gaatagcgta gatataggaa ggcaggatgg ttatatggaa taaaaggcgg actgcatctg 360 

tatgtagtga aattgcccca gttcagagtt gaatgtttat tattaaagaa aaaagtaatg 420 

tacatatggc tggatttttt tgcttgctat tcgtttttgt gtcacttggc atgagatgtt 480 

tattttggac tattgtatat aatgtattgt aatatttgaa gcacaaatgt aatacagttt 540 

tattgtgtta ccatttgtgt tccatttgct yctttgtatt gttgcattta gtacaatcag 600 

tgtttaaact tactgtatat ttatgctttc tgtatttacc agctatttta aatgagctgt 660 

aactttctag taaagaattg aaaagcaaat cctcactaaa ggatacacag gataggataa 720 

agccaagtcn catcaacatt aaaaaatact aaaananaaa acacaaaaaa aaaaaanccc 780 

ggggggggcc cggaacccat tc 802 

<210> 69 
<211> 470 
<212> DNA 

<213> Homo sapiens 
<400> 69 

ggcacgaggg aaatcttgca cataggcagg taaataatta taaatggtga agtggattat 60 
tctgagctgc ttaattttaa agggaaagag aactttaaac tcttcaacct tttatgctgc 120 
taataagagt tccacaatca atagaaatct atcttggcag gcacttcctt ttacccacta 180 
gaattttttc ccttgggagt tcacgatccc cagaaactgt gatatgagcc attcaatatt 240 
gatgtactaa aacagtgctc tgcttaaata cagtttttca acatacagtc ttggaagaaa 300 
caaaatccaa aataaattcc aatagtccag taacaggaat aaagacaact attgcaaatt 360 
aaatcttaca gacttatatg aaagctgttg ttaacagctg ggtactagtt atttgaaaag 420 
tttctcgtgc cgaattcgat atcaagctta tcgataccgt cgacctcgta 470 

<210> 70 
<211> 1881 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<222> (70) . . (70) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (126) . . (126) 

<223> n equals a,t,g, or c 
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<220> 

<221> misc_f eature 
<222> (1860) . . (1860) 
<223> n equals a,t,g, or c 



<400> 70 " 

atttttcctt 

actaactttn 

acagcnctgc 

gtcaagtctg 

tcatgctcac 

acttaaagga 

aactgcaaga 

ccagagtgtt 

gtacagatgt 

agctgctggg 

aatattggag 

atactttggg 

caatggaata 

atgcccagca 

tatctacata 

attaaacttt 

cctcccaagt 

gctaaaatta 

tgacaagctt 

ttttcattat 

agaatttata 

ttttcttaat 

cagtggtctg 

tgaccacaga 

gggttggtat 

tgttcagcaa 
tattgtgttc 
tgttgctctt 
ctgcttacta 
gttycctgga 
tttatcaaga 
ctgcggccga 



ttccttaaca 

cttcagcctt 

tagtagagcg 

tctaatctaa 

ccttctcttc 

tgcgagcaaa 

tatccagtct 

tcggccgacg 

gtatcccacc 

atgtatctct 

gtttcatgtt 

aatttgtggc 

tgtcacactc 

tcaagggata 

ggaaataaaa 

ggatcaaaga 

ccaggaccca 

gctacaattc 

tccagaagca 

tcttgagaaa 

tcataactag 

tattaagaag 

gatacgaagt 

cctctaacct 

ttcatttttt 

gagaactgct 

cagaacatta 

tatgaaatga 

tagggttaac 

ttttataaca 

tgctattaaa 

caagggaatt 



atacctttgg 
ttaatgcgaa 
aagtatttat 
ctagcgcctc 
cagatccact 
atgcaggctt 
cggtcaaaag 
tatttacagc 
aaactaatgt 
aggaaaacct 
agccatttta 
atgttcacat 
cttaataggg 
agaccgtaaa 
tggaattgat 
taatgtgatt 
ttaatttcct 
tgctgaccct 
attttataac 
tgtcagcttt 
agacccaaat 
tgttacctta 
ttacagtttc 
cctactttta 
tcttatctaa 
tgagtttaag 
agtgactgta 
tccatataac 
tatgtatatt 
tacatgtgca 
attgaacatc 
c 



ccattttttt 
gcaactagta 
taatacagaa 
gctttgcctt 
tcctcatgat 
accaggatat 
aacaactcaa 
tctgacaaat 
agatgtacaa 
tccagtgggt 
aaaggcaaca 
ttattgctag 
acctgtgact 
ttcacatata 
gtaaagttcc 
cttatgattg 
gagccaatca 
actattaaag 
agatttcatt 
gccttaatga 
atctttcaca 
ttaaaatgac 
ataccaacta 
tagacttgaa 
atcttagttt 
ccattttcaa 
ggtactgggt 
tgttgggtgc 
cattgttaag 
gaaatgtatt 
aagtataaaa 



ccagttcact 
gagcatgctt 
ttaaccttmg 
ctcacaatgc 
actgtcttct 
caaagcaaag 
ttcttacaca 
catcagacag 
acacttcact 
aaatcttttt 
ctttgacaaa 
ggcaattcta 
ccttaataag 
tgccatctgt 
atttctgaca 
atttctcaaa 
gaaatatatt 
aatctggatg 
ttaacaaaat 
gtatttgctt 
gaattttgtt 
caccattcta 
tctaaaacct 
tacttaagta 
cctggaataa 
aagaaacttg 
attagtgatg 
atcagtgctt 
agttaacttg 
caaatgaaag 
aaaaaaaaaa 



atgtttgtat 
tcaggatctg 
cccctttaaa 
tcactagcca 
aactgggctt 
gaagaacagg 
taaatgtttg 
ctgctctgca 
gtctgtctca 
ctttagaaca 
atgatcgttc 
ccaagacact 
gacctgtgac 
cctcaagtgt 
gctgacattt 
ctagcttttc 
tttcaataat 
ctggactcac 
actgatccaa 
taaatttcta 
ccataaatgt 
aaccattttt 
aattgcaaat 
atttaaatta 
taaagtttga 
ccttttacat 
gtaaactttg 
ttcaaagggg 
tggtttggct 
gaagcatacc 
aaaaaaattn. 



<210> 71 
<211> 541 
<212> DNA 

<213> Homo sapiens 



<400> 71 

tgcaggtaac 

tggtcttctt 

gggggtgtcc 

gtagcgccgc 
gtcatgctga 
tctggtgtgt 
tgtgtccagc 
agcagagctg 
attagccaag 
c 



cgttccggaa 
gcacagggcc 
cgtgcattgc 
accccgccgt 
gaggtacagg 
ctctcccctg 
tgccctgggc 
gaccagcctg 
cgtggtggaa 



ttcccgggtt 
tcagcctggc 
aggacatcca 
cgtgacagcc 
ggtgctgccc 
tgtgtagcgt 
tgcaagcagg 
gccaacatgg 
ctctgtctca 



cgacccaagg 
acctctgcca 
gcagcatccc 
caggtctccc 
ccagggtttg 
ggagtcactg 
tccctcctgc 
tgaaacccca 
aaaaaaaaaa 



ggtcccgtga 
ccgtgggctc 
cggcctcctg 
ggtgtgcaga 
aacgctgtct 
gatgagtgtg 
agccctccag 
tctctactaa 
aaaaaaaaaa 



tgccttgtca 
tctgttgtgt 
ctccgtgcca 
atgcccgctg 
aactcccacc 
gtgacctccc 
gccaccctta 
aaatacaaaa 
aaagggcggc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
°1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1881 



60 
120 
180 
240 
300 
360 
420 
480 
540 
541 



<210> 72 
<211> 762 
<212> DNA 

<213> Homo sapiens 
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<400> 72 

ggcacgagtg cctctacgtc atgtcttctc caatgtcatg attcacgtcg tgcagtactg 60 

ttttggactt gtctattatg tccttgttgg cctaactgtg ctgagccaag tgccaatgga 120 

tggcaggaat gcctacataa cagggaaaaa tctattgatg caagcacggt ggttccatat 180 

tcttgggatg atgatgttca tctggtcatc tgcccatcag tataagtgcc catgttattc 240 

gcggcaatct caggaaaaat aaagcaggag tggtcattca ctgtaaccac aggatcccat 300 

ttggagactg gtttgaatat gtttcttccc ctaactactt agcagagctg atgatctacg 360 

tttccatggc cgtcaccttt gggttccaca acttaacttg gtggctagtg gtgacaaatg 420 

tcttctttaa tcaggccctg tctgcctttc tcagccacca attctacaaa agcaaatttg 480 

tctcttaccc gaagcatagg aaagctttcc taccattttt gttttaagtt aacctcagtc 540 

atgaagaatg caaaccaggt gatggtttca atgcctaagg acagtgaagt ctggagccca 600 

aagtacagtt tcagcaaagc tgtttgaaac tctccattcc atttctatac cccacaagtt 660 

ttcactgaat gagcatggca gtgcqactca agaaaatgaa tctccaaagt atcttcaaag 720 

aataaatact aatggcagaa aaaaaaaaaa aaaaaaaaaa aa 762 

<210> 73 

<211> 1103 

<212> DNA 

<213> Homo sapiens 

<400> 73 

gtcttaatga gcaacagcaa cagcagtctc cagttaagaa agagagaatt aaatacagca 60 

gagatttcct gttgaagctc tcaagtgttt ccatctgcag aaaaaaacca gactttctgc 120 

ctgatcatcc cattgtactg caaaaaccag aaaacaacca aagttttaag tagcatttta 180 

agaacagatg aatttaagtt tggacatctg caaatgaggt ggatctagca acaataactg 240 

taatggactg tgacaattca atttattctt aattttgatg gttggctatt tgacttctct 300 

aaaaatgaga aagagctatt ttaaaatata aagaattttc taatcagttt cagctttgca 360 

ggaggtttcc tgcataaatt gggaagtaac actggaaagt aggaatttgg ttagtgaagt 420 

gggaagactg tatatttata atttgcatac tacttgcaat tttttgtttt tcatcacttg 480 

taataatgga atggaaatgt aagctgtaaa gactctcaaa tataaaatat ttgctacagt 540 

gtatatatgg tacataattg cttgttgctt ttaaagttcc ttctgttgtt ctgcttccca 600 

ctgatttcat accagctcat gaatggatca ttacagtctc tccagaggct tagaatgatt 660 

cagaatgttc aatgcatagt tctcaataaa caggaggcag aatttttaat gggtatttct 720 

tttcagatat atgattggtc tctaggtttt tgataataat atggtcttaa attcataatt 780 

actagcagag attgataatt tggaaacaat ggtagtgaat gaaactgaag ttgaaaaacg 840 

gctgctactt atgtcactaa tcagaccata tgaatagcag aagttgagca atttcaaagt 900 

aaaactgata tttttatttc caaaggaatt tagacatttg aaaataattg acatacatta 960 

agttttaatt cgataatttc ttatatatgg atgaacaatt tttgggttta agcttttaat 1020 

tcctagaaat tttatacatt aaatctcctg caatttgtca ctctggatgt tactgtttaa 1080 

aaaaaaaaaa aaaaaactcg tag 1103 

<210> 74 

<211> 1633 

<212> DNA 

<213> Homo sapiens 

<400> 74 

ggcacgagca tcagtaagta ggagctaggg aagagaaagc atgcaaaagg gcaagaatca 60 

ggagaattga tatagtagct ggacagaact ctagatgtgt gtgtgtgaga gaaagagagg 120 

cagggagaag gagggaggag ttacccccac gatgacctcc aacttccctt tctgcaccct 180 

catcctgggg atagcacagg ctcaggcctg ccctggttgc cctggcgatt ggcctggcct 240 

gggctcaggg gtgggggagg ggctgcacca cattaggacc tgccgtactc caatcccatg 300 

cagtcctcct gctcctgctg ctgcgtgcct gggctctggt catgccaggc ttccttgtgt 360 

cctgcgtctg tggccggttc ctgccaacct gtctagtcct ttcaggcttg aggccctgca 420 

ttgctctttc tggtcctctc ccctccttcc cgctccccat ctagcctttt ttgggttccg 480 

ggatctgctg acagactttc ttcttgctgc ctgcctgctt acatttcaga agacccctct 540 

ggaactgccc atggctgtgg tccacctgct ggtagcaacg ccctgttacc aaatgctaga 600 

taatctgcca ctcccctctg cagccgccaa ctggtgctga gctgccacct gttgtctgtt 660 

tccacaccct tgtacactgc attctgcctg tcactctgga cagctgccca gcagctgcag 720 

ctgaggccgt cctgccagca cctaccgcag ccccatgggt ggtcctgtcc agttcctggc 780 
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gctcccagcc tgcctcatct gctgtactta cagagccaca tctggttgaa tacagagctg 840 

gggtagcctg gatgggccca tggcacactc actcccagag aggcgggcat cctctctctg 900 

gacctctagg gaagggcagg gtctggagcc caataataaa acacagtaat gataatcata 960 

gctaatgttt actgagaact taggatgtga taggctcaat gatttaccag aattactcta 1020 

tactgcataa ctaccctatc aggtaagtac tattaatatc ccctttccca catgaagaaa 1080 

ctaaggctaa ccaatagaaa tgaacctgtg taatgtccca tgggcataat tcatggagcc 1140 

aggattcaga accaggtgct ttacttcagg gctgaagctc ataaccacta gaccaaagcg 1200 

cctcctccgt gacctctggg gaaggcccag caacatcctc tattgcgtcc aatgacttct 1260 

ccctggtctg agatccactg actcacaggg gtagggttaa ggttaaggcc agagtctcag 1320 

ctaagctttg gatactttct tctggatttg gagaaggctg gaataactga atttctgctc 1380 

atcttcagga gcggcctact agagccacac atttcccagc tctccttgtg tgtttcccag 1440 

acccttcttc catcgtgtcc tctcccttag gctcctggaa agttttcaga gagaatcacc 1500 

cagtggtaac attgttaaac aaaacaggaa aatgggactt gtgtgtatat atgattaaat 1560 

tattaattga tggatctacc ttcttagctc gtgccgaatt cgatatcaag cttatcgata 1620 

ccgtcgacct cga 1633 

<210> 75 

<211> 1384 

<212> DNA 

<213> Homo sapiens 

<400> 75 

ggcacgagca gttattttca aaatggctat ggaaaacacg taagttttaa aatatgccct 60 

ctttctcgtt ttaaaaaatt attactattg tccatacatg ttactctttt catctagatt 120 

tatcatgttt ctttggcctc cagtctctgg tgtttgccta agctttatta gagacaggtc 180 

atttctacct atgtgtcatt ttatctatgt cttgatctta tgtaattcaa ttgctcttta 240 

agattatgtt ctcttctcat gtttggttta tccattatcc aaattttcca tttctttaac 300 

ctgttatccc ttgactcttt acagttctac ctttttattc acttagtctt ttaccctttt 360 

tttattcgtt cacccctttt tgttgtttca ggtactcctt acttatctcc ttagcctttt 420 

cttcttcatc ttctttctta cttttctcct acttctcatt ttacataata cttacttttt 480 

gcttcagtct tcaaccattg tcaatcttgt ttttccttat attccatttt actttctgaa 540 

ctactcttta atctcctgtt caacactacc tttccttctt ttttatcccc tcttatttac 600 

acggtgatta caacagtttg gtatagtctg atttatctga ttgtaaaatt gatgagttgg 660 

atgtaccaaa aatataagga agctaaattc aaagaaggta aaagatttgc ttgtgtcacc 720 

tagctggtta attttggcat atgcattgtt tctctacata gtctatgtag tcaaacaggt 780 

ttcatttaga aatcattccc cataagaagg gtttcaattt gatttgaaca ggcagagatg 840 

gaaaaaattt cctctctgat aactactgct actgttgtat accagtagaa atataacagc 900 

agcacttagg ttagaagaag ctcattagct attcagaata aatttcattt ttcttaattt 960 

ttggtaatca tatctcagcc tgttgaattt aacttaaact ctgaaagaat tttggttgcc 1020 

atttaatttt taggtttcct taatgatagg gacctaataa tttgttttaa aaaatttgtc 1080 

ttggctggga gcagtggctc atgcctgtaa tcccagcact ttaggaagcc aacattggag 1140 

gattgcatga gcccaggatt tcgagaccag cctgggcaac acagtgaaac ctcatctcta 1200 

caaaaagtta aaaaattaac caactgtggt gccacatgcc tgtagtccca gctgcttggg 1260 

aggatgaggt gagaggattt cttgagtcca ggagtttgag gctgcagtga gctatgatca 1320 

cactcctgct cttcagccta ggtgacacag caggacacta tctttgaaaa aaaaaaaaaa 1380 

aaaa 1384 

<210> 76 
<211> 1715 
<212> DNA 

<213> Homo sapiens 
<400> 76 

ggcacgaggg acattggagc tccccacacc actcattgct gcccaccagc tatacaacta 60 

cgtggctgat cacgccagct cttaccacat gaagccattg cgaatggccc ggccaggggg 120 

cccagaacac aacgagtatg ccctggtgtc ggcatggcac agttctggct cctacctgga 180 

ctctgaggga cttcgacacc aggatgactt tgatgtgtct ctgcttgtct gtcactgtgc 240 

tgcacccttt gaggagcaag gagaggctga gcggcacgtt ctgcggctac agttcttcgt 300 

ggtgctcacc agccagcgag agctcttccc caggctcact gctgacatgc gccgcttccg 360 

gaagccaccc agactgcccc ctgagccaga ggctcctggg agttcagctg gcagccctgg 420 

ggaggcctca gggcttattc tagcgcctgg accggctcct ctgttcccac cactggctgc 480 
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agaggtgggc atggcacgag cacggctggc tcagctggtg cggctggctg gagggcactg 540 

ccgtcgggac accctttgga agcgcctctt cttgctggag ccaccggggc ctgatcgact 600 

gcggctaggg gggcgcctgg ccctggcaga gctggaggaa ctcctagaag cagtccatgc 660 

caaatccatt ggggacatcg acccccagct ggactgcttc ctatccatga cggtctcctg 720 

gtaccagagc ctgatcaaag ttctcctaag ccgcttcccc agagctgtcg ccatttccaa 780 

agcccagact tgggaactca gtacctggtt gcgctgaatc agaagttcac tgactgctct 840 

gcgctagtgt tctggactcc acttaggaaa gacgtctctg aagtggtttt ccgagaagcc 900 

cttccagtac agccccagga cacgagaagc ccccctgccc aactggtctc cacctaccac 960 

cacctggagt ctgtcatcaa cacagcctgt ttcacccttc tggacccgcc tcctctgaag 1020 

ggagtggact ggaccactga atgtcactgt tccttgaatc atgggcctac cagattgcct 1080 

gccagaggca ggactgacca gcccttctgg gccccagggc aagccagaca ctgagtgaca 1140 

ccaaaggctt tgtaactatg tcttgagggt ctgctgcccc agcctggcag caggaaccgc 1200 

cctccccaaa cacccacagc cactgaccca tccaggactc cagagagtca ggtcaacccc 1260 

gaggacccct tgggcccttc tggggtactc ctttcggccc ccctggtaga gtctcgggag 1320 

ttcacacagg gtggcaaaca ccccctagag ctcctctgcc tgaatcctgc cccctagcct 1380 

ttgaccactg tcagccacct gtgtcccttg agccttcggg tcttcacttc ccacttggac 1440 

atcactgctg gacattccca tcgagatgac acctgggttc caatcccagc tctgcctttg 1500 

aagcacttgc ggccaccgtc aagtcccttt gctctcggac cctgggtttc tcatccttta 1560 

atgaggtggg ttcagaagct ctcccatctt cacagcaacc ctggcactgg cttctcaatg 1620 

ggagggaagt cagcagagaa actgaagtgt tagacactat gtgtcccacc accccattac 1680 

agagacatat gacaatgaaa aaaaaaaaaa aaaaa 1715 

<210> 77 
<211> 1437 
<212> DNA 

<213> Homo sapiens 
<400> 77 

cgcccgacgc cggaactgcg agctctcagc gggagccgag acggtgcagg gccggagaag 60 

caccttcact cccagcctgc gccccgatgc tgcgcgttct gtgcctcctg cgcccctgga 120 

ggccccttcg ggcccgcggc tgcgcttccg acggggcggc cgggggctca gagatccaag 180 

tgcgcgccct ggcgggtccg gaccaaggga tcactgagat tctgatgaac agaccttctg 240 

cccgcaatgc cttggggaat gtcttcgtca gtgagctgct ggaaactctg gcccagctgc 300 

gggaggaccg gcaagtgcgt gtcctgctct tcagaagtgg agtgaagggc gtgttctgtg 360 

caggtgcaga cctgaaggag cgggaacaga tgagtgaagc agaggtgggg gtgtttgtcc 420 

agcgactccg gggcctgatg aatgacatcg cagccttccc tgcacccacc attgcggcta 480 

tggatgggtt tgccttgggc ggaggcctag agcttgccct ggcctgtgac ctccgagtgg 540 

cagcttcctc ggcagtcatg ggactgattg agaccacgcg agggctcctc ccgggggcag 600 

gagggactca raggctgccc cgttgtctgg gggtggccct ggcgaaggag ctcatcttca 660 

cgggccgacg actgagtgga actgaggccc acgtactggg gctggtgaat cacgctgtgg 720 

cccagaacga ggagggggac gccgcctacc agcgggcacg agcactggcc caggagatcc 780 

tgccccaggc ccccattgcc gtgcggctgg gcaaagtagc cattgaccga ggaacggagg 840 

tggacattgc atctgggatg gccattgaag ggatgtgcta tgcccagaat attccaaccc 900 

gggaccggct agagggcatg gcagccttca gggagaagcg gactcccaaa tttgttggca 960 

aatgaccccc attttaacct tcagcatggg agatgcatgc cctgaagagc aggatccaga 1020 

aggaagattt gtggccagat tgccttcatc atttcacctc tccagacttc catttcttca 1080 

caaggatgat gatggaaata aaatgactgg cgtgatgcct ggaaccaagg tgctgatcct 1140 

accacctact gctaccttcc ttagcttcac cctggctaga aataatcacg agggttgggt 1200 

ttgctttgga aaatgcctgt ctctctactt gaatgataaa gaattaaatt agatctctct 1260 

gagtcttggt atcattggct ctcagcccct gacctctctc agttatcagg cactcattag 1320 

agatgtcaga agattttaag atacccctag tttcttcctg tggaacaaca gaggtaataa 1380 

ataaactctg gacatcggtt gaaccagtgt caggggtcag actgcagatc ccagtct 1437 

<210> 78 
<211> 776 
<212> DNA 

<213> Homo sapiens 
<400> 78 

ggcacgagct gatttctatt tttaggagct acttggattt gtatgtattt tttctacgtg 60 

aaaatatatg tactcttcac ttttgttcca gtactataat tgctcatgca ctctttctcc 120 
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cctttgagaa cattcagtga aatacaactt catcaaagat ttgctcaaag gagaagaatc 180 

gcatgagtgt gaaaagtaga tgctcgtagc cagaacagaa aaggttacac atgatcatgg 240 

cacagaagat aggaggtttg acttggtggg ccataatgtt tattatcctt tttgaaataa 300 

cagggaccag cagcagtttt ctcaggataa atgctctacc ccacttctct atgaacaggt 360 

gtggggaggc ttactttcca ttttcatatt tatacacctc tctacaaaag caatttttaa 420 

tgaaggttag tggaattgtt aaaaatctga gagggaatga tgactggagg tgttttgggg 480 

tttttttctg tattcatttt ttaatgagaa aagttttaaa tgtagtacag gttagaccca 540 

actactacct tactattata ggacgattct atgtttctgt taaagtattc aagtagcttt 600 

ctctggggga aaaagtacca cttggacact taaaggaatt gggatttttg tctactttgg 660 

ataaggcagt tgacttctta agtaaaagca atagtgtaaa atgtcatttt gtttggaatg 720 

ttaagtgagc aaataaaaaa catgttgaaa ttgtaaaaaa aaaaaaaaaa aaaaaa 776 

<210> 79 

<211> 1155 

<212> DNA 

<213> Homo sapiens 

<400> 79 

ccgggtcgac ccacgcgtcc ggtgagtgca ctctaggatg ttcacatgat gacaaaatca 60 

cctaacaatg tagacgcttc agaacatata ccctttgtta atcgatgcat cactgtatat 120 

atgtgtgtat atacacacat atatgtacat atatttaata catttgtgta tgtgtgtgta 180 

tatatatata tatatacttc tcattattta tactctagac ccagagcctc ctagctggtc 240 

tccaaaattg gactctcatc tctctttgag acagccttca aatgatcgtt tttaaagtgc 300 

taattaactc ctcttctcaa aatgcttcaa tggcccacta atctctaccg aatcaaggaa 360 

ttcagccata ctgtcccaag atatctttcc ttggccagtt ggagcctcat ttcagctgct 420 

ctgggtttat cccctgtctc ttctttccca cttccaagcc tgtgctcagc ccacctcctc 480 

ttctggggat gccccacacc ccactctgcc atatctgcca aacctttcat ctccccgtga 540 

agctcttgac accaaataca gtttacttta gaaaatgtat tttttccact ttctcaacta 600 

aacttttcct tgtgtgatct gcttttccgc tgccaaggca catcgttttt aattctctac 660 

agcactgctc atatcttgcc cagtattata gcttctacat attggtcttg cttcttattt 720 

ttgagcacaa aaactaagcc actccacttt ctcttaccag tgaatccagc ttaaaaaaac 780 

tgtgagcaac ctatcagtat tttgttgaca tgaactctat agaaacctta gtccctggat 840 

cttcgactct gcctcccctg acatttatct gctcccacaa agcacgcagg tgtgggaaga 900 

gaagtggctg ttttttgagg tcacatttca gccctgattc atcctaatgt cttcaccctt 960 

tttatccttt ggscactgtg tycctagaga tgtgaattca attccgcacc attctctcct 1020 

ttacaatgat gccaatattc tcaggctttt aagactaaat tttaaattac gagaaaattt 1080 

gatcttcaaa cttaagttgg acctagaaag aacaatctca tgaactcaaa aaaaaaaaaa 1140 

aaaaaaaaaa aaaaa 1155 

<210> 80 

<211> 407 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> (352) . . (352) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (376).. (376) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (378) . . (378) 
<223> n equals a,t,g, or c 

<400> 80 

ctttctgggt ttagggaagt ggtggacaag gcaggagaga accacattca tcttctcctc 60 
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ttgtgtttgt cttctgtctt tcaataacgt ccatgaactg tgaggttagt gtcttggctg 120 

agagataagt awggctkggc atkgattctt ytgtkgtwac ctcaagctgt tttctagtcc 180 

ccaagaacag caytytcagt gggtgtggaa gtgggcggga catgaagcaa tggttttaca 240 

ttgcattgcc tggctacags ttggcatttc tttccttttt ctttttcttt gcgtcattgc 300 

cattggtgcc actaattttg cttcccctyt cttttataaa cttgtttcct cnggagttgc 360 

ctaagagtcc tgcatnanaa cctaattggg aatgaagcag tgtgttc 407 

<210> 81 
<211> 711 
<212> DNA 

<213> Homo sapiens 
<400> 81 

ggcacgagcg aagaccctgt tcggaccctg ccccgattcc agactcaggt agatcgtcgg 60 

cataccctct accgtggaca ccaggcagcc ctggggctga tggagagaga tcaggtatcc 120 

cccagggagt aggggctacc ttgaggggat gatagacctc ccccactccc agtgkkactc 180 

tggaaatatg aaggaactag ggagtggaag agatttcaga gctggggaga ggagttcctc 240 

ccttcaaagc cagcaactgc ctttggggaa tgtcgggggg tctctccttt ctcctgcttg 300 

tgtkargtgg tacacagtcc ccccttcacc tggcgggaag ctgtcccgga cagactcatc 360 

tcagctttcc cttggggcag gatcgggggc agcagctcca gcagaaacag caggatctgg 420 

agcaggaagg cctcgaggcc acacaggggc tgctggccgg cgagtgggcc ccacccctct 480 

ggragctggg cagcctcttc caggccttcg tgaagaggga gagccaggct tatgcgtaag 540 

cttcatagct tctgctggcc tggggtggac ccaggacccc tggggcctgg gtgccctgag 600 

tggtggtaaa gtggagcaat cccttcacgc tccttggcca tgttctgagc ggccagcttg 660 

gcctttgcct taataaatgt gctttatttt caaaaaaaaa aaaaaaaaac t 711 

<210> 82 
<211> 2152 
<212> DNA 

<213> Homo sapiens 
<400> 82 

ccgctttgtt ctccagatgt gaatagctcc actataccag cctcgtcttc cttccggggg 60 

acaacgtggg tcagggcaca gagagatatt taatgtcacc ctcttggggc tttcatggga 120 

ctccctctgc cacatttttt ggaggttggg aaagttgcta gaggcttcag aactccagcc 180 

taatggatcc caaactcggg agaatggctg cgtccctgct ggctgtgctg ctgctgctgc 240 

tgctggagcg cggcatgttc tcctcaccct ccccgccccc ggcgctgtta gagaaagtct 300 

tccagtacat tgacctccat caggatgaat ttgtgcagac gctgaaggag tgggtggcca 360 

tcgagagcga ctctgtccag cctgtgcctc gcttcagaca agagctcttc agaatgatgg 420 

ccgtggctgc ggacacgctg cagcgcctgg gggcccgtgt ggcctcggtg gacatgggtc 480 

ctcagcagct gcccgatggt cagagtcttc caatacctcc cgtcatcctg gccgaactgg 540 

ggagcgatcc cacgaaaggc accgtgtgct tctacggcca cttggacgtg cagcctgctg 600 

accggggcga tgggtggctc acggacccct atgtgctgac ggaggtagac gggaaacttt 660 

atggacgagg agcgaccgac aacaaaggcc ctgtcttggc ttggatcaat gctgtgagcg 720 

ccttcagagc cctggagcaa gatcttcctg tgaatatcaa attcatcatt gaggggatgg 780 

aagaggctgg ctctgttgcc ctggaggaac ttgtggaaaa agaaaaggac cgattcttct 840 

ctggtgtgga ctacattgta atttcagata acctgtggat cagccaaagg aagccagcaa 900 

tcacttatgg aacccggggg aacagctact tcatggtgga ggtgaaatgc agagaccagg 960 

attttcactc aggaaccttt ggtggcatcc ttcatgaacc aatggctgat ctggttgctc 1020 

ttctcggtag cctggtagac tcgtctggtc atatcctggt ccctggaatc tatgatgaag 1080 

tggttcctct tacagaagag gaaataaata catacaaagc catccatcta gacctagaag 1140 

aataccggaa tagcagccgg gttgagaaat ttctgttcga tactaaggag gagattctaa 1200 

tgcacctctg gaggtaccca tctctttcta ttcatgggat cgagggcgcg tttgatgagc 1260 

ctggaactaa aacagtcata cctggccgag ttataggaaa attttcaatc cgtctagtcc 1320 

ctcacatgaa tgtgtctgcg gtggaaaaac aggtgacacg acatcttgaa gatgtgttct 1380 

ccaaaagaaa tagttccaac aagatggttg tttccatgac tctaggacta cacccgtgga 1440 

ttgcaaatat tgatgacacc cagtatctcg cagcaaaaag agcgatcaga acagtgtttg 1500 

gaacagaacc agatatgatc cgggatggat ccaccattcc aattgccaaa atgttccagg 1560 

agatcgtcca caagagcgtg gtgctaattc cgctgggagc tgttgatgat ggagaacatt 1620 

cgcagaatga gaaaatcaac aggtggaact acatagaggg aaccaaatta tttgctgcct 1680 

ttttcttaga gatggcccag ctccattaat cacaagaacc ttctagtctg atctgatcca 1740 
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ctgacagatt cacctccccc acatccctag acagggatgg aatgtaaata tccagagaat 1800 

ttgggtctag tatagtacat tttcccttcc atttaaaatg tcttgggata tctggatcag 1860 

taataaaata tttcaaaggc acagatgttg gaaatggttt aaggtccccc actgcacacc 1920 

ttcctcaagt catagctgct tgcagcaact tgatttcccc aagtcctgtg caatagcccc 1980 

aggattggat tccttccaac cttttagcat atctccaacc ttgcaatttg attggcataa 2040 

tcactccggt ttgctttcta ggtcctcaag tgctcgtgac acataatcat tccatccaat 2100 

gatcgccttt gctttaccay tctttccttt tatcttatta ataaaaatgt tg 2152 

<210> 83 
<211> 1555 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (1248) . . (1248) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (1389) . . (1389) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (1391) . . (1391) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (1393) . . (1393) 
<223> n equals a,t,g f or c 

<220> 

<221> misc__f eature 
<222> (1396) . . (1396) 
<223> n equals a,t,g, or c 

<220> 

<221> misc__feature 

<222> (1551) . . (1551) 

<223> n equals a,t,g, or c. 

<400> 83 

ggcacgagct gggcagatgc aaaatctgga gagcgcgagg gccgggcggt cagtcagcac 60 

ccagactggc agcatgaccg gtcagatacc aaggctttct aaagtcaacc ttttcactct 120 

gctcagcctc tggatggagc tctttccagc agaagcccag cggcaaaaat ctcagaaaaa 180 

tgaagaggga aagcatggac ccttaggaga taatgaagag aggaccagag tatctactga 240 

caaaagacag gattactggg agcagctaag atgcctarat gaaaggttta ccatcactgc 300 

tggttaggaa atggattatg agaactcgaa cagagggaag gtgaaatgca accggaggaa 360 

acactctgat atgaggtttg aggccttcaa aattgctttg cagcataagc cacagtgagt 420 

caggagtacc agggagtgga tagaatgttt atttgtttaa ctgagacttt ttagttcatc 480 

aattattttg aagggtagaa cactctgtgg gctctctttc tatttccttc tgggtacaat 540 

cacaaaaaaa aaatctctcc tagctgaaat tacatgcagt actagcaaag ggtctctttg 600 

ttataaactg ttcattaatt gacgaacatt tgtgtactta actatgtata aggcatctca 660 

tcgttcaatt tcaaatacaa attaaaatat tttttcacat ttgttatcct gttatgtttt 720 

ctcttttaca aattgtctgt tcgtatcttt ttgtctctct ttaggcctta ttcttgtcaa 780 

ttcatatgtg ctctaatgaa ttgaaatatt ttctgtatat taaacattac taacctttcc 840 

tctgtcacac tgattgaaaa atgatctatt tagtttgttg ttttgtcttt aattttgtaa 900 

gctttaaaaa gttaatattg cccttcagac accatcccaa catcacataa gaattttttc 960 

atgttataaa ttctttgtgg acatatttga taactgtttt attatgagga ggaccataat 1020 
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taattcaacc attcccctat tttggtcatt taggtttttg ggtttgggtt ttttgtttgt 1080 

ttaacgtctt tgcttgctat tttaaagaat gctgcactaa atgtgaatgc ttgagatttc 1140 

ttctctgtat ttagaatatt ttcctagaat ggattctcag aagaattctc agtctgtgga 1200 

gaggaacatt tttaatgcat ggaagagctg gagtgaaccg aatttcanac tgccctgctg 1260 

atccagaaat aagtttgctt acggaggctt ctagttctga agatgcaaag ttagatgcca 1320 

aagcagtgga aagattgaag tcaaacagtc gggcccatgt gtgtgtctta cttcaacctt 1380 

tggtgtgtna nanggngcag tttgtagagg agacctctta caaatgtgac tttattcaaa 1440 

aaattacaaa aacattgccg gatgctaaca ctgactttta ttatgaatgt aaacaagaaa 1500 

gaataaaaga atatgaaatg ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa naaaa 1555 

<210> 84 
<211> 1532 
<212> DHA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1412) . . (1412) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (1433) (1433) 
<223> n equals a,t,g, 'or c 

<220> 

<221> mis cofeature 
<222> (1446) . . (1446) 

<223> n equals a,t,g, or c 

<220> 

<221> mis cofeature 
<222> (1505) . . (1505) 
<223> n equals a,t,g, or c 

<400> 84 

gctggagcgg tttcattgcc tacattccgc attggaaaat agaagcaggc acaatgaagg 60 

gaaaaggccg aggcatcagc gtgtgaagac cgcaaagacg atcccgagta cagttgtgaa 120 

cagcattgct gctaggctcc tcctgcagat catctgaaat gaacctctct tattgatttt 180 

tattggccta gagccaggag tactgcattc agttgacttt cagggtaaaa agaaaacagt 240 

cctggttgtt gtcatcataa acatatggac cagtgtgatg gtgaaatgag atgaggctcc 300 

gcaatggaac tgtagccact gctttagcat ttatcacttc cttccttact ttgtcttggt 360 

atactacatg gcaaaatggg aaaggtaagg aaaatgactc ggaaaatgtg catgaaatgt 420 

actagggttt ttgcttggtt aaggtgccta aatgcttagg tcaaataccc tggcaatctg 480 

catgttacat gctatctgct ggcagtttct ttctgatata aaaatgaaac agtattcttg 540 

gacagaggac acagaatttc taattccagt ggggcttgtt ttgctttcag tttcttataa 600 

ttgtacttgg agaaacagat actgatcagt gttttatatt ctaaaagaca gccaagttga 660 

ataataaaga ctttcgtttt ggcattttgt tctttttact aaacataatt aagtgtttaa 720 

taagcttcct tgtaccgagt gttgcataaa acacttaaaa ggacacaatt agtgccttcg 780 

tgagatttac atgctaatta tgctaaygat tgggtgctat gtagttaatg atttaaactg 840 

catgcattga cagattactc cttaggcaaa agtatttaag aagggataag tagaaattct 900 

gattggaata ttaaaacatt ttttaaaaat taattatgkt tagactgktg aaccgkgtta 960 

tataatttta ggataawgga ttwatttgct tttttttttt ttaagagaaa ctacttgaag 1020 

taaattccta cccatacttc ttacttgtct cctttccttt gattaatcta aggaatgktg 1080 

atgatgagaa gaaagatgga aatgttgagg tggttgcata tttggtttgt tagaatatct 1140 

gtcatcacct gggctwtttg aagctgctgt tgctgatgtt gttttattga ctcatgaaga 1200 

caactgaaaa gattgctttg taaccttatt tttttctgat gtgtgtttac atccatgtct 1260 

atatatacat attgcatatg tatatatctg tatgtgcatg tatatgttaa aaatctgata 1320 

taagtgaaaa catgctctgt gctttgaaac aaaaaaaaaa aaaaaaaact cgaggggggg 1380 

cccggtaccc aattcgccct atagtgagtc gnattacaat tcactggccg cgntttacaa 1440 
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cgtcgngact gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct 1500 

ttcgncagct ggcgtaatag cgaagaggcc eg 1532 

<210> 85 

<211> 1559 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1445) . . (1445) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (1551) . . (1551) 
<223> n equals a,t,g, or c 

<400> 85 

atccagcagt ggggagacag cgtgctgggc aggcgctgcc gagaccttct cctgcagctc 60 

tacctacagc ggceggaget gcgggtgccc gtgectgagg tcctactgca cagegaaggg 120 

gctgccagca geagegtctg caagctggac ggactcatcc accgcttcat cacgctcctt 180 

gcggacacca gcgactcccg ggcgttggag aaccgagggg eggatgecag catggcctgc 240 

eggaagctgg cggtggcgca cccgctgctg ctgctcaggc acctgcccat gatcgeggeg 300 

ctcctgcacg gccgcaccca cctcaacttc caggagttcc ggcagcagaa ccacctgagc 360 

tgcttcctgc acgtgctggg ectgetggag ctgctgcagc cgcacgtgtt ccgcagcgag 420 

caccaggggg cgctgtggga ctgccttctg tccttcatcc gcctgctgct gaattacagg 480 

aagtcctccc gccatctggc tgccttcatc aacaagtttg tgcagttcat ccataagtac 540 

attacctaca atgccccagc agccatctcc ttcctgeaga agcacgccga cccgctccac 600 

gacctgtcct tcgacaacag tgacctggtg atgctgaaat ccctccttgc agggctcagc 660 

ctgcccagca gggacgacag gaccgaccga ggcctggacg aagagggega ggaggagagc 720 

tcagccggct ccttgcccct ggtcagcgtc tccctgttca cccctctgac cgcggccgag 780 

atggccccct acatgaaacg gctttcccgg ggecaaaegg tggaggatct gctggaggtt 840 

ctgagtgaca tagacgagat gtcccggcgg agacccgaga tcctgagctt cttctcgacc 900 

aacctgcagc ggctgatgag ctcggccgag gagtgttgcc gcaacctcgc cttcagcctg 960 

gccctgcgct ecatgeagaa cagccccagc attgeagecg ctttcctgcc cacgttcatg 1020 

tactgectgg geagecagga ctttgaggtg gtgeagaegg ccctccggaa cctgcctgag 1080 

tacgctctcc tgtgccaaga gcacgcggct gtgctgctcc accgggcctt cctggtgggc 1140 

atgtacggcc agatggaccc cagcgcgcag atctccgagg ccctgaggat ectgeatatg 1200 

gaggccgtga tgtgagcctg tggcagcega cccccctcca agccccggcc cgtcccgtcc 1260 

ccggggatcc tcgaggcaaa geccaggaag cgtgggcgtt gctggtctgt ccgaggaggt 1320 

gagggegecg agecctgagg ccaggcaggc ccaggagcaa tactccgagc cctggggtgg 1380 

ctccgggccg gccgctggca teaggggecg tccagcaagc cctcattcac cttctgggcc 1440 

acagncctgc geggagegge ggatcccccc gggcatggcc tgggctggtt ttgaatgaaa 1500 

cgacctgaac tgtcaaaaaa aaaaaaaaaa aaacccgrgg gggggecegg nacccaatt 1559 

<210> 86 
<211> 1231 
<212> DNA 

<213> Homo sapiens 
<400> 86 

ggcacgagtg . aatgtcgagg agttccagga tctctggcct cagttgtcct tggttattga 60 

tgggggacaa attggggatg gccagagccc cgagtgtcgc cttggctcaa ctgtggttga 120 

tttgtctgtg cccggaaagt ttggcatcat tcgtccaggc tgtgccctgg aaagtactac 180 

agccatcctc caacagaagt aeggactget cccctcacat gcgtcctacc tgtgaaactc 240 

tgggaagcag gaaggcccaa gacctggtgc tggatactat gtgtctgtcc actgacgact 300 

gtcaaggect catttgeaga ggccaccgga gctagggcac tagectgact tttaaggcag 360 

tgtgtctttc tgagcactgt agaccaagcc cttggagctg ctggtttagc cttgcacctg 420 

gggaaaggat gtatttattt gtattttcat atatcageca aaagctgaat ggaaaagtta 480 

agaacattcc taggtggcct tattctaata agtttcttct gtctgttttg tttttcaatt 540 
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gaaaagtaat taaataacag attagaatct agtgagagcc tcctctctgg tgggtggtgg 600 

catttaaggt caaaccagcc agaagtgctg gtgctgttta aaaagtctca ggtggctgcg 660 

tgtggtggct catgcctgta atcccaacat tctgggaggc ccaggcggga gaactgcttg 720 

agccccagga gttcagaatc agcctgggca acatagcaat actccgtctc ataaaaatta 780 

ataaataaaa agtctcaggt gaccaaaggc tcctgaagct agaaccaggt ttggataaag 840 

attgaagagc cacaggccac tcttccctct gagccattgg gcctagtggt gtcatgtatt 900 

gtaattgctc gcagggagag cagtcttttt ggtgtaatag tgggatgtct gcttagttgg 960 

caggggttca gtccaaatgg aagaatattg ggaaataaac ctccactatc ctttatagcc 1020 

agggactttt ttcctattta ttcataaaat aaattatagt taattatacc cataacacct 1080 

ttatttaaat ccagtgttct ccgcagcctt ttgtctattt atatgtgtac caagtgttaa 1140 

acataattat tattgggcat ttgaactttg tttttcttta aagaaatgct gctattaaac 1200 

atatttgtaa atggaaaaaa aaaaaaaaaa a 1231 

<210> 87 
<211> 1189 
<212> DNA 

<213> Homo sapiens 
<400> 87 

gcgtccgctg ggctggaaca gcacagaacc cacagggctg ccgtccacac tctcccggtc 60 

agagtcctgg gaccacatgg ggacgctgcc atggcttctt gccttcttca ttctgggtct 120 

ccaggcttgg gatactccca ccatcgtctc ccgcaaggag tggggggcaa gaccgctcgc 180 

ctgcagggcc ctgctgaccc tgcctgtggc ctacatcatc acagaccagc tcccagggat 240 

gcagtgccag cagcagagcg tttgcagcca gatgctgcgg gggttgcagt cccattccgt 300 

ctacaccata ggctggtgcg acgtggcgta caacttcctg gttggggatg atggcagggt 360 

gtatgaaggt gttggctgga acatccaagg cttgcacacc cagggctaca acaacatttc 420 

cctgggcatc gccttctttg gcaataagat aagcagcagt cccagccctg ctgccttatc 480 

agctgcagag ggtctgatct cctatgccat ccagaagggt cacctgtcgc ccaggtatat 540 

tcagccactt cttctgaaag aagagacctg cctggaccct caacatccag tgatgcccag 600 

gaaggtttgc cccaacatca tcaaacgatc tgcttgggaa gccagagaga cacactgccc 660 

taaaatgaac ctcccagcca aatatgtcat catcatccac accgctggca caagctgcac 720 

tgtatccaca gactgccaga ctgtcgtccg aaacatacag tcctttcaca tggacacacg 780 

gaacttttgt gacattggat atcaataagg ccaggcgtgg cggcgattac gtctgtaatc 840 

ccaggacttt gggaggccaa ggcgggcaga tcacttcagg ccaggaattc aagagcagcc 900 

tggccaatat ggcgaaactc tgtctctact gaaaacaaac aaacaaacaa acaaacaaac 960 

aaagaaacaa caaaaattag ccgggtgtgg tggcacacgc ctgtagtccc agctactcag 1020 

gaggctgagg cataagaatt gcttgaaccc tggaggcgga ggttgcagtg agctgagatt 1080 

gggccaccgc actccagtct gggagacaga gtgagactgt ctcaaaacaa caacaaaaaa 1140 

atccctaaca taatctcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 1189 

<210> 88 

<211> 496 

<212> DNA 

<213> Homo sapiens 

<400> 88 

tcgacccacg cgtccgaact gacacaatga aactgtcagg catgtttctg ctcctctctc 60 

tggctctttt ctgcttttta acaggtgtct tcagtcaggg aggacaggtt gactgtggtg 120 

agttccagga caccaaggtc tactgcactc gggaatctaa cccacactgt ggctctgatg 180 

gccagacata tggcaataaa tgtgccttct gtaaggccat agtgaaaagt ggtggaaaga 240 

ttagcctaaa gcatcctgga aaatgctgag ttaaagccaa tgtttcttgg tgacttgcca 300 

gcttttgcag ccttcttttc tcacttctgc ttatactttt gctggtggat tcctttaatt 360 

cataaagaca tacctactct gcctgggtct tgaggagttc aatgtatgtc tatttctctt 420 

gattcacttg tcaataaagt acattctgca aaagcaaaaa aaaaaaaaaa aaaaaaaaaa 480 

aaaaaaaaaa aaaaaa 496 

<210> 89 

<211> 3153 

<212> DMA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<222> (1) . . (1) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (2584) (2584) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc__f eature 
<222> (2590) . . (2590) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (3153) . . (3153) 
<223> n equals a,t,g, or c 

* 

<400> 89 

nggccgtggg tgtacgcggc gcagcgcggc agtcctgatg gcccggcatg ggttaccgct 60 

gctgcccctg ctgtcgctcc tggtcggcgc gtggctcaag ctaggaaatg gacaggctac 120 

tagcatggtc caactgcagg gtgggagatt cctgatggga acaaattctc cagacagcag 180 

agatggtgaa gggcctgtgc gggaggcgac agtgaaaccc tttgccatcg acatatttcc 240 

tgtcaccaac aaagatttca gggattttgt cagggagaaa aagtatcgga cagaagctga 300 

gatgtttgga tggagctttg tctttgagga ctttgtctct gatgagctga gaaacaaagc 360 

cacccagcca atgaagtctg tactctggtg gcttccagtg gaaaaggcat tttggaggca 420 

gcctgcaggt cctggctctg gcatccgaga gagactggag cacccagtgt tacacgtgag 480 

ctggaatgac gcccgtgcct actgtgcttg gcggggaaaa cgactgccca cggaggaaga 540 

gtgggagttt gccgcccgag ggggcttgaa gggtcaagtt tacccatggg ggaactggtt 600 

ccagccaaac cgcaccaacc tgtggcaggg aaagttcccc aagggagaca aagctgagga 660 

tggcttccat ggagtctccc cagtgaatgc tttccccgcc cagaacaact acgggctcta 720 

tgacctcctg gggaacgtgt gggagtggac agcatcaccg taccaggctg ctgagcagga 780 

catgcgcgtc ctccgggggg catcctggat cgacacagct gatggctctg ccaatcaccg 840 

ggcccgggtc accaccagga tgggcaacac tccagattca gcctcagaca acctcggttt 900 

ccgctgtgct gcagacgcag gccggccgcc aggggagctg taagcagccg ggtggtgaca 960 

aggagaaaag ccttctaggg tcactgtcat tccctggcca tgttgcaaac agcgcaattc 1020 

caagctcgag agcttcagcc tcaggaaaga acttcccctt ccctgtctcc catccctctg 1080 

tggcaggcgc ctctcaccag ggcaggagag gactcagcct cctgtgtttt ggagaagggg 1140 

cccaatgtgt gttgacgatg gctgggggcc aggtgtttct gttagaggcc aagtattatt 1200 

gacacaggat tgcaaacaca caaacaattg gaacagagca ctctgaaagg ccatttttta 1260 

agcattttaa aatctattct ctcccccttt ctccctggat gattcaggaa gctgacattg 1320 

tttcctcaag gcagaatttt cctggttctg ttttctcagc cagttgctgt ggaaggagaa 1380 

tgctttcttt gtggcctcat ctgtggtttc gtgtccctct gaaggaaact agtttccact 1440 

gtgtaacagg cagacatgta actatttaaa gcacagttca gtcctaaaag ggtctgggag 1500 

aaccagatga tgtactaggt gaagcattgc attgtgggaa tcacaaagca aatagtactc 1560 

cagaaagaca aatatcagaa gcttcctatt cttttttttt tttttttttt tttgagacag 1620 

ggtctttctc tgttgcccag gctagagtgc actggtgatc acggctcact ctagccttga 1680 

attcctgggc ccaagcaatt ctcccacctc agcctcctga gtagctggga ctacaagtgt 1740 

gcaccaccat gcctggctaa ttttttgaat ttttgtagtg atgggatctc gctctgttgc 1800 

ccagggtggt ctcgaactcc tggcctcaag cgatcctccc acctcgacct cccaaagtgc 1860 

tgggattaca ggtgtgagcc acctcgcctg ggcccccttc tccatatgcc tccaaaaaca 1920 

tgtccctgga gagtagcctg ctcccacact gtcactggat gtcatggggc caataaaatc 1980 

tcctgcaatt gtgtatctca gacatttgtg tctttgatcc tcaccctgtg accctaaagg 2040 

gaagaaagcc tgagtgtcaa gtaactctgg gcctccccta aagagaaatg gagatggtgg 2100 

ctcatctagg aagtagagga gcagggggtt cctggttctc aggccacgtg tgatctctgc 2160 

ccacccaggg cctgccccag cctgcaggta ttgctgtgtg gtgggaacac ccacttccct 2220 

tgtgcacagc ctttgagagg ggatcgtggc ctcagttcca ggggttcctg gccagggcca 2280 

agtgctcctt ctgcagaggc ctgcacgcat ctcacccctt tgacttgtat ttccatggct 2340 

tcccctcccc acctgccccc tagccctccc tgactggcca gcccctcagt agtcctcctc 2400 
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ggccagggag aggagcacgg ccttgggtgt gttctcgaaa agggctgccc ggttctgctg 2460 

ctgccccttc ttcacccagt ggccatagat tcggaaagcg taggcgtcga tgagccggcg 2520 

• cagaggccgg agggcatagg ggtctcggat gacgatctcc cgggtcacag gcttcacccg 2580 

gcgntactgn tagtagatcc gcactgaagc cagcacggtc agagcgatca ccttgaactt 2640 

cccccggggg ctgaagtgcc gcacttcctc taccaagtac tgctggaaga aggggtgtgc 2700 

caaggcctct tccgctgtgt agcggttctg gggttgcacc accaggaatc gggagaccag 2760 

gtccttcacg gtgtccgagt aatcatccca ctcgggcgag ccaaactggt agttgccgct 2820 

catgatcatc ctcagcatca gcatctgctt ccggtgccag aagggcgggg agccggccag 2880 

cagcgtgtac atgatgacgc cagtgctcca catgtccacc tctttcccgt agcccgggtg 2940 

gtcctcattc atggagcact cgataatctc aggggccagg taactggggg tcccgcagac 3000 

ctctcgcagc ctctctcccg gctccagctg gcaggaaaag ccaaagtctg tgagcttgat 3060 

gttcatgttg tcatccaaga gaatgttctc gggcttcagg tcccggtgca cgatgttgag 3120 

tttgtgcaag gtgcagatca cctccagcag agn 3153 

<210> 90 
<211> 2496 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<222> (2340) . . (2340) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (2373) . . (2373) 

<223> n equals a,t,g, or c 

<400> 90 

ggccttacct actagcggaa tcgactgaag agacgcctgc cagtgcggga ggtaggaagc 60 

tcgatcccca aagaaaagag cgagtgggca ggcagctgcg agacagaacc ggagtgtgca 120 

gggtccctag aggccggttc ctggtctgtg ctgctctcct ggaagccatg gtacaggcag 180 

agctcagggc gatccccagg tgagggcagc ggctctgcct gggattccac cgcagtacaa 240 

ccgggtagat gcggggtgga gaagaaagga tgttgcctgc actgctcgcc aatagcaccc 300 

tgagaggcta catttgcaga agcagcagca gcagaagaca cagcgccggt ccaggaggcg 360 

gctcgagctg ttcgtaaagt cgcccgacag ctttttctcc gtagtatgcg agttgacaaa 420 

acagccagag aacagggctc cccattacaa tcttttcgag atcttttccc ttgctaaccg 480 

gatctgattt gtgcgaaaac atgccttgca cttgtacctg gaggaactgg agacagtgga 540 

ttcgaccttt agtagcggtc atctacctgg tgtcaatagt ggttgcggtt cccctatgcg 600 

tgtgggaatt acagaaactg gaggttggaa tacacaccaa ggcttggttt attgctggaa 660 

tctttttgct gttgactatt cctatatcac tgtgggtgat attgcaacac ttagtgcatt 720 

atacacaacc tgaactacaa aaaccaataa taaggattct ttggatggta cctatttaca 780 

gtttagatag ttggatagct ttgaaatatc ccggaattgc aatatatgtg gatacctgca 840 

gagaatgcta tgaagcttat gtaatttaca actttatggg attccttacc aattatctaa 900 

ctaaccggta tccaaatctg gtattaatcc ttgaagccaa agatcaacag aaacatttcc 960 

ctcctttatg ttgctgtcca ccatgggcta tgggagaagt attgctgttt aggtgcaaac 1020 

taggtgtatt acagtacaca gttgtcagac ctttcaccac catcgttgct ttaatctgtg 1080 

agctgcttgg tatatatgac gaagggaact ttagcttttc aaatgcttgg acttatttgg 1140 

ttataataaa caacatgtca cagttgtttg ccatgtattg tctcctgctc ttttataaag 1200 

tactaaaaga agaactgagc ccaatccaac ctgttggcaa atttctttgt gtaaagctgg 1260 

tggtttttgt ttctttttgg caagcagtag ttattgcttt gttggtaaaa gttggcgtta 1320 

tttctgaaaa gcatacgtgg gaatggcaaa ctgtagaagc tgtggccacc ggactccagg 1380 

attttattat ctgtattgag atgttcctcg ctgccattgc tcatcattac acattctcat 1440 

ataaaccata tgtccaagaa gcagaagagg gctcatgctt tgattccttt cttgccatgt 1500 

gggatgtctc agatattaga gatgatattt ctgaacaagt aaggcatgtt ggacggacag 1560 

tcaggggaca tcccaggaaa aaattgtttc ccgaggatca agatcaaaat gaacatacaa 1620 

gtttattatc atcatcatca caagatgcaa tttccattgc ttcttctatg ccaccttcac 1680 

ccatgggtca ctaccaaggg tttggacaca ctgtgactcc ccagactaca cctaccacag 1740 

ctaagatatc tgatgaaatc cttagtgata ctataggaga gaaaaaagaa ccttcagata 1800 

aatccgtgga ttcctgaaca gtatggaaaa gcaaactgtg caactactac attatatcat I860 
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tacctggtat cccatggatt ttgtgcttgg gacagaccat aaatgatgga aaatgtcaac 1920 

acaaaaatag ctgaaagcca ggtacaacta ctgcatttat atatgtaagt tttgtatatc 1980 

aaaaataatt ggtctaaatt tcctagactt agacttgatt tcttaacatt agggtatcgc 2040 

atactcaaat ggtagacaat gaccccaact aaatcttcct gatgttacac tgctttatca 2100 

agaggatgga cttttttttt ttgaggcaga cagagtcttg gctctgtcac ccaggctgga 2160 

gtgcagtggc gcaatctcgg gtcactgcaa gctctgcctc ccaagttcat gccattctcc 2220 

tgcctcagcc tcccaagtag ctgcgactac aagcacctgc caccatgccc agctaatttt 2280 

ttttttcagt agagacaggg tctcaccatg ttagccacga tgctcttgat ctgaccttgn 2340 

gatcccgcga cctcggcctt ccaaagtgct ggnaatacag gcgtgagcca ctgggccttg 2400 

ccaagattgg gcacttttta acatcagaac ttcctatcac tgctgcattg agttgctccg 2460 

catttattag aagcattatg cctgtacgga ttgggg 2496 

<210> 91 
<211> 1001 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (919) . . (919) 
<223> n equals a,t,g, or c 



<210> 92 
<211> 1142 
<212> DNA 

<213> Homo sapiens 
<400> 92 

gaagaagtga cacttcttgg tcagaatgtt aatagttttc gggacaattc ggaggtccag 60 

ttcaacagtg cagtgcctac caatctcagt cgtggcttta ccaccaacta taaaaccaag 120 

caaggaggac ttcgttttgc tcatcttctg gatcaggtct ccagagtaga tcctgaaatg 180 

aggatccgtt ttacctctcc ccaccccaag gattttcctg atgaggttct gcagctgatt 240 

catgagagag ataacatctg taaacagatc cacctgccag cccagagtgg aagcagccgt 300 

gtgttggagg ccatgcggag gggatattca agagaagctt atgtggagtt agttcaccat 360 

attagagaat ctattccagg tgtgagcctc agcagcgatt tcattgctgg cttttgtggt 420 

gagacggagg aagatcacgt ccagacagtc tctttgctcc gggaagttca gtacaacatg 480 

ggcttcctct ttgcctacag catgagacag aagacacggg catatcatag gctgaaggat 540 

gatgtcccgg aagaggtaaa attaaggcgt ttggaggaac tcatcactat cttccgagaa 600 

gaagcaacaa aagccaatca gacctctgtg ggctgtaccc agttggtgct agtggaaggg 660 

ctcagtaaac gctctgccac tgacctgtgt ggcaggaatg atggaaacct taaggtgatc 720 

ttccctgatg cagagatgga ggatgtcaat aaccctgggc tcagggtcag agcccagcct 780 

ggggactatg tgctggtgaa gatcacctca gccagttctc agacacttag gggacatgtt 840 



<400> 91 



cgcgctggaa ccctgtggcg gcggccatgg ccatatggcg ctgcccgcct ggctgcagcc 
aggtatagga agaatgcgta tcttttcatc tattacttaa tccagttctg tggccactct 
tggatattta caaatatgac agtcagattc ttttcatttg gaaaaggtaa aactccgaaa 
cagttttttt atttttaact tttaatcctt gttttcacct catcctgctt atattaaatt 
tctacacacc tcaaccttct accacgggat acagattcaa tggttgacac tttttatgct 
attggacttg tgatgcgact ttgccaatcc gtatctctcc tggaactgct gcacatatat 
gttggcattg agtcaaacca tcttctccca aggtttttgc agctcacaga aagaataatc 
atcctttttg tggtgatcac cagtcaagag gaagtccaag agaaatatgt ggtgtgtgtt 
ttattcgtct tttggaatct attggatatg gttaggtaca cttatagcat gttatcagtc 
ataggaatat cctatgctgt cttgacatgg ctcagtcaaa cactatggat gccaatttat 
cctttgtgtg ttcttgctga agcatttgcc atctatcaat cgctgcctta ttttgaatca 
tttggcactt attccaccaa gctgcccttt gacttatcca tctatttccc atatgtgctg 
aaaatatatc tcatgatgct ctttataggt atgtatttta cctacagtca tctatactca 
gaaagaagag acatcctcgg aatctttccc attaaaaaaa agaagatgtg aagtacagca 
ttccagtgtg acacgagaaa agacaggctg tggattcagt gcagtaaata aaacacagga 
agtattctgg tggaaaaana aaaaaaaaaa aaaaaaaaar aaraaaaaaa aawaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1001 
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ctctgcagga ccactctgag ggactcttct gcatattgct gacctgagag gatggcctca 900 

gagctgactt gggcaatcct ccccaacagg aaggggagac attgcctgcc actgaggaaa 960 

caggtcatga aggtggagat aagctgcaag gggcgaagca actttatgtc agtggaaaac 1020 

gtgtctcttt aaagctgcta tgtgaacagc ttttacagtc attaaattta cctaaactaa 1080 

ggttaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaag ggcggccgct 1140 

ct 1142 

<210> 93 
<2U> 2238 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> (12) . . (12) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (45) . . (45) 

<223> n equals a,t,g, or c 

<400> 93 

tgtccttgtt gncgcggctg aagtacgccg tgggccactt cctcnaacga cctgtgcgcg 60 

tccatgtggt tcacctacct gctgctctac ctgcactcgg tgcgcgccta cagctcccgc 120 

ggcgcggggc tgctgctgct gctgggccag gtggccgacg ggctgfcgcac accgctcgtg 180 

ggctacgagg ccgaccgcgc cgccagctgc tgcgcccgct acggcccgcg caaggcctgg 240 

cacctggtcg gcaccgtctg cgtcctgctg tccttcccct tcatcttcag cccctgcctg 300 

ggctgtgggg cggccacgcc cgagtgggct gccctcctct actacggccc gttcatcgtg 360 

atcttccagt ttggctgggc ctccacacag atctcccacc tcagcctcat cccggagctc 420 

gtcaccaacg accatgagaa ggtggagctc acggcactca ggtatgcgtt caccgtggtg 480 

gccaacatca ccgtctacgg cgccgcctgg ctcctgctgc acctgcaggg ctcgtcgcgg 540 

gtggagccca cccaagacat cagcatcagc gaccagctgg ggggccagga cgtgcccgtg 600 

ttccggaacc tgtccctgct ggtggtgggt gtcggcgccg tgttctcact gctattccac 660 

ctgggcaccc gggagaggcg ccggccgcat gcggaggagc caggcgagca cacccccctg 720 

ttggcccctg ccacggccca gcccctgctg ctctggaagc actggctccg ggagccggct 780 

ttctaccagg tgggcatact gtacatgacc accaggctca tcgtgaacct gtcccagacc 840 

tacatggcca tgtacctcac ctactcgctc cacctgccca agaagttcat cgcgaccatt 900 

cccctggtga tgtacctcag cggcttcttg tcctccttcc tcatgaagcc catcaacaag 960 

tgcattggga ggaacatgac ctacttctca ggcctcctgg tgatcctggc ctttgccgcc 1020 

tgggtggcgc tggcggaggg actgggtgtg gccgtgtacg cagcggctgt gctgctgggt 1080 

gctggctgtg ccaccatcct cgtcacctcg ctggccatga cggccgacct catcggtccc 1140 

cacacgaaca gcggagcgtt cgtgtacggc tccatgagct tcttggataa ggtggccaat 1200 

gggctggcag tcatggccat ccagagcctg cacccttgcc cctcagagct ctgctgcagg 1260 

gcctgcgtga gcttttacca ctgggcgatg gtggctgtga cgggcggcgt gggcgtggcc 1320 

gctgccctgt gtctctgtag cctcctgctg tggccgaccc gcctgcgacg ctcccgaggc 1380 

ggagaacacc gaacacccag tgaaggtgag gggatcagca cggcgccgcc accgtgctgg 1440 

aacgagactc agccacaagg aggtgcgaag ctctgaccca ggccacagtg cggatgcacc 1500 

ttgaggatgt cacgctcagt gagagacacc agacacagaa gggtacgctg tgatcccact 1560 

tctatgaaat gtccaggaca gaccaatcca cagaatcagg gagaggattc gtgggtgccg 1620 

ggactgagga gggggacctg ggggtgacta ggtgacataa tggggatcag cacgggcggc 1680 

accagcacac acgtgctgga atgagaactc agccacaagg ggaggtgcga agttctgacc 1740 

caggccacag tgcggatgca cctttgagga tgtcacgctc agtgagagac accagacaca 1800 

gaagggtacg ctgtgatccc acttctatga aatgtccagg acagaccaat ccacagaatc 1860 

agggagagga ttcgtgggtg ccgggactgg ggagggggac ctgggggtga ctaggtgaca 1920 

taatggggac aagggctgcc tttctggggt gatgagaatg ttctggaatc agaatgggat 1980 

ggctgcacgg cgtggttgaa gggtactgaa ggccaccctt cagtgtacga aggggtagat 2040 

ttttgtattt ttaccaacaa taaacaaaaa gaaaagaaaa caaacaaaaa aaaaaagaaa 2100 

aagataaaaa gaatagagaa caagaaagaa aaaaaaaaaa acaagggggg actttttggg 2160 

cgggggagaa agggtataaa gggcgcgggg tcaaaggttt cgagcaccct tttttacaag 2220 

agggaatagg ggcccata 2238 
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<210> 94 
<211> 1052 
<212> DNA 

<213> Homo sapiens 
<400> 94 

acgcgtccgg ccagccagtc cgcccgtccg gagcccggct cgctggggca gcatggcggg 60 

gtcgccgctg ctctgggggc cgcgggccgg gggcgtcggc cttttggtgc tgctgctgct 120 

cggcctgttt cggccgcccc ccgcgctctg cgcgcggccg gtaaaggagc cccgcggcct 180 

aagcgcagcg tctccgccct tggctgagac tggcgctcct cgccgcttcc ggcggtcagt 240 

gccccgaggt gaggcggcgg gggcggtgca ggacctggcg cgggcgctgg cgcatctgct 300 

ggaggccgaa cgtcaggagc gggcgcgggc cgaggcgcag gaggctgagg atcagcaggc 360 

gcgcgtcctg gcgcagctgc tgcgcgtctg gggcgccccc cgcaactctg atccggctct 420 

gggcctggac gacgaccccg acgcgcctgc agcgcagctc gctcgcgctc tgctccgcgc 480 

ccgccttgac cctgccgccc tagcagccca gcttgtcccc gcgcccgtcc ccgccgcggc 540 

gctccgaccc cggcccccgg tctacgacga cggccccgcg ggcccggatg ctgaggaggc 600 

aggcgacgag acacccgacg tggaccccga gctgttgagg tacttgctgg gacggattct 660 

tgcgggaagc gcggactccg agggggtggc agccccgcgc cgcctccgcc gtgccgccga 720 

ccacgatgtg ggctctgagc tgccccctga gggcgtgctg ggggcgctgc tgcgtgtgaa 780 

acgcctagag accccggcgc cccaggtgcc tgcacgccgc ctcttgccac cctgagcact 840 

gcccggatcc cgtgcaccct gggacccaga agtgcccccg ccatcccgcc accaggactg 900 

ctccccgcca gcacgtccag agcaacttac cccggccagc cagccctctc acccgaggat 960 

ccctaccccc tggccccaca ataaacatga tctgaagcag caaaaaaaaa aaaaaaaaaa 1020 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 1052 

<210> 95 

<211> 1492 

<212> DNA 

<213> Homo sapiens 

<40Q> 95 

gccttcccac actccattcc ctgtcaagtt atggctgtcc cctcacccca gctgctccta 60 

gagaggccct tkttacctgt gtcattcatg tttctaacaa gccaccctcc accccgtctt? 120 

gtgtgcccca tgcacctgtg catctgtgct gtgtgggtgt tggtggccct tttgcgcatg 180 

catggggcat cccctgccca gaccagcggg acaaggagcg ggaacggcgg ctgcaggagg 240 

cacggggccg gccaggggag gggcgcggca acacagccac tgagaccacc acgaggcaca 300 

gccagcgggc agctgatggc tctgctgtca gcactgttac caagactgag cggctcgtcc 360 

actccaatga tggcacacgg acggcccgca ccaccacagt ggagtcgagt ttcgtgaggc 420 

gctcggagaa tggcagtggc agcaccatga tgcaaaccaa gaccttctcc tcttcctcct 480 

catccaagaa gatgggcagc atcttcgacc gcgargacca ggccagccca cgggccggca 540 

gcctggcggc gctcgagaaa cggcaggccg agaagaagaa agagctgatg aaggcgcaga 600 

gtctgcccaa gacctcagcc tcccaggcgc gcaaggccat gattgaraag ctggagaagg 660 

agggcgcggc cggcagccct ggcggacccc gcgcagccgt gcagcgatcc accagcttcg . 720 

gggtccccaa cgccaacagc atcaagcaga tgctgctgga ctggtgtcga gccaagactc 780 

gcggctacga gcacgtcgac atccagaact tctcctccag ctggagtgat gggatggcct 840 

tctgtgccct ggtgcacaac ttcttccctg aggccttcga ctatgggcag cttagccctc 900 

agaaccgacg ccagaacttc gaggtggcct tctcatctgc ggagacccat gcggactgcc 960 

cgcagctcct ggatacagag gacatggtgc ggcttcgaga gcctgactgg aagtgcgtgt 1020 

acacgtacat ccaggaattc taccgctgtc tggtccagaa ggggctggta aaaaccaaaa 1080 

agtcctaamc cctgctcggg gccccacgga tgctggtgga ctgtgtgccc ctggtggagg 1140 

tggacgacat gatgatcatg ggcaagaagc ctgaccccaa gtgtgtcttc acctatgtgc 1200 

agtcgctcta caaccacctg cgacgccacg aactgcgcct gcgcggcaag aatgtctagc 1260 

ctgcccgccc gcatggccag ccagtggcaa gctgccgccc ccactctccg ggcaccgtct 1320 

cctgcctgtg cgtccgccca ccgctgccct gtctgttgcg acaccctccc ccccacatac 1380 

acacgcagcg ttttgataaa ttattggttt tcaamraaaa aaaaaaaaaa aaaaaaaaaa 1440 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa ag 1492 

<210> 96 
<211> 954 
<212> DNA 
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<213> Homo 



sapiens 



<400> 96 



gaattcggca 
ctgctggctc 
ctcctctccc 
ggcctggctg 
ctctggtgtt 
aacctatacc 
gagaagagga 
ctggaggaaa 
ctggctgcct 
cctggacttg 
gaagaaggct 
ttcatgcacc 
gcatccctgg 
ccccactgcc 
ccatgcaccc 
cagccaaaaa 



cgagcccaca ccaaacctgt ggacgccgac ccgggaccgc cgctggctgg 
actcgaccgt catggagacc ctgggggccc ttctggtgct ggagtttctg 
cggtggaggc ccagcaggcc acggagcatc gcctgaagcc gtggctggtg 
cggtagtcgg cttcctgttc atcgtctatt tggtcttgct ggccaaccgc 
ccaaggccag ggctgaggac gaggaggaga ccacgttcag aatggagtcc 
aggaccagag tgaagacaag agagagaaga aagaggccaa ggagaaagaa 
agaaggagaa aaagacagca aaggaaggag agagcaactt gggactggat 
aagagcccgg agaccatgag agagcaaaga gcacagtcat gtgaagattc 
cttccaggca gtcccccaga gatgcctctt ctgcccccta aaagcagtgc 
aagcccgtga aatgactcca tctgggattc agaatacagt gttctcaagt 
tggaacccac cccacctccc tcattggggg ctctctgggc aaacatggtt 
cctcttcctg agcttggtcc ctgcctggtg attcttctta tactcggaga 
ttgaggagac acccgcaatc ctccacgatc tcatggctcc acctgcttct 
tgatttcttt tctctctgcc tgatgtctac tgaacagaac ttcccctctc 
actgccagct gagagctgct tcccaatggc ctgcattaaa gcattcgtaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaac tcga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
954 



<210> 
<211> 
<212> 
<213> 



97 

1794 

DNA 

Homo 



sapiens 



<220> 

<221> misc_f eature 

<222> (1675) . . (1675) 

<223> n equals a,t,g, or c 

<400> 97 

gctcctctag tgactgctgg ggtgctagtt ttcacggtta ccacagaact tgagagagga 60 

aaatgagaac ggcaaattaa aatgccataa aattccattc ttaacaagcg ttagtttttt 120 

tttcctatat atccactccc tcacctatct tatgacatat gagaaatcct aacatgggac 180 

ttggtgtata attagcattg aatgaattta agttttcttt ctttctttct tttcttttat 240 

cttctgtggt ctcctgcaga atcagtctat aaaaagggca tggtaaaaaa aaatctatct 300 

catagcattg ttgaaaagat taaatgacat aataagatga tgcatatata gtagctagca 360 

ctgtacctga tgcatattag gagcttgata atattactaa cattatcatc atcaatgcta 420 

tctgagcaaa gaggctgttc cttctttcca gccagagttc ttctgttgga taatatttcc 480 

agctgtgaac cccaaccaga gaccttgaag cctttatttc cttttctctc attggccttt 540 

ggcfcgaagtc tcctttctgc agagaacaaa gtgaccagct tttgatgaac tattttcctc 600 

ttttatccat ttccaagtgt tagccatagt gaagaagtgt agctgggtgc tagcccaact 660 

caagaagtga taatgtwata tccaacccaa gataaactca aggataactt tcaacacggc 720 

tactcaagca gttcaggggg gaggcatctt gtagagagga gaccaggaag tcacttggca 780 

gctgggccag cacacagagg gccattgctc cagtaaagca gctaacctcc atctcttcat 840 

caaaccatcc tagactcagc actctgcaga gaaggagcag atggagggaa tgtgtggaga 900 

gattagataa gaaggatttc taatctagtg ggggagcgag taaaatgtac agaagtttga 960 

gaagccaagc tcttatgtam maatccrccc ccatcactca acaatcccca ctgtatgaat 1020 

aaagcctgga aagtttccaa ttaaaacagt gcttgtgaat attggcaagg ggtatctttg 1080 

tgtgcagggt acatttaaag ggagaagggt gaagatacac ccttgcctty taggagtaca 1140 

ctttctgaga gcttgttcac caggctgtga gtttctcagt ctatctgttt ctcagtctgt 1200 

tagtaatgaa tgtatctccc acttagcaca gcagctagca catagtagat gcctaacaaa 1260 

tgtgtgttaa attgaatgtt ggaagtctgt gtcctgaaag cttttcttca catattacaa 1320 

ggctttttat ttgttcaaca catttttacc aagttttttt ctgtgttcca ggtcttttga 1380 

ttagctgttt ttaagtcaca gaaatgggtg tcgggaacaa aaccagtcaa aagtcctcat 1440 

tctacatcat ttacactttc ccttcctata tttataagtt ttaatatcag cttctacaafc 1500 

aggttccaga acaagtggtg ctcaaggaat ggagaaaatg actattccaa ccctagctgt 1560 

aggtgaacca aaaaccccag agaaatcaaa gtgtagttta aagcagtgct tctcaagttg 1620 

taatgtgcat atagatcacc tggggttgtt attaaaatgc aaattctaaa- agagncagga 1680 

gaatatcttg agcctgggag ccagaggttg cagtgagccg agatcattcc actgcattcc 1740 

agcctgggtg acacagcgag actccatctt aaaaaaaaaa aaaaaaaact cgta 1794 
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<210> 98 
<211> 1262 
<212> DNA 

<213> Homo sapiens 
<400> 98 

cctaatggcc cgasctgaat acttgaagga gctcaagatg agggaatctc gctgggaagc 60 

tgacaccctg gacaaagagg gactgtcgga atctgttcgt agctcttgca cccttcagtg 120 

accctagaag aatgattgga cagatgtgag ccatctggag cagaggggca ctaacccagg 180 

ctgacgccaa gaatgaagtg gcccactgca gccctggcga gcaggcttct tggatggaca 240 

gtgctgagac ccccatatcc cagagtcccc agcctccctc aggttactct gcaccccaca 300 

gatggtttga tggctgtgct gtatactgga ggggagggca ggactctggg agaacagcac 360 

ttctttcatg agacctttgt tactcggtgg ttactgggtc ctgtgcctgt ccgttttggg 420 

gcatgcagcc ctctatcatt tttggctccg agaagagggc aaggggcccc cgcaggtarc 480 

ttctgtgctt gccctcgccc tgccagcagg cagctgtgcc cctggcctgc ccttcccggg 540 

accccttatt ccaactcagc tcctctttgc actggaatgg ggcactccaa cacccctcag 600 

ggaccaccct ccccacagta tgcactcagc cccacagaac ccaccagtct ttctgggaac 660 

tcacacctgc ccgccatctt ggtactttag gttaatccct caagcatgaa agctggatct 720 

tttggggttt aagaagccca agccttgttc ctgccctggc ctagggagca ctcaggaggg 780 

ttccttggtc ctcatctctc ccacctccgt tccctctggg ccccacacta gccacagcgc 840 

gggccttgtg ctggagtttg agcctgggac agggagaggg aggcttggag acagtctgac 900 

ccagtgccct ctaggccacc cacttctagg cctgccctgc cgccgtggag ccctgggcaa 960 

gctctttccc ctttctgggc ctgggtctcc ccatctcttc aatggggctg ataccttcac 1020 

agcccacagc atgggcactt atgaggacaa agtgaattta acctggaaaa gaatgtattt 1080 

gagagtttct tttaaataat cagcgggtgt tggtgatttg tagcccttct gcccttaaat 1140 

gcttccttgg gcaagagctg tctgtcctcc ctgcaggagg ctgagtgtga agagtatcat 1200 

tcattgtttc tctattaaat tattttctgc taaaaaaaaa aaaaaaaaat ttctgcggtc 1260 

eg 1262 

<210> 99 
<211> 2572 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (2527) (2527) 
<223> n equals a,t,g, or c 

<400> 99 

aatteggcac gagtctggac cttcttagmt tgcttgatat caggttgttt ttgtagccat 60 

tttgettcac agtcacctgg aatgccggga gcccctgctc atcccgatcc tctccttgta 120 

catgggegea cttgtgcgct gcaccaccct gtgcctgggc tactacaaga acattcacga 180 

catcatccct gacagaagtg gcccggagct ggggggagat gcaacaataa gaaagatget 240 

gagcttctgg tggcctttgg ctctaattct ggccacacag agaatcagtc ggcctattgt 300 

caacctcttt gtttcccggg accttggtgg cagttctgea gecacagagg cagtggcgat 360 

tttgacagee acataccctg tggtcacatg ccatacggct ggttgacgga aatccgtgct 420 

gtgtatcctg ctttcgacaa gaataacccc agcaacaaac tggtgagcac gagcaacaca 480 

gtcacggcag cccacatcaa gaagttcacc ttegtctgea tggctctgtc actcacgctc 540 

tgtttcgtga tgttttggac acccaacgtg tctgagaaaa tcttgataga catcategga 600 

gtggactttg ectttgeaga actctgtgtt gttcctttgc ggatcttctc cttcttccca 660 

gttccagtca cagtgagggc gcatctcacc gggtggctga tgacactgaa gaaaaccttc 720 

gtccttgccc ccagctctgt getgeggate atcgtcctca tcgccagcct cgtggtccta 780 

ccctacctgg gggtgcacgg tgcgaccctg ggcgtgggct ccctcctggc gggctttgtg 840 

ggagaatcca ccatggtcgc catcgctgcg tgctatgtct aceggaagea gaaaaagaag 900 

atggagaatg agtcggccac ggagggggaa gactctgcca tgacagacat gcctccgaca 960 

gaggaggtga cagacatcgt ggaaatgaga gaggagaatg aataaggcac gggacgecat 1020 

gggcactgea gggacagtca gtcaggatga cacttcggca tcatctcttc cc tctcccat 1080 

cgtattttgt tccctttttt ttgttttgtt ttggtaatga aagaggcett gatttaaagg 1140 

tttcgtgtca attctctagc atactgggta tgctcacact gaegggggga cctagtgaat 1200 
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ggtctttact gttgctatgt aaaaacaaac gaaacaactg acttcatacc cctgcctcac 1260 

gaaaacccaa aagacacagc tgcctcacgg ttgacgttgt gtcctcctcc cctggacaat 1320 

ctcctcttgg aaccaaagga ctgcagctgt gccatcgcgc ctcggtcacc ctgcacagca 1380 

ggccacagac tctcctgtcc cccttcatcg ctcttaagaa tcaacaggtt aaaactcggc 1440 

ttcctttgat ttgcttccca gtcacatggc cgtacaaaga gatggagccc cggtggcctc 1500 

ttaaatttcc cttccgccac ggagttcgaa accatctact ccacacatgc aggaggcggg 1560 

tggcacgctg cagcccggag tccccgttca cactgaggaa cggagacctg tgaccacagc 1620 

aggctgacag atggacagaa tctcccgtag aaaggtttgg tttgaaatgc cccgggggca 1680 

gcaaactgac atggttgaat gatagcattt cactctgcgt tctcctagat ctgagcaagc 1740 

tgtcagttct cacccccacc gtgtatatac atgagctaac ttttttaaat tgtcacaaaa 1800 

gcgcatctcc agattccaga ccctgccgca tgacttttcc tgaaggcttg cttttccctc 1860 

gcctttcctg aaggtcgcat tagagcgagt cacatggagc atcctaactt tgcattttag 1920 

tttttacagt gaactgaagc tttaagtctc atccagcatt ctaatgccag gttgctgtag 1980 

ggtaactttt gaagtagata tattacctgg ttctgctatc cttagtcata actctgcggt 2040 

acaggtaatt gagaatgtac tacggtactt ccctcccaca ccatacgata aagcaagaca 2100 

ttttataacg ataccagagt cactatgtgg tcctccctga aataacgcat tcgaaatcca 2160 

tgcagtgcag tatatttttc taagttttgg aaagcaggtt ttttccttta aaaaaattat 2220 

agacacggtt cactaaattg atttagtcag aattcctaga ctgaaagaac ctaaacaaaa 2280 

aaatatttta aagatataaa tatatgctgt atatgttatg taatttattt taggctataa 2340 

tacatttcct attttcgcat tttcaataaa atgtctctaa tacaatacgg tgattgcttg 2400 

tgtgctcaac atacctgcag ttgaaacgta ttgtatcaat gaacattgta ccttattggc 2460 

agcagtttta taaagtccgt catttgcatt tgaatgtaag gctcagtaaa tgacagaact 2520 

atttttncat tatgggtaac tgggggaata aatggggtca ctgggagtag gg 2572 

<210> 100 
<211> 1488 
<212> DNA 

<213> Homo sapiens 
<400> 100 

cgccaagttt ccggagggag agggtagaaa ctggaggggg tggacctgtc actcacggga 60 

ctgagggtcc ttttctcccg ctcccaggag gaacgagaat gaatatgact caagcccggg 120 

ttctggtggc tgcagtggtg gggttggtgg ctgtcctgct ctacgcctcc atccacaaga 180 

ttgaggaggg ccatctggct gtgtactaca ggggaggagc tttactaact agccccagtg 240 

gaccaggcta tcatatcatg ttgcctttca ttactacgtt cagatctgtg cagacaacac 300 

tacaaactga tgaagttaaa aatgtgcctt gtggaacaag tggtggggtc atgatctata 3 60 

ttgaccgaat agaagtggtt aatatgttgg ctccttatgc agtgtttgat atcgtgagga 420 

actatactgc agattatgac aagaccttaa tcttcaataa aatccaccat gagctgaacc 480 

agttctgcag tgcccacaca cttcaggaag tttacattga attgtttgat caaatagatg 540 

aaaacctgaa gcaagctctg cagaaagact taaacctcat ggccccaggt ctcactatac 600 

aggctgtgcg tgttacaaaa cccaaaatcc cagaagccat aagaagaaat tttgagttaa 660 

tggaggctga gaagacaaaa ctccttatag ctgcacagaa acaaaaggtt gtggaaaaag 720 

aagctgagac agagaggaaa aaggcagtta tagaagcaga gaagattgca caagtggcaa 780 

aaattcggtt tcagcagaaa gtgatggaaa aagaaactga aaagcgcatt tctgaaatcg 840 

aagatgctgc attcctggcc cgagagaaag cgaaagcaga tgctgaatat tatgctgcac 900 

acaaatatgc cacctcaaac aagcacaagt tgaccccgga atatctggag ctcaaaaagt 960 

accaggccat tgcttctaac agtaagatct attttggcag caacatccct aacatgttcg 1020 

tggactcctc atgtgctttg aaatattcag atattaggac tggaagagaa agctcactcc 1080 

cctctaagga ggctcttgaa ccctctggag agaacgtcat ccaaaacaaa gagagcacag 1140 

gttgatgcaa gaggtggaaa tgttctccat atcaagatgt ggcccaaggg gttaagtggg 1200 

aacaatcatt atacggactc ttcagattta cagagaactt acacttcatc tgttccacct 1260 

ctcctgcgat agtcctgggt gctccactga ttggaggata gagccagctg tctgacacac 1320 

aaatggtctt ttcagccaca gtcttatcaa gtatcctata tgtattcctt tctaaactgc 1380 

tactcatgaa tgaggaaagt ctgatgctaa gatactgcct gcactggaat gttaaacact 1440 

aaatatataa caagctgtgt tttcctaagc tgaaaaaaaa aaaaaaaa 1488 

<210> 101 
<211> 704 
<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 
<222> (287) . . (287) 
<223> n equals a,t,g, or c 

<400> 101 

actgtgtctg tcttgtctct gatatttata tgccattatg tggcctctac tgccttagga 60 

ttctaatgtt cccactaaga tcagctaact cagttccact acagtgttta ccaccatcat 120 

ctctcgcaaa caaagacagc cacttcagag ctcctaggaa atagtggtgc tcccatcatc 180 

attgcattcc ttaatsacat ggtgaaaatt aacaatggct aaggagcctt tgtgttttct 240 

cctctacaat atgcccagga atttctggca ttttggccat cttattnata ggctattact 300 

gaatttmagc ctmatcctmc caaattatta atgccaaaat attaactctt gattcttagg 360 

tgagtgcacc catgccaata aatttgccat gatctaacct taaatgtatt ctcatatatg 420 

ctgtccaagt ttctrctgat taaaatggca aggcctttag ttctcctaca taggttttct 480 

ctctccagag aaggcctcaa ttctctgact aggctatgtt gggatataac tggaggcact 540 

aataggtagt agggtaaatt ctttatttta ttatttttgg agacagggag ggtcttgctt 600 

tgttcagact ggagtgcagt ggtgtgatca tggctcattg caactttgaa ctcctgggcg 660 

acagagcaag actccatctc aaaaaaaaaa aaaaaaaaac tcga 704 

<210> 102 
<211> 1022 
<212> DNA 

<213> Homo sapiens 
<400> 102 

ttcccgggtc gacccacgcg tccgcccacg cgtccggctt ggggccagca ccctgtctca 60 

aagatggcaa aatgaggcta gttctggatg agctagctgg tgtgggttcc aaccatagga 120 

acacactgat gctcaaatcc taaggtgcca agctctaggc cctggaggct ggtagaacag 180 

gatctatgcc tggaatcctg gcagggattc ctgtcaagga cttgtgttta agcctgcttc 240 

agggcttcag gctgcttctg ctctgtgtct gcccaggctg gctgagcggg tggatgggtg 300 

gacagaaggg ctcaccaagg attgtggaca tagggtaggc cctggtacca cgggtttcag 360 

gctgttatca cttcccttgt aggaacatag ccagaagcag atgagccagg gtagagggct 420 

ggcccctcct ctcatcttcc cttcagtctt aaattgtctc cagcgatggg aagaggccag 480 

ggactgtaac ccttgtgctg tgtattctct gagcctctgc tcactctcag ggccaagcag 540 

ctcccaagcc ggggccctct cttggccaaa atctgaggag cagtctaggt tacaggcttt 600 

ttggtaggta ggttctggct gcctgttaat gcagttaggc cccctgatta ggtacagtga 660 

gaaacaagct agaacaaccc tggcccagaa gactgtgcac tccagcaaga tccagggatg 720 

atagccttgc agggccactg ggagtttgtg cccaagcttc tccctcttct ctccccaggg 780 

ggcactggga ctggtccctg" ccctcatcct tagcctgggc cttccccaga ggtattaaag 840 

agaagtatga ttcctctgtc ttcagttctt ttcaggggca tcctgcccat agtacccagt 900 

tcccaagggg cccccagtca cgtggtgaag cctagcactc atgcagctct tagggaacca 960 

aaaaccagca ctgaaataaa gctgaatgac tgactgaaaa aaaaaaaaaa aaagggcggc 1020 

eg 1022 

<210> 103 
<211> 1766 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_JE eature 

<222> (14) . . (14) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (36) . . (36) 

<223> n equals a,t,g, or c 

<220> 

<221> misc feature 



i 
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<222> (1750) .. (1750) 
<223> n equals a,t,g, or c 

<400> 103 

acgggggctt taangggaaa cccttcccgg aatttncggg tcgacccacg cgtccggttt 60 

tgtttatgga gggtccagta agtgcaaaca accattgcct ggtcctaagg gttcagagtc 120 

cccgaattcc ttcttggacc aggaaagccg gagacgaaga ttcaccattg cagactcgga 180 

tcagttgcct gggtactcgg tggaaaccaa cattctgccc acaaaaatga gagagaaaac 240 

accatcttat ggcaagccac ggcctttgtc catgcctgct gatgggaact ggatggggat 300 

tgtggaccct tttgccagac ctcgaggtca tggcaggaaa ggggaggatg ccctttgccg 360 

gtatttcagt aacgagcgga ttcctccgat cattgaagag agctcctctc ccccataccg 420 

gttctccaga cccacgaccg agcggcatct ggtccggggt gcggactaca tccgaggaag 480 

caggtgctac atcaactcag atctccacag cagcgccacg attccattcc aggaggaagg 540 

gaccaaaaag aaatctggct cctcagctac gagtcctcgt ccacagaacc gtccctcctg 600 

gtcagctggt ttacgcgcct caaactgttg actcactgag agggaccctg ctcaggccac 660 

ctgcctggct cctgscccaa gtgccttgct tttacagtgg acagcctctt ctcgtttcag 720 

cctcagtatt atgtagggac cttatgcaat ttctttttct tttgaaaagt tatctactgc 780 

ccttcttgga agtttgcagg attggatggg aacaaattca gaggatctta ggtgctggct 840 

tgtggagaca aaaggaggga aatgggtaga gcctgtttgt cttgcttccc cagagataga 900 

atgtgaagac acgcgctaga aatcgcagtc ctggccagag acgttatggt cattgtgagg 960 

gactggtggc attgttcctt tttgaggggc tggggggact caaattggtg gctgttttca 1020 

cacagatgtg ttggtttgtg gtccaacttc tttatctgaa aaagccagtg agaaaacatt 1080 

tttgatttga tttttctaaa ctatctacca tattttaagt gtagcagctt tgactttgca 1140 

ataacgtggc aagtatctga tttctccttt gaggcagagg tttaagtgta ggcctgttac 1200 

acttgtttga tacctttttc atgacagtct cagtatagat cagttggtac agaaatacat 1260 

gaacacattt tgatagggct tatttcacac aaagaagttt atggttattt gtgtggggtg 1320 

gtgttgttat atattattgt ctttaaggga aaagaagcta taagattcgc tgacagccaa 1380 

agtatcattt agaaaagtga agcaacaaga tttaggttga tgaaagatac atgagtttgc 1440 

attttgacct gttcagtgtc tgtcttccag cacggtgtgt acacttcttc aaaattgtac 1500 

acagtttgct aattagaaat atcttggaaa gcctcatggt cactaatttt caactagcat 1560 

caggtatttt gaaaacgtgt gtctggatat taactcttgt ttaaactgaa tgtatgatat 1620 

tttgttagaa tggaaaagta ctatcttgtt aatttaagta ttttaaatat agttgtatat 1680 

ttttcttaaa aaaaaaaaaa aaaaaaaaaa aaagggcggc cgctctagag gatcccgcga 1740 

ggggccccan attacgcgtg agcgtt 1766 

<210> 104 
<211> 2286 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (2262) . . (2262) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (2264) . . (2264) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (2272) . . (2272) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (2278) . . (2279) 
<223> n equals a,t,g, or c 

<400> 104 
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gctgatgtcg aggttcatcc tgaaccacct ggtgctggcc attccactga gggtgctggt 60 

ggttctgtgg gccttcgtct tgggcctatc cagggtcatg ctggggcggc acaatgtcac 120 

cgacgtagct tttggctttt ttctgggcta catgcagtac agcatcgtgg actattgctg 180 

gctctcaccc cataatgctc cggtcctctt tttactgtgg agtcaacgat gacaccatct 240 

cattgattat ggcaccagga agtctgaagg tttccacatt cgatgatgtc aacctaaacc 300 

agcagccatc ccgcttgtcc ctcttaggca tttcaggctt cctttgggat ttcaggtgtc 360 

ccatgatctt gatgtgctgc taggctggag cacacactgg ccattactga acacagccat 420 

attagggaaa gcaaaaaaac ccaaaaaatc ctctattgta tatttattca acaactgttt 480 

atgtttccag gacaactgca aagaaaacaa gctgaggtgg ttatactgtt gctgttaaaa 540 

gttggtatca gtaagatttg tgttttgtga taatccctaa atcaacatac cacttgtaaa 600 

ctgaacttcg agaaagaaac atgatgttca ttctgtaaat atacatgcag acaggtcatg 660 

tactaatcct agtccttttc ctgaggtaga ttttaaacag tatttttaaa gtccaagaca 720 

taggtttttc tagtttattc cctgaagatc tgttgccaca gttgggagat ttcttcttaa 780 

tcctgatttt cttggtaagc ttttttactt tattatctct ataatttatt atctctatcc 840 

atatttgtgg atcgggtagt gggaaaagag attataatac ttgtctttct ctcctctccc 900 

tccatccctc aaaagatctt tatgcatttc ccactactcc cttactgtct tttagcattc 960 

agagaaaaag ccaacttgct taaagaggaa tcacttaaaa ggtaggcata tctaagatgc 1020 

tcatagaaga ggaagaatgg gacatggccc catgcttatt tttgtttaca acgtaacatg 1080 

gcatgagaga gggcagagaa actaagttgc tggggaaagt tagaggaact gaaagtttgg 1140 

gaataggctg accacatatt atgccagtga ccagtatgac aggagatggg gccctgctgc 1200 

cagtcatctc cactgaataa agaataatgc tcctctttca gggtaataaa gtggggaaaa 1260 

ggaacgtctt ctcaatgcaa gaacataagc tttctcgtat atacctgtat gctacagttt 1320 

ttcacatgga attccgtttt ctgaggtaca gcacatttta ggtaacagta tttaacttga 1380 

aattcatcat gggagtctgc tgctatacca ggcacaagat aaaactccaa aatttctgtt 1440 

tacattgacc tttacattta aagctgttca tccatggtgc ctccccaaat cataagacca 1500 

aagaccacca aacgcagggt ggactctgct cattattctt tgacccagaa agactggaga 1560 

aggtatgtgc tttaagtgct gctctacctg aaaagaaatc ctttaaatta cctatggaag 1620 

tgatgtcctc agataatctt aatgactatt ttggcattta taaatagaaa tgattatgga 1680 

ctttgatctg ccatacggag gttcggaacc tggagaaygg ctgtgataag taggttttga 1740 

ttgagtgaaa gcatgagctt gttcagagtg aggggcatag tgaaaaagga acagccatgc 1800 

ctcawaatca aatcatttgc rttcccacag catcctgaat accgactacc tcttcacttg 1860 

ctaaagcagc taaactgtga agctctaagt ggtttgggtt tgttgtttaa ccttagcgag 1920 

atcctttaac tgcagcaata ttcaagccag atatttggaa gcaaatgata tttcctcttg 1980 

cagtgtccac aaatctgaat attaggggca tgaaattagg cttaccatct gatttgtaat 2040 

tacaattttg gaattctctg ttttagttgc tgaggcctga gttttctggc tcttaaagca 2100 

tagatcattt cacctgatgt ttttgaagca tcctaagtac agtagagtag aaaactgatt 2160 

tctttgttaa ttgtacactg aataatgcct tttaaaaatc aaaataaaat taacaaataa 2220 

tggtgaaaaa aaaaaaaaaa aaaaaaaact cgaggggggg cncnaaaaca antcgacnna 2280 

tagtga 2286 

<210> 105 

<211> 1240 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1225) . . (1225) 
<223> n equals a,t,g, or c 

<400> 105 

gtgaaacgcc tgggctcaag ctgattcacc tgcctccacc tcccacagtg ctgggattac 60 

aaacatgatc ccccacgccc agccaacaca aaacttctga tgctctgttt tctcatctgt 120 

gaactggagc taaggctaag tggtctgtct gtttaataag agtttgaatc agatggcctg 180 

gcatgaagag tcactggcct gagagaatgt caggggcatt tgtaaatgtg taaagggctg 240 

aaaaatcctg agggattatt attattgcta ttgttgttat tattcacaga cacatccaac 300 

agccattgtc tgcctcctta tctgtcatgc tttctgcacg agcgtcagcc tgagcttcaa 360 

tctgtgtgta tatctgcagc ttacgtcctt gccacccctc cagaacccag tttcatcctt 420 

gtaggttttt ccgaagcagg atttgcacaa gtggcgtgtt ttcttaagta tttattttgc 480 

aggccattta ctcggcatgg ctatttttac agtgggtaag gagcaaggct aaaaataact 540 

tagctcataa ccagacaggt tctgcatttg acatttacgt ggaattcatt tgcatctcat 600 
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ttgttcgcct ttctgtttaa caggtagaat gtaagaaagc tcagccgaaa gaagtcatgt 660 

tcccacctgg gacaagaggc cgggcccggg gactgcctta caccatggac gcgttcatgc 720 

ttggcatggg gatgctgggt gagtctggac aggaccgcag gtcaccatgg actgggaggg 780 

ctatggaggc ctctactccc aactgggtca cctaccagtg gggcaaactg cttcaccttt 840 

ctaagcctca gtttccttgt ctgtagatga ggatgataat tccccgttcc aagacagttg 900 

tgatgattaa gtgtgggtgt gtgtgtgtgc atgcatgtgt gtgtgtgtgt gtgtgtttgt 960 

atttataata ttgccccatg cctggcttat aggatatgtt agactatttt ctctcttttc 1020 

catctccttc ctcaaaagaa ggaaaagtcc ccctctattg cctcagccct ctcatctgag 1080 

tgggagttct taagatgtaa ggactcctgg ctgacttgac ttgtgtgggc taaggctacg 1140 

ttttctaaaa ctkgggagag gagggaagtg gtaagggtgg gcgataatcc tgtctattta 1200 

aatgattaac atttttctct tgggntatca aaatttgcat 1240 

<210> 106 
<211> 997 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (963) . . (963) 
<223> n equals a,t,g, or c 

<400> 106 

cccgactcta ggccggaagc gcgcggagac catgtagtga gaccctcgcg aggtctgaga 60 

gtcactggag ctaccagaag catcatgggg ccctggggag agccagagct cctggtgtgg 120 

cgccccgagg cggtagcttc agagcctcca gtgcctgtgg ggctggaggt gaagttgggg 180 

gccctggtgc tgctgctggt gctcaccctc ctctgcagcc tggtgcccat ctgtgtgctg 240 

cgccggccag gagctaacca tgaaggctca gcttcccgcc agaaagccct gagcctagta 300 

agctgtttcg cggggggcgt ctttttggcc actfgtctcc tggacctgct gcctgactac 360 

ctggctgcca tagatgaggc cctggcagcc ttgcacgtga cgctccagtt cccactgcaa 420 

gagttcatcc tggccatggg cttcttcctg gtcctggtga tggagcagat cacactggct 480 

tacaaggagc agtcagggcc gtcacctctg gaggaaacaa gggctctgct gggaacagtg 540 

aatggtgggc cgcagcattg gcatgatggg ccaggggtcc cacaggcgag tggagcccca 600 

gcaaccccct cagccttgcg tgcctgtgta ctggtgttct ccctggccct ccactccgtg 660 

ttcgaggggc tggcggtagg gctgcagcga gaccgggctc gggccatgga gctgtgcctg 720 

gctttgctgc tccacaaggg catcctggct gtcagcctgt ccctgcggct gttgcagagc 780 

caccttaggg cacaggtggt ggctggctgt gggatcctct tctcatgcat gacacctcta 840 

ggcatcgggc tgggtgcagc tctggcagag tcggcaggac ctctgcacca gctggcccag 900 

tctgtgctag agggcatggc agctggcacc tttytytata tcacctttyt ggaaatcctg 960 

ctntttcatc ccaaatttaa gggggtttca agaagaa 997 

<210> 107 
<211> 312 
<212> DNA 

<213> Homo sapiens 
<400> 107 

aattcccggg tcgacccacg cgtccgtgat gagtggattt gtactcttac ccaggtcctg 60 
agggccagcc cacccagcat ccccacccct gatgacgctg tccctacaac tggctgaact 120 
ggtgcatttt gtgtgtgcct tccagagcca gtggactggt gtgtatccaa tgatgccacc 180 
tctgaaacct acagaaccac tatgctttgc atgtgtaccc tgcagggtct gagggccagg 240 
ctgtctggta gctctgctcc tgggtgacag agcaagactc tgtctcaaaa aaaaaaaaaa 300 
agggcggccg ct 312 

<210> 108 

<211> 864 

<212> DNA' 

<213> Homo sapiens 

<400> 108 

ggcacgagcg gaccgggccc gcggggctgc tgcggggcga tcgggccggg ccgctgccgc 60 
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gc'catggact 
ctgccagccg 
gggcgtggtc 
tgggctgctg 
tcgtggggga 
cgtgcgtgga 
vccaggatgc 
ggcaaaamcc 
cctctgcggg 
cccctcggcc 
ggctcatggc 
agctactact 
acgtgcttaa 
aaaaaaaaaa 



cccgtgtcca 
aggaagcccc 
acagtcatcg 
gtgctacctg 
tggggagacc 
gagaaaggcc 
aaaggccycc 
atacggatgc 
tttcgtccat 
tccaggtgag 
cctgtagcgc 
gtaatgcgtg 
gcccttttgc 
aaaaaaaaaa 



gcctgagttc cagcctcact gagtggccac ccccaaagtg 
cagcactgac catgtctatt atggaccaca gccccaccac 
tcatcctcat tgccatcgcg gccctggggg cctttgatcc 
cggctgcagc gcatcagcca gtcagaggac gaggagagca 
aaggaaccct tcctgctggt gcagtattcg gccaarggac 
aagctgatka mtcccaaacg gsccggaart ycacggstga 
tggtccctgt ttgcaagccg gccaagargg ggctgggagg 
gctgctgtct gagaggaagg gctgacactt gctggcatgg 
cgcatgcact gatgcccggg gacttggctg tcctgggctt 
gctgcccatt gcaggcactg ggtaggcctg accttgctgg 
ttttgttact tgaatgtcta gctgagcctg tttttgatgg 
aactaacaaa cctgtgaact gtaaataggc ccctggaagc 
tgatttttaa aaatatcatc tagcgcaaaa aaaaaaaaaa 
aaaa 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
864 



<210> 109 
<211> 1258 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__f eature 

<222> (1)..(1) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (1196) . . (1196) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc__f eature 
<222> (1200) . . (1200) 
<223> n equals a,t,g, or c 

<220> 

<221> raisc_feature 
<222> (1237) . . (1237) 
<223> n equals a,t,g, or c 

<400> 109 

nagatggcgc tacgtctgct gcggagggcg gcgcgcggag ctgcggcggc ggcgctgctg 60 

aggctgaaag cgtctctagc agctgatatc cccagacttg gatatagttc ctcatcccat 120 

cacaagtaca tcccccggag ggcagtgctt tatgtacctg gaaatgatga aaagaaaata 180 

aagaagattc catccctgaa tgtagattgt gcagtgctcg actgtgagga tggagtggct 240 

gcaaacaaaa agaatgaagc tcgactgaga attgtaaaaa ctcttgaaga cattgatctg 300 

ggccctactg aaaaatgtgt gagagtcaac tcagtttcca gtggtctggc ggaagaagac 360 

ctagagaccc ttttgcaatc ccgggtcctt ccttccagcc tgatgctacc aaaggtggaa 420 

agtcctgaag aaatccagtg gtttgcagac aaattttcat tccacttaaa aggccgaaaa 480 

cttgaacaac caatgaattt aatccctttt gtggaaactg caatgggttt gctcaatttt 540 

aaggcagtgt gtgaagaaac cctgaaggtc gggcctcaag taggtctctt tctagatgca 600 

gtcgtttttg gaggagaaga ctttcgagcc agcataggtg caacaagtag taaagaaacc 660 

ctggatattc tctacgcccg gcaaaagatt gttgtcatag cgaaagcctt tggtctccaa 720 

gccgtagatc tggtgtacat tgactttcga gatggagctg ggctgcttag acagtcacga 780 

gaaggagccg ccatgggctt cactggtaag caggtgattc accctaacca aattgccgtg 840 

gtccaggagc agttttctcc ttcccctgaa aaaattaagt gggctgaaga actgattgct 900 

gcctttaaag aacatcaaca attaggaaag ggggccttta ctttccaagg gagtatgatc 960 

gacatgccat tactgaagca ggcccagaac actgttacgc ttgccacctc catcaaggaa 1020 

aaatgatctg ttaaatgaag ctgtcatcag gctaaagggt attgaagctg cagagggatc 1080 

aacttgtgct tgccagagga cgccaatgaa gtttgaaaca ccaacaatca gagattttgt 1140 

ttctgttcct cattaaatca tgagcttttg tgcccgagac tctggacgga ctgttncttn 1200 
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aggaatttaa ccggatggga agttttttaa acttttncaa ccaacttttt taaggccc 1258 

<210> 110 
<211> 883 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> (19) . . (19) 

<223> n equals a,t,g, or c 

<400> 110 

gtcaccgtgg gcgtttaant atgatccccg gctcagattc gcagactgca ctgaacttcg 60 

gctctacgtt gatgaagaag aagtctgatc ctgagggtcc cgcgctgctc ttccctgaga 120 

gtgaactttc catccggata ggtagagctg ggcttctttc agacaagagt gagaatggtg 180 

aggcatatca gagaaagaag gcggcagcca ctggccttcc agagggtcct gctgtccctg 240 

tgccttctcg agggaatctg gcacagcccg gcggcagcag ctggaggagg atcgcactgc 300 

tcatcttggc catcactata cacaacgttc cagagggtct cgctgttgga gttggatttg 360 

gggctataga aaagacggca tctgctacct ttgagagtgc caggaatttg gccattggaa 420 

tcgggatcca gaatttcccc gagggcctgg ctgtcagcct tcccttgcga ggggcaggct 480 

tctccacctg gagagctttc tggtatgggc agctgagcgg catggtggag cccctggccg 540 

gggtctttgg tgcctttgcc gtggtgctgg ctgagcccat cctgccctac gctctggcct 600 

ttgctgccgg tgccatggtc tacgtggtca tggacgacat catccccgaa gcccagatca 660 

gtggtaatgg gaaactggca tcctgggcct ccatcctggg atttgtagtg atgatgtcac 720 

tggacgttgg cctgggctag ggctgagacg cttcggaccc cgggaaaggc catacgaaga 780 

aacagcagtg gttggcttct atgggacaac aagcttcttt cttcacatta aaactttttt 840 

ccktcctctc ttcttcaaaa aaaaaaaaaa aaaaaaactc gag 883 

<210> 111 
<211> 1465 
<212> DNA 

<213> Homo sapiens 
<400> 111 

ggcacgagcg agccaagttt gcaccactgc actccagcct gggcgacaga gcaagactca 60 

gtctcgaaaa aaaaaaagtt ggaagcagaa gtaaaaaaca tggtaaagaa tgagaactaa 120 

ataaatataa taattgagag gtctgcatta gatgtggcag ggagaacaag caaaaagaga 180 

tttcagagaa gatcactgga attggcagag gccttgaagg gcagagtcta gcatacagaa 240 

gatgtaaagc cacattctgt gaaggtaagt agatgtgttt acctcttttg cactgtactg 300 

gtgcattatg gggtaaatrt gtattacttt tcctgtattg cttagcacag agttttgcct 360 

atagcaggca ccagactgtg ggcttggtag tacatgacta ttggtgatta cagatcaaaa 420 

aggacttgaa atgatcagtt taaggtcttg atgggtattg aagactcaaa ggatgatggc 480 

accctgggag tgatccacag aaggacagat tatttgaaga tgttaataac taaagacaac 540 

atggatgtta aatgatgaaa aaaagttgga tggaaaataa accattggat ctgcytctgg 600 

agtccaagaa gaatattatt cttcctacct cccccttact ctggctcttc ctattgtagc 660 

cacatgggtc agtaatgcca ttgaaaaaca aaattttaga ctaagtgggg tcgcagaaat 720 

tttggtctat cttaaattga tgacatctta ttaaagaaty tattgtataa agtgtgctta 780 

ttctggcatt tttttaatga agaaaaagtg taattcagtg cacatttatg aatttcaaag 840 

atcaataaaa atgggcaaag tatatgaacg cataatccat agaagaagat atctgacaaa 900 

tgcagttcaa taaatatttt tttaaataaa aattagcctg tggtaagaat tgaaatggag 960 

aaagaaatag aaggtagcca ggttacctac agctttgtat atgacactat agagctagga 1020 

ctttattcca tagatgaggg aagtctgcta aagtgtctac attgaaccca ttattttgct 1080 

agcattgtag ttgatgtacc taaaacagac ttgagccggt agagtaagta ggcagacttg 1140 

tccaagtgag aaaaagatga aactgtggtg aggataaaga gagaaaagga gcagatttaa 1200 

gaaatattaa aacttgaaag tactaagact tgatgatgaa ctagatgtgt tagataagag 1260 

atagcatgga gtctagtaaa agttctgttt ttctcactcg tgtgactgcc tcaataacac 1320 

aaagcttgat aggaaataaa catgagatag cacatggatc tattacaagt ttttgaaatt 1380 

gagcttgaaa agctacttca aaaaataaat tctaggccag gtgtgagycc atgcgcttga 1440 

ttaaaaaaaa aaaaaaaaac tcgta 1465 
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<210> 112 

<211> 1369 

<212> DNA 

<213> Homo sapiens 

<400> 112 

cccacgcgtc cgcccacgcg tccggctggc aagatggcgg gaggggtgcg cccgctgmgg 60 

ggcctccgcg ccttgtgtcg cgtgctgctc ttcctctcgc agttctgcat tctgtcgggc 120 

ggtgaaagta ctgaaatccc accttatgtg atgaagtgtc cgagcaatgg tttgtgtagc 180 

aggcttcctg cagactgtat agactgcaca acaaatttct cctgtaccta tgggaagcct 240 

gtcacttttg actgtgcagt gaaaccatct gttacctgtg ttgatcaaga cttcaaatcc 300 

caaaagaact tcatcattaa catgacttgc agattttgct ggcagcttcc tgaaacagat 360 

tacgagtgta ccaactccac cagctgcatg acggtgtcct gtcctcggca gcgctaccct 420 

gccaactgca cggtgcggga ccacgtccac tgcttgggta accgtacttt tcccaaaatg 480 

ctatattgca attggactgg aggctataag tggtctacgg ctctggctct aagcatcacc 540 

ctcggtgggt ttggagcaga ccgtttctac ctgggccagt ggsgggaagg cctcggcaag 600 

ctcttcagct tcggtggcct gggaatatgg acgctgatag acgtcctgct cattggagtt 660 

ggctatgttg gaccagcaga tggctctttg tacatttagc tgtggtgtgt gcttcagaaa 720 

ggagcagggc ttagaaaaag cccttttgtc cgtagagttg atgtggtgtg agtgatatat 780 

ttctatgttt ttaatgtaca gcatctgtac tttgtttgcc ttgataaagg taagataaat 840 

gaaacgctga actatgctaa tctggaattt gtttttattt gcctgaaata tatttttttc 900 

tgtgaaaaaa ttaaaacgta cttaagccag gagaatgaat tatacagtga ttgaaaatcc 960 

atttaattcc tatgactttt gttttgtatt gcccaagtca aactacatca cttgtatctc 1020 

cagcccaaat gtagtctgcc ttgaaaagtc tttcagctgt gactgcagga agtgggagtg 1080 

tttttattgt tagctaattg ctgtgactgc aggaagtggg agtgtttctg ttgttggcta 1140 

attgaagtta ttaggctcag cttcagtcat gtgtaagttt tgcagtgtaa tacatatgta 1200 

gtctggtctg tatatatgaa aatttgaatt aaactgcaga atgtttatgt ctagttatgg 1260 

tttaaatttt cttagtagta tataaaaggt aagagtactg aaaaattaat aaaattgcaa 1320 

gttaaraaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg agggggggc 1369 

<210> 113 
<211> 596 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mi sc_f eature 

<222> (4) . . (4) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_ feature 

<222> (8) . . (8) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (28) . . (28) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 

<222> (57) . . (57) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 

<222> (61) . . (61) 

<223> n equals a,t,g, or c 

<400> 113 



67 



WO 02/076488 PCT/US02/08276 

gganaccngc tttgcccctt ggtttccnca aagctcgaat ttaccctcac taagggnacc 60 

naaagctgga gctcccaccg cgttggcggc ccgctctaga actagtggac cccccgggct 120 

gcaggaattc ggcacgagtc ctgacctcag gtgatccacc cacctcggct tcccaaagtg 180 

ctaggattat aggcttgagc tactgtgccc ggcccatggt gtttttcttt agggctcttc 240 

ctacagcctt gagaagtaga taggcatcag agtatggtac tataggaatc agaaaaattc 300 

aaaacaaatg tggattaagt gtttaggctc tatgtggctc acgcagccag aatccttaag 360 

tctgtgtgtt tctgtgtctc aagactgggc tcacattctg gctttgtcca taacaatgct 420 

ctgggatttc agggagttcc ctcatttgta aaatgagggg gtcagagcag gtgatatcca 480 

tgtttcttcc ctttctgata ttgttgtctg tggcatattc tttgtatggc gaatttaata 540 

aattatatta atgtgtctct ttgaaaaaaa aaaaaaaaaa aaaaaaaaaa ctcgta 596 

<210> 114 
<211> 629 
<212> DMA 

<213> Homo sapiens 
<400> 114 

ggcacgagaa cctttggggc tgacacaaga tcctttagtg tttgggatga cctctttcct 60 

gcagacttct tcccctatcc ctaactcatg catggaaaac gtttgtcagg ctggtttccc 120 

gagcctcctg cacctcaaca tcacgctcac ccttttgggt ttagcccagt gttatttagc 180 

aaatttctcc agctgcaggg aaggatcaga gcactatctt tttttttttt tttttctcct 240 

ggagccagga ctgcacaagg caatggccaa atttagttga attcagccta ccatcctttg 300 

ctgatgactc agctctatgc caagtactgg agccacagag atgggtcagt cccagcccct 360 

gtcctcagga agcccatggt cagggaaacg ttgtagggat aagtaataga gggcagttgc 420 

cttcagggct cctggtggct gctggtccct atggtgcctt gatgtgaatt agaagacggt 480 

gccctttcca ggtggattca gacctacact agaacgcaca gctttgggag tgacacacag 540 

gttggatttt agcacccctt gccccttggc cagaggtgcc ctgctgcacg gccatacgct 600 

gcagcctcga gggacacaca ggccaaagt 629 

<210> 115 

<211> 2497 

<212> DNA 

<213> Homo sapiens 

<400> 115 

ctgagccact gtgcccagcc tcggctcagg tttttcaata cagtcttgac cttggcattc 60 

agtatcctca cagcatggtt ctaattaact ttctagctct atttcccttt tcctgctccc 120 

tctctctaca actagtcttt ctctgattgc cccgccctca acccatctaa actagacccc 180 

agggaagcac cttggtcccc ttcctctctc ccactcacca tccaaccaat caccagagcc 240 

tgtacattct atattttcaa catcgattca attgtctact tctttctagc ctgccctctc 300 

tgactgggac tccttgagcc agcctgatca ccccaatcca tccctcacac tgtgcccatc 360 

tttctgaagt aggaatctga tcacaccacc ctgctaaaaa cactctggtt ctccccacgg 420 

catgtggtgc ccttgtatag ctggcaaagc cttgcatggc acggccccag cctgtgcttc 480 

aactcaattg cccgactctc tccagctctg ctgagccacc taagtcacag atggtttctc 540 

ctctcatctc tgctctcttc catgtgccat ttctgtggct tggaatgttc ttccctcatt 600 

ctctttctgg ccctttcccg tcacacctta gacgtgcatc ttcctctcga aaacctctag 660 

tgaagcctcc cagggccagg cagtaccctc ctctggcttc ttctggatac agaggaagaa 720 

tctgagcatc gattctccat ctcagcaggc ctctgtgtgc ctgctgactc cgactagacc 780 

agagatccgt aaggacaggg atcgagtttt ttttctttta attcactgcc tcaaaaatcc 840 

tctgtgcatt acctattcat cctcttctct cccttaacct gaaccagtga tcttactgtc 900. 

tccatcattg tttttttctt ttcttttctt ttcttttttt tttttgaggt ggagtctggc 960 

tcttcaccca ggctggagtg cagtgatgcg atctcgactc actgcaacct ccatctcctg 1020 

ggttcaagcg attctcctgc ctcagcctcc ccagtagctg ggattacagg catgcgctac 1080 

catccccaac taatttttgc ctccataatt ttgccttttc tagaatgtca tacaggtgga 1140 

attactcagt atgctgcctt tttcagattg gcttctttca cttagtaata tgcttgtttt 1200 

ttgagacagg gtcttgctct gtcgcccagg ctagagtgtg gtggtgcgat cttagctcac 1260 

tgaaacctcc acctcccagg ttcaagtgac tctcctgcct cagcctcccg agtagctggg 1320 

actacaggca cgtgccacca tacccggcta atttgtggat ttttagtaca gacggggttt 1380 

cgtcatgttg gccagggtgt tgttgaattc ctgacctcaa gtgatccacc tgcctcagcc 1440 

tcccaaagtg ttgcgattac aggtgtgagc cactgcgcca agcctcattt agtaatatgc 1500 

. atttaaactt tctccatgtc tttaatggct tgatagctca tttattttta tcatggaata 1560 
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tttcattgtc 
ttgcttctaa 
gtgtggacct 
cagtaagagt 
tgagagtttc 
tggccattct 
acatcttttt 
aacgtgcagg 
tagtcattta 
ttgccatctg 

cagaaactaa 
atgtcctatg 
cgtatgtcct 
acactaccca 
aagccccaca 
aagagaattc 



tggatggacc 
gttttggcaa 
attatcaact 
gtgtttagtt 
tgttgctcca 
agtaggtgtt 
ttttttaatt 
tttgttacat 
gcattaggta 
tatatcttcc 

catagcagtt 
tttttttgtt 
tcccccatgc 
accatcaagg 
gcactgcttt 
gatatcaagc 



acagtttatt 
ttatgaataa 
aattcgggta 
ttaagtggct 
cattctcact 
tacatggtat 
ttattattat 
atgtatacat 
tatctcctaa 
ctgatgagtt 

ggtacagagt 
tttttttttt 
ctctaccctg 
ctgagacttc 
tttcaacttc 
ttatcgatac 



tctccattca 
agctgctata 
aatctcaagg 
gtgccatttt 
accattcggt 
ctagtcattt 
tatactttaa 
gtgccatgtt 
tgctattgga 
ggggatgcat 

tggtgctcaa 
tttaaaaagg 
cccttccaca 
ccgtggccag 
tcttgtggtt 
cgtcgac 



cctactgaag 
accatcaagt 
agtgcaattg 
gcattcccac 
gttgtcagtg 
gaatgggcat 
gttttagggt 
ggtgtgctgc 
acatcttttc 
tctttccatc 

caaacatcag 
aatgtgagct 
aactttctga 
cagtgtctca 
tagactgtct 



gacatctcgg 
gcaggttttt 
ctggatcaca 
cagcaatgaa 
ttttgcattt 
atgatgtgga 
acatgtgcac 
acccattaac 
atgtgtttat 
tcagagtccc 

cttaggaact 
gttcccaaaa 
tcttcagcac 
tgctggcttc 
ttagcccagc 



<210> 116 
<211> 1217 
<212> DNA 

<213> Homo sapiens 



<400> 116 

tctggaacct 

tcagtttgtg 

tgaaaaactc 

aattcctttt 

cattagctca 

taccttgaat 

tgggtttcag 

cattttattt 

aactcctggg 

tgcgccacca 

taggcagttc 

ttyctgtttc 

atggcttttt 

gtagcattat 

ccacattttg 

tattatggat 

tagttcactt 

ttcgtaggaa 

actgtatgac 

tatataatag 

tcaagcttat 



ctctcttaat 
atattggata 
aacatcacta 
actattctat 
ttttattgac 
tttttttcac 
ttatttcaaa 
ttattttttt 
ctcaagtgat 
cacccaccct 
cacattccgt 
tgtggatttg 
gtgcctggct 
cagcactttg 
tttatccgtt 
gatgcttctc 
gagtatgtac 
ctgccaaatt 
agttccgttt 
ctattttagt 
cgatacc 



tcatatttcc 
gatgcaaata 
ctgtataaat 
acattgcaca 
cattcttgta 
tttttaagtg 
tacaaggttg 
tcagcagtgg 
cctcctgcct 
caaaacattt 
tcttcctatg 
tctattctgg 
tctttcactt 
tttcttttat 
aatcagttga 
tgaatatttg 
ctaggatgga 
gtttcctaaa 
atccacatcc 
gggtatgaag 



cgtaagtctg 
agcaaagctg 
tattttctag 
ttgctctggg 
ttcagcaagt 
agatataatt 
twcatatatc 
ggtcctgcca 
cagtctccca 
ttatttccta 
atccagctct 
acatagcatg 
agcataatgt 
ggctaaataa 
tggatatttg 
tgtacaagtt 
attactgggt 
gtggctacgg 
actccaacac 
tcgtatttca 



tccatctgtt 
ttacttttca 
tctatctgtg 
ggtaaaaatc 
atcccaagta 
tacataccat 
atcactgtct 
tgttgcccag 
aagtcctggg 
aaaaagaaac 
tggcagctac 
taattggagt 
ttttaagatt 
cactgcattg 
ggttgtttcc 
tttgtgtgga 
catgtggtaa 
tattttacat 
ttgttgttat 
aaaaaaaaag 



gtgtggaatc 
tagtttcaaa 
tttattttta 
cartataaac 
cagtggtcca 
aacaacttag 
aattccagaa 
gctggtcttg 
attacaggtg 
cccacatcca 
tatagttkgt 
catacaatat 
cattcatgtt 
tgtgsacata 
acttcggggc 
catttgtttt 
ctgttttaat 
tcccatcagt 
ctgttttgat 
gaattcgata 



<210> 117 
<211> 529 
<212> DNA 
<213> Homo sapiens 



<400> 117 

acgcgtccga 

tccccggagg 

ctggccctgc 

ctgctggcct 

ctgttcggca 

ttcattgtgc 

ccccggcagc 

atatactggc 

tttcattcat 



ttacttacgt 
tgctgggcct 
tcctgggcct 
acagtggcta 
gcgatggcta 
gctctttgcg 
gtctccagct 
tgactttcca 
tgaagatttg 



gctcctggct 
gtgtgcaagc 
ctacctggcc 
caaatacgtg 
ctacgtggcg 
gacagcagcc 
ctacctgact 
cctggtccgg 
atttccttga 



gggatggcac 
acagcgctgg 
accgtgcgca 
ggaatgatcc 
ctggcctgga 
ctgggccccg 
ctgggagctg 
tgaccccctg 
aaaaaaaaaa 



tgggcattca 
tgtgggtggt 
gtgacctgag 
tcagtgtgct 
cctcatcggc 
acagcatggg 
cagccttcca 
gccccagatg 
aaaaaaaaa 



gaaaaggttc 
gatggaggtg 
cacctttcac 
cacggggctg 
gctcatgtac 
gggccccgtc 
gcccctcatc 
gcactgagtt 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 

2220 
2280 
2340 
2400 
2460 
2497 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1217 



60 
120 
180 
240 
300 
360 
420 
480 
529 



69 



WO 02/076458 



PCT/US02/08276 



<210> 118 

<211> 1146 

<212> DNA 

<213> Homo sapiens 

<400> 118 

cccgtccaca atgcagcaga ctcttcccaa ggccacctag caagcaaggt tgatcggatc 60 

atctaaactg gccgcctcct gaatatttca ctgaatcctg gcgttcatgt tgaagcagac 120 

aaaatgagaa aggaggaggg cattgctcac ctctcaatag cttttttcgt tcaagttcta 180 

tgtctttatc agctcttgcc tgtgatttta ccccaattca accttgggag tgggaagaat 240 

atgaacagat aacccttggc ctaacagctc catcaaacct ccttgagagc aactacctag 300 

gccaggctag tgagtgcttt gtgaggaagc tggtcagaag gttccctcaa ctccttcctg 360 

gtcctcctgg acactgcaga aaagacttag gggatcccca gcagaggcca attgctctcc 420 

ttccttccct gccccaccag gaaaggaata acgtccacag acttgaagca gatagtgaag 480 

tagatctgtg agaggttcta ggtacttagt gtgtagactt tgacgaatat ttctcaagtt 540 

gggagccctt gttaaaaatg atgtttaagg gagtggttgg ggggaagatg aaggcatgga 600 

ggaggaagaa gagaaggaag cccttgccat ataaaattca tgcagactaa acagtttccc 660 

tgacagaata aataaagtgg atgctacccc actccagaat caaaagcaat ttaattaaag 720 

tctcttaagt tgtaaagagt tttaaatgat ccgtgttgaa ggcgaatsct gcyaaatgca 780 

gtgggtctga cgtcagctgc cgggcctggg ctgggaggcc atttgctatt ctgtttaagg 840 

caggctggat tgtcttattt tggaaccagc ttggtggggg gtttgctttg ctactgcttc 900 

tgagccctga gctfccaaagg ctgaaattaa tggtgaacaa aattgtgcgg ctctggccat 960 

cccatgcggg caagcccatt gagggttatc attaagtaaa gaaataaaga gggggaaaaa 1020 

agcctgcctg ttccaaaaac ctcatcagat aatgacctca gtgattgggt tttcattacc 1080 

aaacagcatc cagagattat caacccatag aagaagggag gggaaaaaaa aaaaaaaaaa 1140 

aaattc 1146 

<210> 119 
<211> 1346 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (537) . . (537) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (880) . . (880) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 

<222> (1115) . . (1115) 

<223> n equals a,t,g, or c 



<400> 119 

ggcagaggct 

tgccaggagt 

tttttctttt 

cctccctccg 

cctccctctc 

accaccaytt 

taaatatttt 

actggttatt 

ctaaaacagc 

gagtatatcc 

cttgaggaac 

agccagtgtt 



tgtgaagggt 
aatgtgctcc 
ccttccttcc 
tcttcctcca 
acatccttta 
atgcacaaaa 
ttawacacgt 
gccttgaggg 
aataccaaag 
cttaaccact 
tctaaggccc 
tactgaggac 



aaagtttaaa cccctctgct tagcccctgc ctccagcctc 
catagtactc tgatccactt gtatttggtg cttccttttt 
tcctttcctt tcccttccty ttcctsttcc tccattcttc 
ttcttccctc cctccctatt cctccattct tccctccctc 
ggactcagca tcacctcctc taggcagtct ttcctggayt 
cacctaagca ytaccttatg tggcctcatt tatcactgct 
gctgtgatgt ggcacatgca ggtgtcattc ttgakgatcc 
gatgacaact gcccggtagg gtv/acctggt gtgactgcac 
ggcccattgc cagttctgtc actgaccagc tggggcntct 
ttggacctta atttggcatc tgtcaaatga gatggtggaa 
ctactgtgca ggtcttatta atgattacaa cagcagcagc 
ttacaaagca ccaagcactt tgcctatcct aatccttaca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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tcaactctac gaagttagta tggttactat 
aagagaggtt atatgacttg accacaaggt 
attctgactt tactgttctt agccacataa 
actgtttggt ggtcactcac agactccatc 
catttttaag gctcaaactg gagtctgggt 
acaattatta gagggaaatg ggtgaggacc 
aattctcaca gagagtcttg ctttggacag 
ctctttagga ggggagagta ggcctgagtc 
ggtaccaggt gcagtggttc acgcctgtaa 
tatcacttga ggccagaagt ttgagagcag 
tttaaaaaaa aaaaaaaaaa actcga 

<210> 120 
<211> 1079 
<212> DNA 
<213> Homo sapiens 

<400> 120 

ggcacgaggt gttagaaagt tttcgaagca gtgtgagtct tgtacctttg tggtcctgtc 60 

tcacagacac ctgtctattc cctgaccctt ttaaatgcta actttctgcc tgtaggaaat 120 

cttccctttg tgcttaggtc tttttcttct gtgagcttta gataaacaac ctagtgttta 180 

aactttttaa taagggattc attttttaat acatgagaat tcatttcaaa attttggttt 240 

tagttattta ttttattcta cttggctctt tttcagacag atgttctctc ctggattgta 300 

aaagtcgaat tcaaaggatt tttatttgta atatacttaa cctttctctt gtaagttgcc 360 

atctgtgtag atacagcttt gattgcctga caagaggaaa atgtttccca ttatcttttc 420 

ctgcctgaac tatacggtca cttgtgttcc agcatagtgg ttcttaaccc tcatagtgtg 480 

tcagaatcac tttgcagagc ttttaaaaac tctagatgcc tggggaccac cccaaagact 540 

ccattttgtt gtcatgggtc aaagcacagt cttctagttt gcagctagtg ttgagtacaa 600 

ctagagttta acccagttga attttagttt aatcttggct ggtcttgaag atgttagtaa 660 

tctctattca tttttttkga aaagtaccaa tgaratcaga aagttaatta gaaaacatct 720 

agttgaatcc cctgttttta atagatgggg aaaccaagac ccagagaata taatccaaag 780 

ctacctgtca cataggccac aatttctttt ccaatattct gttcttcgct gttcttctaa 840 

tttgcagaac tcctctttaa aaaacctttg gagaatgtat tggcctcata ccctcttcct 900 

tcagcctgaa agacatgcac ctgtcactta tttatgatat ttaaatgcaa cctctagaac 960 

aggggtgtcc aatcttctgg cttccctggg ccacattgga agaagaaatg tcctgggcca 1020 

cacataaaat acactaatga tagccgatga acttaaaaaa aaaaaaaaaa aaactcgta 1079 

<210> 121 

<211> 2103 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (2101) (2102) 
<223> n equals a,t,g, or c 

<400> 121 

ttcctcgtag cgagcctagt ggcgggtgtt tgcattgaaa cgtgagcgcg acccgacctt 60 

aaagagtggg gagcaaaggg aggacagagc cctttaaaac gaggcgggtg gtgcctgccc 120 

ctttaagggc ggggcgtccg gacgactgta tctgagcccc agactgcccc gagtttctgt 180 

cgcaggctgc gaggaaaggc ccctaggctg ggtctgggtg cttggcggcg gcggcttcct 240 

ccccgctcgt cctccccggg cccagaggca cctcggcttc agtcatgctg agcagagtat 300 

ggaagcacct gactacgaat gctatccgtg cgagaacagc tattccacga gaggatccgc 360 

gagtgtatta tatcaacact tctgtttgca acactgtaca tcctctgcca catcttcctg 420 

acccgcttca agaagcctgc tgagttcacc acagggtgtc ctgggccggg tctmtgagac 480 

agtggtgatg ttgatgctcc tcactctgct ggtgctaggt atggtgtggg tggcatcagc 540 

cattgtggac aagaacaagg ccaacagaga gtcactctat gacttttggg agtactatct 600 

cccctacctc tactcatgca tctccttcct tggggttctg ctgctcctgg ctgctggaag 660 

acctggagga gcagctgtac tgctcagcct ttgaggaggc agccctgacc cgcaggatct 720 

gtaatcctac ttcctgctgg ctgcctttag acatggagct gctacacaga caggtcctgg 780 



ccctatttta cagatgagga gactaaggct 780 

cataataaag aaacagattt gaatccaggc 840 

tgggcacasn ttygacacac rgttttgtgt 900 

ccagactctg catgaaccat ccctgttcta 960 

gaaacctggg gacagaagac tgctatagtc 1020 

agtggccagc tctgttcatg aacctttgac 1080 

agacnactta cgttgctgtt ttcagttacc 1140 

atgcttcaga cacagatt;aa aatcagattt 1200 

tcccagcact ttgggaggct gagttaggag 1260 

cctgggcgac atagtgagac atcctctctc 1320 

1346 
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ctctgcagac acagagggtc ctgctgggta tgtggcttcg tagggcttgg gatacctggg 840 

tttccccaag gagagtagcc cctggttcca ggtgcttgct gacagcctcc catccctgca 900 

cagagaagag gcggaaggct tcagcctgkc aacggaacct gggctacccc ctggctatgc 960 

tgtgcttgct ggtgctgacg ggcctgtctg tgctcattgt ggccatccac atcctggagc 1020 

tgctcatcga tgaggctgcc atgccccgag gcatgcaggg tacctcctta ggccaggtct 1080 

ccttctccaa gctgggctcc tttggtgccg tcattcaggt tgtactcatc ttttacctaa 1140 

tggtgtcctc agttgtgggc ttctatagct ctccactctt ccggagcctg cggcccagat 1200 

ggcacgacac tgccatgacg cagataattg ggaactgtgt ctgtctcctg gtcctaagct 1260 

cagcacttcc tgtcttctct cgaaccctgg ggctcactcg ctttgacctg ctgggtgact 1320 

ttggacgctt caactggctg ggcaatttct acattgtgtt cctctacaac gcagcctttg 1380 

caggcctcac cacactctgt ctggtgaaga ccttcactgc agctgtgcgg gcagagctga 1440 

tccgggcctt tgggctggac agactgccgc tgcccgtctc cggtttcccc caggcatcta 1500 

ggaagaccca gcaccagtga cctccagctg ggggtgggaa ggaaaaaact ggacactgcc 1560 

atctgctgcc taggcctgga gggaagccca aggctacttg gacctcagga cctggaatct 1620 

gagagggtgg gtggcagagg ggagcagagc catctgcact attgcataat ctgagccaga 1680 

gtttgggacc aggacctcct gcttttccat acttaactgt ggcctcagca tggggtaggg 1740 

ctgggtgact gggtctagcc cctgatccca aatctgttta cacatcaatc tgcctcactg 1800 

ctgttctggg ccatccccat agccatgttt acatgatttg atgtgcaata gggtggggta 1860 

ggggcaggga aaggactggg ccagggcagg ctcgggagat agattgtctc ccttgcctct 1920 

ggcccagcag agcctaagca ctgtgctatc ctggaggggc tttggaccac ctgaaagacc 1980 

aaggggatag ggaggaggag gcttcagcca tcagcaataa agttgatccc agggtttgct 2040 

ttgttttttt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2100 

nna 2103 

<210> 122 

<211> 1212 

<212> DNA 

<213> Homo sapiens 

<400> 122 

gccaagcttg gcacgargtt ggtggcggcg tccggaggtg ctggtttgtt ctcggtgaac 60 

ggcgcgcggg gtctctcctg agtgcgagct acgggacctt cgccatgccg gggatggtac 120 

tcttcggccg gcgctgggcc atcgccagcg acgacttggt cttcccaggg ttcttcgagc 180 

tggtcgtgcg agtgctgtgg tggattggca ttctgacgtt gtatctcatg cacagaggaa 240 

agctggactg tgctggtgga gccttgctca gcagttactt gatcgtcctc atgattctcc 300 

tggcagttgt catatgtact gtgtcagcca tcatgtgtgt cagcatgaga ggaacgattt 360 

gtaaccctgg accgcggaag tctatgtcta agctgcttta catccgcctg gcgctgtttt 420 

ttccagagat ggtctgggcc tctctggggg ctgcctgggt ggcagatggt gttcagtgcg 480 

acaggacagt tgtaaacggc atcatcgcaa ccgtcgtggt cagttggatc atcatcgctg 540 

ccacagtggt ttccattatc attgtctttg accctcttgg ggggaaaatg gctccatatt 600 

cctctgccgg ccccagccac ctggatagtc atgattcaag ccagttactt aatggcctca 660 

agacagcagc tacaagcgtg tgggaaacca gaatcaagct cttgtgctgt tgcattggga 720 

aagacgacca tactcgggtt gcttyttcga gtacggcaga gcttttctca acctactttt 780 

cagacacaga tctggtgccc agcgacattg cggcgggcct cgccctgctt catcagcaac 840 

aggacaatat caggaacaac caagacctgc ccaggtggtc tgccatgccc cagggagctc 900 

ccaggaagct gatctggatg cagaattaga aaactgccat cattacatgc agtttgcagc 960 

agcggcctat gggtggsccc tctacatcta cagaaacccc ctcacggggc tgtgcaggay 1020 

tggtggtgac tgaaattagc tggacatggt tgcacacacc tgtaatcaca gctactcggg 1080 

aggttgaggc gggagaatcg cttgaaccag ggagttggag gttgcagtga gtggagatca 1140 

caccattgcc ctgcagccta agcaacagag caagattctg tctcaaaaaa aaaaaaaaaa 1200 

aaaaaactcg ag 1212 

<210> 123 
<211> 616 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> (17) . . (17) 

<223> n equals a,t,g, or c 
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<220> 

<221> misc_f eature 
<222> (580).. (580) 
<223> n equals a,t,g, or c 

<400> 123 

cmgctrctra gcaactnagt gggatscccc gggctgcagg aattcggcac gaggagaacg 60 

gctgcacgtg ggagatgctc cgtggatgtt tgtagaacgc tggcttccgt gtttcctcgt 120 

tgtggctgtg gtggtgtggg tctttgcctg tggacccgtg gaagacaaag aagacagttt 180 

tggatggtca agctattttc ttgcttcagg gctccctccc ctgctttttg aagcctcaca 240 

aaccaggact gtgagggcag gaaggcttgg ggtctttgtg tgctgagcct cattagggtt 300 

ttaagaacct ccctcctttc at.ctctagct tacgagaggg atgattcatt atcttccctc 360 

ctcaggctgc agtagaagca gacagtctct gcctccctgc ttgcctttcc tccctcccat 420 

tcactgttga ttattgccct caagaataac aggttgccca gctactcgag argcttaagt 480 

gggaggattg cttgacccca ggagttcgag gctgcagtga gctatgatcg cttcactgcg 540 

ctatagcctg gcagacacag agagacccta tctcaagcan acagacaaac aaaaaaaaaa 600 

aaaaaaaaaa ctcgag 616 

<210> 124 
<211> 536 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__feature 
<222> (536) . . (536) 
<223> n equals a,t,g, or c 

<400> 124 

ggacgagtgg ggagctggaa ggaggatgga gtgggaagat aatcttccct tggagttcag 60 

ctgtcccgtg accaaactcc tctctgtccc cagctggact cctctagatg ctcagatgct 120 

ccttctcttc tttccttctc tgtcacacca ttcttctgtt ccttggctct tctgctcatc 180 

tccttgtgga gscawaggtt tggggtttat atgagtacag gataggtgac atggtggatc 240 

aaaaggcaac attttgtgtg caaaaacagg aatgcctgtt cccattaggg tcatgggttk 300 

ccagggttga gggtggggcc tttgctaggg aaccaccctc ttctacccag tattttcctg 360 

tctcctgtct gtatcaatag gtacacaata twtattaaat taatkaatga ctatacatta 420 

tgaaatggga aatgcaaggt ataaaggaga attgctgtcc ttgaaaagaa atttagtttg 480 

tttttttgtt gagatggagt cttgctctaa gctagagtgc agaatgtaat caaggn 53 6 

<210> 125 
<211> 796 
<212> DNA 
<213> Homo sapiens 

<400> 125 

ggcacgaggt gacgtgtttc tgcatctgtt gccatgacaa gctccctgct tcacccattg 60 

ctgtatcccc agcacctctc tcactgcctg gcaagggaaa gcactcagaa gacgctgaat 120 

gaccargtag agtgatgggt tgtacagcac tgttactcct tttccatctc tgtgtcccat 180 

gtgaacctta tggcacccat gagaaggagc ttgtaccagg tttatacttt ctagtttaca 240 

gatgagaaaa caggatcaga gtggtacaga tattggtcta agtcacagag aaagtgaatt 300 

gtaaaagcag aaacagagca caggctgcct gacttctagt ccagtgcttt ttgctcaaat 360 

tgcctcttat ttctcaggtt attcttgaaa tggcagatgg ggattctgtt taatgaaaca 420 

aaagtgacaa ttctttcttt cttggagaga aggtggagac agggtctcac tctatcacac 480 

aggctggagt gcagtggctc aatcatggct cactgcagcc tcaatctcct gggctcaagt 540 

gattcttcca ccttagcctc cttgactcac tgggactaca ggtgcacacc accatacctg 600 

gctaattttt aaagtttttt gtagagacag ggtctcacta tattgtgcat tctggtcttg 660 

aactcctggt cccaagtgat cttcctgcct cggctttcca aagtgctgga attacaggca 720 

tcacccccat gcctagcctg aaaattcttt ctatgtcctt aacatcttct ttcccagtat 780 

ttctccatcc actcga 796 
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<210> 126 

<211> 1037 

<212> DNA 

<213> Homo sapiens 

<400> 126 

ggcacgagct cgtgccraat tcggcacgag ggtcatagtc cacagaggta aaagttaaca 60 

attctgatgc tcttgtatgt gcataccaga ggctctaggg aagaattccc tctttctttc 120 

ttccaccttc ttgtggctgc tggcattctt tggcttgtgg tcacatcact cctatcttga 180 

aggccagcat cttcaaatct gtttcttctt cacatagcct tctgtgtgtg cagtgccctc 240 

tacctctctc ttataaagac atttgtgatt aaatggaggg tttaggataa tctcgtcaag 300 

atccttaact taatcacaac tgcaaaaacc tctttcccaa ataaggtaac attcacaggt 360 

tccagggatt aggacctatt atctttggta agtattattc agcctaccac aatagctaaa 420 

acaattctga aaaagaagaa taaagtgaga gaaatcagtt tatctgattt cga tact tat 480 

tgtatagcta tggtaaataa ggctgcatgg tattaaagaa aggacatata tgaatgaaac 540 

agaatagagg acccagaaat agacccacac aaaggagccc aaattatttt taaccaaggt 600 

agaagacaat ttattggagg aaagacagcc ttttcaacaa atggtactat aacaattaga 660 

tatccatagg caaaaaaaaa aaaaagaatc ttgatctaag gctcacacct tatataaaat 720 

aatattaaac tcatggccag gcacagtgac tcatgcctat aatcccaata cactgggagg 780 

ctgaggcaag agtatcactt gaggccaggg gttcaagact agcctgggca acacagtgaa 840 

actctatctc tacaaaaaaa ttataaacta gctgggcatg gtggcacatg cctgtagtca 900 

caactactca cgaggctgag aagatcactt aagctgagtt gttcaaggtt ctaatgagct 960 

acaatcgtgc cactgcactc cagcctaggt gacagacaaa gaccccatct caaaaaaaaa 1020 

aaaaaaaaaa actcgta 1037 

<210> 127 
<211> 841 
<212> DMA 
<213> Homo sapiens 

<400> 127 

gggctgcagg aattcggcac gage teg tgc cgactctcag agcagggaac agegggggaa 60 

aatgtttaca ctccatgcac aatctgtgct tccagtccct caccctatgt ggeccaatag 120 

ctggctggat ttcacactta attggtattt ttttctgect tcttcccctg cccccactga 180 

ctcctctcct ctccctttga ttgtactcaa ggttctgggg cctgggccct gggtgggtac 240 

caacagctgc tcgctgttcc catgtcctct ctccagcttt gctgtgtttc tctgctacct 300 

aatctcagtg actgtgaaag gacattgtgt ctgagccatg gccagccgct ggctggcccc 360 

ctgatctgcc ccccttctat tgtttggatg gccatctcct gctgggcctc cctgactgta 420 

aaatctctgt actgtttgtt aggtttttgg tgggaggctg tgataagttc caatgagctg 480 

ccacttccct ggatatgtca agaagctgat ggcaacttgg ccaattctgg cagatatcag 540 

gcccccagtt cagccccagt caccctcttt tacacatgtg ggtcaaccac tgtgtgctca 600 

gagggtcagt cccttcctct gctgtgtttt tcttgagtcc ttgcactcac ttcccctgcc 660 

ccagtcacga tgacccctaa agcttccttt gcccttgctt tctagggcat ccctagtgaa 720 

ggggcaaacc tgagatttct ccgtggacct gaeagecaag gcagggcact gtctcctgag 780 

gccagtgcca geaegtgeat ggttcacaga aaaggatcct gggctcagaa tctcgagggg 840 

g 841 

<210> 128 

<211> 2128 

<212> DNA 

<213> Homo sapiens 

<400> 128 

gtctacctcc gggctgaaac gtcaccatgc ctccccacag acagaeggat ggacagatgg 60 

gcctccctgc acctgctctg tgggtgtggg ggctcctgct cagcagcagt ttccagaccc 120 

ttctccctgc tttccccaag ccacccgcct tgaatctggg gtgctctacc agacccatcc 180 

cctcatttct aaagatttga gccactagtc gtgtccctct ccctcagaaa tgccttggtg 240 

acacttggct gctttcaact cttccaccca tctgcctctt ggtctcatct ttaccttctg 300 

ctaaaggtcc tgacccccac ccccgccacg ccatggggca ccccatggtg gtgegtcett 360 

gggagcagct ctgtcccttt ccccgtggcc tttgccccgc ctcctatgac ttcgattccc 420 

acctgtcccc gacccctggg accactgacc gggcccgatc accctgtcac tgccctgtca 480 
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tctgcttacc ccacacggtg ctctgctgac ccaggtcttg ctgtctccca acagccccac 540 

gaggcttccc gtcgctcctg gacactgcag gctgagcccg ctgccccgcc gcctccatga 600 

ggaaggcttt tcctctgtga gccccaggcc accctttccc tcctttaagt aattacttaa 660 

gtcccttgcc agggccctcc cagtaccctt tctaaagaca cccctgcccc agcatgctgc 720 

aggctcctgc tccactttcc tctcaggccc tcgtcgctgt ggtgctgcct ttgttttctg 780 

tctctgccac ggcagggggt cagctccttg gaggtggggc ttctgccctt gctgtaccac 840 

tgcctggcac acagtaggtg ctcaataaag acttgcaggg tgagctgcct gaagaatagt 900 

caccagaggc cagaaatgtc tagagctctg ccggtagggt gactggccga ggagcctggc 960 

ctgcatgtgt gcgtgtgtgt gtgtgtgtgt gtgtgtgtga gtcagggttt atatgcaggt 1020 

gtctacagga gacatgctgg gttctgtgct gggtgtgagg aatatgggag cagaacccca 1080 

gggaggtggc agagacttgg gggccaaagg gctggggtgc aggggggcaa cagccaggtg 1140 

ccactggcca ccccagccgc agggagccct gcccaccctc caggtgcctg gatgtccaac 1200 

ctcactgcta ttcccacctc aagccaggcc tggagatgga ggccccatga ctcagccagg 1260 

gccggtttgc agctgcggct gacccagacg ggcgggcagc ccccagcccc cgggcctgca 1320 

cccaggacag ggccgccctc cctccctccc ccgcttctgg ctcctaggac aggattctct 1380 

gaattcagct cccctgaggc tggggccagg ttggaggcca ggcctggggg ctctgggctg 1440 

gggtcccaga taggggctgg gcggccaggc ttggaatctg gaatccagcc ccattcctgg 1500 

catctgcagg agcctcgtgg ggagggagac ttgggatgga cttcaaccag ccagggctgg 1560 

attcttgccc cggaacctgc attcctgggg cagccaaggg atccttccca cttctgggcc 1620 

cagcttggcc ctgcctggca ttcgaggccc atctggggct tgggggtgtc tccccaactc 1680 

tcagacataa ggacaccctt ccaagcttgt tccttcacct ggcggggccc tgagccccac 1740 

acccctcccc tgtcctttct ccatccgaca tcaagcgcct ccctgcctct gctcgcacag 1800 

tctctgagat ggggaactca gcacctcaca ggtgggccca gctctggtgc tgtctgtgtt 1860 

gggggagctg gggcagcccc caaaagacct tggagacaga ccctcagagg caggagcaga 1920 

ggctggcagt ggatgctgtg cctggaggcc ttgagggcga ggtgtgatga tgaggcccag 1980 

gctgcagggc tctttctggc tctccagctc cggagaacaa gggatttcct cctgctctgc 2040 

ccaccctccc cagccagtgc atgctcagcc tcagcaccgc acctgggcgc cctccatgat 2100 

ctgccccacc tggacacatg gctcgagg 2128 

<210> 129 
<211> 748 
<212> DNA 

<213> Homo sapiens 
<400> 129 

ggcacgagcg aaactgtttt ccaatgtggc tgaaccactc tgcatttcca ccagtaatga 60 

gaatgagagt tgctgttgct ccacggcctc accagcattt ggtggtgtca gtgtcttgga 120 

ttttagccat cctaataagt gttagtggct atcattgttt tcatttgcaa ttctcttaca 180 

tggtgtkgaa catctttccc catgtttatt tgtcatctgc atatcttctt cggccagtta 240 

tctgttcaga tcttttgccc gtttttgttt gcttgcatgt ttgtttgtgt ttgatttttt 300 

aaagaaagct ttttttatta ttgagttgta atagtgcttg tatagtgtgg ataacagttc 360 

tctatcagat aggtcttttg caaatatttt ccccaatctg tggactgtct tctcattctt 420 

ttgataaatg gctttaaaat aataatctgg ccgggcgcag tggctcatgc ctgtaattcc 480 

agcactttgg gaggccaagg gcagatcatc tgaggtcggg agttcgagac cagcctgacc 540 

aacatggaga aaccccatct ctactaaaaa tataaaatta gtcgggcgtg gaggcacatg 600 

cctgtaatcc cagctacttg agaggctgag acaggagaat ctcttgaacc cgggaggtgg 6.60 

aggttgcagt gagccgaaat cgtgccactg tattccagcc tggacaataa gagcaaaact 720 

ccatctcaaa aaaaaaaaaa aactcgag 748 

<210> 130 
<211> 297 
<212> DNA 

<213> Homo sapiens 
<400> 130 

ggcacgaggt gtgtttgtgt gtgtgtgtgg tgtgtgtatg tgtgtggtgt gtgtatgtgt 60 

gtggtgtatg tgtgtgtgtg gtgtatgtgt gtgtttgtgt gtgtgtggtg tgtgtatgtg 120 

tatttctttg aatgagaaat tggctcccat gattatggag ctgacaactc ccaaggtctg 180 

caggcagcaa gctggaggcc caggagggcc ggtggtgtgg ctgcagccag tgtctgaagg 240 

cctgagaacc aggagggcgg gtggtgcagc tgcagtgtga aagccggcag gctcgga 297 
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<210> 131 

<211> 1894 

<212> DNA 

<213> Homo sapiens 

<400> 131 

ggcacgagca gtctacctgg aaattgtcac attatacaaa tgtcaacttt tgtgtgtgtg 60 

tgtgtgtttt gttttgtttt gcggtcagag gcaagggcta aaagaaagca agatcagaga 120 

aataccaaga ggtgtttact gactaaaggg caaagggatc tatcagttaa ccaaagcaag 180 

ataaatagaa ctgccaatta actttatatt ctcagaagca gtgagcaaag aacgctgcct 240 

gaacaatgaa agtgttgctg caactttcat atttgctgtt gtctgcatgt aatttgtttc 300 

cttttacata gaaatatgtg gtattaacag agggatgtga ttagaatacc agcggaagct 360 

ctctttgata ggagacacac aggcaggtgc ctaacagcct atggagatca ggacagtttc 420 

tctccagtaa actcacaaat tgtggggacc atgatctgct taataagtaa aagggcaatg 480 

gggccaagat tacaatgttg aaaacatcca ggcttcccac ctggagtcct ggcctcacag 540 

taataataag aataaagatg tattgagata tatctagacc taactatata aatagacaga 600 

tagatataca cacatacaca cactgtgcta agatgcttca catgaactcc ctcatttcac 660 

cctcaaacaa ccacagggta gatggtttat caccgtttta gagataagaa aactccagtt 720 

agtacgtcac tgaagatcta cacagtgacg tagatgttgt gatagacatt tcttaaaaat 780 

attccaatta atcctcagaa cacctgtgag aagtatacta aatatactaa gctccatttc 840 

atgaatgagg aatcagagtc aaggagacga gataacatgt cccaggtgac ggtattagcg 900 

gtcatagcag gatttgagcc cagctctgtc tgtcttcaaa actcatgttt aggagactct 960 

tctgctttcc accaaagccc ttgatttgaa cctttgctct ctcctgaatc cacacttctc 1020 

ctgaaggagg agcaaggtgg agatgggata gggcacagga tggctgactc tctgactgga 1080 

gggcctaaga aaccccactt tgacacacac acagaaaact gtgccctggg tgggggtgtg 1140 

gggcttcatg agaaaatcaa gtagcaagag agagtcttaa catgcttaga tggcatgtgc 1200 

ctgttctcct gatttaatgg atgagaaaac tgagatccag ggcaagggca gtgagatagt 1260 

gagggtctct tagaatgagt acagccttca gggacccacc ccatgtaccc gtgggatcaa 1320 

gacgagccag aggatacctc ctaagtaaga acagaaggaa cagaaaaccc ttaaggtttg 1380 

ttgttgttgt tgtgacagaa tctcgctctg ttgcccaggc tggagagcag tggcacagtc 1440 

tcggctcact gcaacttctg cctcccaggt tcaagcgatt ctcctgcctc accctcccga 1500 

gtagctggga ttacaggcac ccgccaccat gcctggctaa tttttgtatt tttaatacag 1560 

acgaggtttc accatgttgg ccaggctggt ctcaaactcc tgacctcaag tgatccaccc 1620 

accccggcct cccaaagtgc tgggattaca ggcgtgagcc accgcacctg gcctgaaaac 1680 

acgtatcata cttgctatgt gccagacaca attctaacca cttttccaca gattaactca 1740 

gccttcaaac aatcctaaaa agtaggtatg attatttcct gcattttaca gccaaagaaa 1800 

ctgaagcaca gagagattaa gaggacttgt gcaaggtcat ggagggctat agtcctaccc 1860 

tctgaagtaa gttaaaccct ctccagaaaa agcc 1894 

<210> 132 

<211> 1355 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1327) (1327) 
<223> n equals a,t,g, or c 



<400> 132 

gcccctgctg 

gcctttacgg 

catttccctt 

tcactcacct 

tggggcacga 

tgtgcgttat 

caggctggag 

cgattctcct 

gctaattttt 

tttcctgacc 

cacaccacac 



gatggcactg tgggtaacct 
agcagactcc ttggcaaata 
ctgtgtcctc tttcgtgatc 
ggcatgcaaa ctcttttgtt 
ataaataatt ttccacacag 
gtctgtgtgt gtatccttat 
tgcagtggcg cgatctcggc 
gcctcagcct cccgagtagc 
gtatttttag tagagatggg 
ttgtgatccg cctgcctcgg 
ccggtcctgt gtatgttttg 



gcatcctttc 
aatgcctcag 
ttgaggtggg 
attgtgaact 
ctcacaactg 
ttttttgaga 
tcactgcaac 
tgggattaca 
gtttctccat 
cctcccaaag 
agacggagtc 



actgtgcaca tggttctcat 
tgcaggagcc acacgcaagg 
acttgggttt gaaggctttg 
ctctgacagt gctttaagtc 
tagggcttac atccagtgtg 
cggagtctcc ctctgtcacc 
ctccgcctcc tgggttcaaa 
ggcacccacc amcacgcctg 
gttggtcagg ctggtctcga 
tgctgtgatt ataggtgtga 
tcactctgtc acccaggctg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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aagtgcagtg gcaggatctc ttctcactgc aacctccacc tcctgggctc aagtgattct 720 

cctgcctcag cctcccaagt agctggtatt tcagacttgc accatgatgc ctggctactt 780 

tttatatttt tagtagagac ggagtttcac cagcctggtc tcgaactcct gacctcaagt* 840 

gatccaccca ccttggcctc ccaaagtact gggattacag acatgagcca tcacgcccgg 900 

cccctaagtg gatttttagg cattctttca ggtgggcctc tgtggtgaaa ccttttgtgc 960 

acatttcaca aacggcttct ccgctgtgtg gcatttctca gctttctcca ctgccttcac 1020 

aggaaacttc ttcccgcact cctggccgac gtcgctccct aggtgactgt gcggcaaaag 1080 

ctcagacctc aggacactgg tggctgttgt ccagcctagt gtctgcttac cccgcactca 1140 

tcccgtagtc acacgtgaag gcttgagggg tctggaactt cctggccgta gcaatggact 1200 

ttctgaactt tcttgctctt tcagaattgc gttttgaccc tgagtgtggt cgtgggtgac 1260 

tcgccggcct cccgccccgg ggtgtggtgc ctttgttctg agtcatcaca agtgccatca 1320 

tcctgancct agcwtctttc agatcaccct ctcga 1355 

<210> 133 

<211> 1382 

<212> DMA 

<213> Homo sapiens 

<400> 133 

cccacgcgtc cgctgaattg cggccgtatg cgcggctctg tggagtgcac ctggggttgg 60 

gggcactgtg cccccagccc cctgctcctt tggactctac ttctgtttgc agccccattt 120 

ggcctgctgg gggagaagac ccgccagctg cttgagtttg acagcaccaa cgtgtccgat 180 

acggcagcaa agcctttggg aagaccatat cctccatact ccttggccga tttctcttgg 240 

aacaacatca ctgattcatt ggatcctgcc accctgagtg ccacatttca aggccacccc 300 

atgaacgacc ctaccaggac ttttgccaat ggcagcctgg ccttcagggt ccaggccttt 360 

tccaggtcca gccgaccagc ccaaccccct cgcctcctgc acacagcaga cacctgtcag 420 

ctagaggtgg ccctgattgg agcctctccc cggggaaacc gttccctgtt tgggctggag 480 

gtagccacat tgggccaggg ccctgactgc ccctcaatgc aggagcagca ctccatcgac 540 

gatgaatatg caccggccgt cttccagttg gaccagctac tgtggggctc cctcccatca 600 

ggctttgcac agtggcgacc agtggcttac tcccagaagc cggggggccg agaatcagcc 660 

ctgccctgcc aagcttcccc tcttcatcct gccttagcat actctcttcc ccagtcaccc 720 

attgtccgag ccttctttgg gtcccagaat aacttctgtg ccttcaatct gacgttcggg 780 

gcttccacag gccctggcta ttgggaccaa cactacctca gctggtcgat gctcctgggt 840 

gtgggcttcc ctccagtgga cggcttgtcc ccactagtcc tgggcatcat ggcagtggcc 900 

ctgggtgccc cagggctcat gctgctaggg ggcggcttgg ttctgctgct gcaccacaag 960 

aagtactcag agtaccagtc cataaattaa ggcccgctct ctggagggaa ggacattact 1020 

gaacctgtct tgctgtgcct cgaaactctg gaggttggag catcaagttc cagccggccc 1080 

cttcactccc ccatcttgct tttctgtgga acctcagagg ccagcctcga cttcctggag 1140 

acccccaggt ggggcttcct tcatactttg ttgggggact ttggaggcgg gcaggggaca 1200 

gggctattga taaggtcccc ttggtgttgc cttcttgcat ctccacacat ttcccttgga 1260 

tgggacttgc aggcctaaat gagaggcatt ctgactggtt ggctgccctg gaaggcaaga 1320 

aaatagattt attttttttc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1380 

aa 1382 

<210> 134 
<211> 791 
<212> DNA 
<213> Homo sapiens 

<400> 134 

cccacgcgtc cggttctcct gcttgccatc aatggagtga cagagtgttt cacatttgct 60 

gccatgagca aagaggaggt cgacaggtac aattttgtga tgctggccct gtcctcctca 120 

ttcctggtgt tatcctatct cttgacccgt tggtgtggca gcgtgggctt catcttggcc 180 

aactgcttta acatgggcat tcggatcacg cagagccttt gcttcatcca ccgctactac 240 

cgaagagccc ccacaggccc ctggctggcc tgcacctatc gccagtcctg ctcgggacat 300 

ttgccctcag tggtggggtt actgctgttt cggaggtatt cctctgctgt gagcagggct 360 

ggccagccag actggcacac attgctgtgg gggccttctg tctgggagca actctcggga 420 

cagcattcct cacagagacc aagctgatcc atttcctcag gactcagtta ggtgtgccca 480 

gacgcactga caaaatgacg tgacttcagg gaagcctgga cacccgaggc acctggacca 540 

gctatgggta gttctgtggg tggaacacat tctgtgtaag agccccac tg agggctctgc 600 

agcggagtga cagcaacccc agagatgagg caccagagag tgccactgca tgagacacct 660 
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gtgaccattc gaagtctgaa atgcgggggg ggagtttcat ttttaagtga agaccaaaag 720 

ccctttaaaa ataatagttt tttatcattt tatagtaaaa aaaaaaaaaa aaaaaaaaaa 780 

agggcggccg c 791 

<210> 135 
<211> 2163 
<212> DNA 

<213> Homo sapiens 
<400> 135 

cgcccacgcg tccgaggcgg cggagcccca gccccaccca gtgcggagcg cgccgcgagc 60 

cccgccgyaa gctgagcgcc tccgcccgcc aggcgcgccg gcgccgggcc atgtactcgg 120 

ggaaccgcag cggcggccac ggctactggg acggcggcgg ggccgcgggc gctgaggggc 180 

cggcgccggc ggggacactg agccccgcgc ccctcttcag ccccggcacc tacgagcgcc 240 

tggcgctgct gctgggctcc attgggctgc tgggcgtcgg caacaacctg ctggtgctcg 300 

tcctctacta caagttccag cggctccgca ctcccactca cctcctcctg gtcaacatca 360 

gcctcagcga cctgctggtg tccctcttcg gggtcacctt taccttcgtg tcctgcctga 420 

ggaacggctg ggtgtgggac accgtgggct gcgtgtggga cgggtttagc ggcagcctct 480 

tcgggattgt ttccattgcc accctaaccg tgctggccta tgaacgttac attcgcgtgg 540 

tccatgccag agtgatcaat ttttcctggg cctggagggc cattacctac atctggctct 600 

actcactggc gtgggcagga gcacctctcc tgggatggaa caggtacatc ctggacgtac 660 

acggactagg ctgcactgtg gactggaaat ccaaggatgc caacgattcc tcctttgtgc 720 

ttttcttatt tcttggctgc ctggtggtgc ccctgggtgt catagcccat tgctatggcc 780 

atattctata ttccattcga atgcttcgtt gtgtggaaga tcttcagaca attcaagtga 840 

tcaagatttt aaaatatgaa aagaaactgg ccaaaatgtg ctttttaatg atattcacct 900 

tcctggtctg ttggatgcct tatatcgtga tctgcttctt ggtggttaat ggtcatggtc 960 

acctggtcac tccaacaata tctattgttt cgtacctctt tgctaaatcg aacactgtat 1020 

acaatccagt gatttatgtc ttcatgatca gaaagtttcg aagatccctt t.tgcagcttc 1080 

tgtgcctccg actgctgagg tgccagaggc ctgctaaaga cctaccagca gctggaagtg 1140 

aaatgcagat cagacccatt gtgatgtcac agaaagatgg ggacaggcca aagaaaaaag 1200 

tgactttcaa ctcttcttcc atcattttta tcatcaccag tgatgaatca ctgtcagttg 1260 

acgacagcga caaaaccaat gggtccaaag ttgatgtaat ccaagttcgt cctttgtagg 1320 

aatgaagaat ggcaacgaaa gatggggcct taaattggat gccacttttg gactttcatc 1380 

ataagaagtg tctggaatac ccgttctatg taatatcaac agaaccttgt ggtccagcag 1440 

gaaatccgaa ttgcccatat gctcttgggc ctcaggaaga ggttgaacaa aaacaaattc 1500 

ttttaattca' acgggtgctt tacataatga aaaaaccact tgtggcacac gatgggcatc 1560 

taacatcatc atcttctaat gtgttggaga ttttcatttc aaatatattt tttaaattac 1620 

tctattttcc aaaacacgta atgcattttt ctcgaaaata ccttactgta aaaataactg 1680 

tcgcgtacac atgtgtgaag tagctagaac atactgaatt ttttttgtac tgttggactc 1740 

tattcagtgt catgtcctat atctgatcaa gttatcaagg agataattct agaatgaaaa 1800 

agaaaatcct cttgttggaa acaaaagacg ttttatatgt gcagtatgac aaagaggagt 1860 

ttcagagaca actttgaatc cttgtcagcc tggagaccag caccagagga atctacaagg 1920 

caaactccca tatatttgct tcccccaaat tgctgcccct acagactcaa agctcttttt 1980 

ctttgttttg ttgtttctct aaaaatttac tgttctttgt cgatgctata taagccaggg 2040 

agttctaaga cgccagctct ttgagatttg ctcattcccc tgtatttccc acatatatat 2100 

tacatatacc cgctaataaa tttatgtttg ttttaaaaaa aaaaaaaaaa aactcgaggg 2160 

ggg 2163 

<210> 136 
<211> 2087 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<222> (1) . . (1) 

<223> n equals a,t,g, or c 

<400> 136 

ncccacgcgt ccgctgttgc tcaaaggaaa taggagttgg tgtgcttgtg accaaggggt 60 

tacacttcca gcttttaaaa ttctccttta catgtgctca gtgttttgtt ttgtgttttg 120 
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gtttctgttt tttattttaa ttcccacatt gggcacaaga atcagaatat ggatagctag 180 

tttaagaaac ttttgtgggt gcactgtagc atagatgaca gaatttgatg ttccccccat 240 

ctccaattca gttcagggca ttccacagtt aaacagaaat gggaacgtgg ggctcttata 300 

aatgaaatgg gcgctcacag ttttggtttt cagctcttca tgtctgtaag tgtgctttgg 360 

gggaggctat gtctgtatgg tcgattctca gttatcacat ttgcctctcc tcccactacc 420 

ttcatggaca ttcagtgctg tttcgcactg cagttagaga gaagggacgg acagttggtg 480 

acactcagcc acattgctac ttttatctgt tctggtaaga agttagatag atggtagatt 540 

gaagcaattg ggtagaatta gttgggggaa tatttatgag ttgctgtgtt tgttgattag 600 

ttccatctct ttcccatttt aactgagaat tgattatata tagctctaag tatataggta 660 

tttaaacaac cccacaagcg gctgtatcag taacatttat taattccact atagtgaggg 720 

aggatttcca ttctaaatac cttattttga gggatttata aaacttagtt gtaaaagaga 780 

aagcccacat agtgggaata aattgcttca gccattttta gtatttgaga gcactaggga 840 

agatgtttag tagctgtgtg gatgcctttt ttcacaccct gtctattgaa tgctgcatcc 900 

attcacgaag ttaaatgtta catgcagtta gtccttaatg tggactggat ctgtactttt 960 

gttttggatt aaaacattta aagatttttg aagtgcagct actccccacg tggcatttga 1020 

tacacataaa agtcatactg tgtgtgcaca aagagtacat ggattttcca gcatattgct 1080 

ttaaaaaatt atataaactg ttaaaatatt aacacctcag gctacctgct gtattctgtc 1140 

ccattgaccc ctggaattgg atttactgca agtgattgat aattcaatta tgtggctttt 1200 

cccctttaat cttgccattt aaattacagt agaaagacaa aatcaagtaa aataaagtgt 1260 

tagataatag aaagagtgtt aagaccagcc cacttttctc atgtttatgt tctttcattt 1320 

ggaccaagaa tctccgcatg gaggttgatt tgccactggg gactttggct aagactatta 1380 

ggtttgcttt caactagatg ttcctgagac aagcagaggg acactgcaat tccccttcca 1440 

tgcctgctgt fcctcccccat gtaagtcttc tttgaaatta acggatgtgt ctcctttgga 1500 

acagccccat aacaaaagag aactactgat ctgagcatag gaaagtagag gctctaccac 1560 

ttttcagttg aaaaagcaag actttctctg tgtttctgaa acaaggcata atgttgtcac 1620 

agaatcagag atccagtctc acttttccac aaatctccaa atctccagtc ttatcttgtg 1680 

tgctctaatg gtttggttca atccctttcc aactcttgtt ttcaaagcat ggggcctgag 1740 

tgttctccac tcctcctaag aaaggagctt gggtggaagg gaccatgctg acctcctcca 1800 

tcagagggct cttccagtag tattctcgga tgcaacctcc atttctcagt taccattatt 1860 

tcctgtatca gctttgtcct ' tcctggaggg atgcacagtg atccggccca ccactgttgt 1920 

tgtcttgtgc ttctgctctt tcctatggtt tcaggttatt ttctgggttt cccctattct 1980 

tcttttattt cctttttttt ttatatttgc tttcctttct actgctttta gatttgcagg 2040 

agatgcaagt ttcagctcaa tgtttggctt ctctcaatat ggaaatt 2087 

<210> 137 
<211> 830 
<212> DMA 

<213> Homo sapiens 
<400> 137 

ggcacgagta aggactgtgt tctttatgca tttcttgatc caggcatggc agttcctctt 60 

ttcctgtaca tattcacact cctgccactt ctaccctttc tcttatccct ctgcttttca 120 

cctctgactg taaaaagaag tagcagttcc gaaagcaaga gttccctatg aacacggaag 180 

aagacattgg caacttttga gtacaacaac tatatttaat agagtaattt aagaacatca 240 

gccagtgaat tttatacaag atagtgaaag agaaaaggaa gattaattag gggtagttta 300 

ggatgccatt aaatagccta gaattagggg agtagtcgtt gaatagaaag gaggccacaa 360 

atttgaggga tataagctaa gaattggtaa gccaagaaga aggaaaaggt ttgggcagta 420 

aggataatga ggaacaaaat agagaactca gaagcaatat ctgactgtta tcattggaag 480 

aatttttttg cttgcttgag gctggatatt gaagtggatc aggatacttg agtgactatc 540 

tgatgggctt ttggaactag ctctcaagag gtgaaaatta gctttttttt ctttttcttt 600 

cttttttttt ttttttgagg caaggtctca ctgttgttga ggctgaacct cctgggctca 660 

agcagttgtc ccattgcagc ctcctcagat actctgtaag ccaaggcagg gggaatattt 720 

tgtgctcagt agtttgaggc tgtggtgagc taagatcaca ctgctgtgct cacttcagcc 780 

tgggcaacac agtgaaaccc cgtctccatc tgtttaaaaa aaaaaaaaaa 830 

<210> 138 
<211> 1939 
<212> DNA 

<213> Homo sapiens 
<400> 138 
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gaacacaaac atgcagtctg tagcagatgg taataggctg ayatattaca cttgttgatg 60 

taaatctgat aggtttcttt ctctccaagg acagcttttt aaatatttaa cagtatcaat 120 

aatttttcag tttctgtgag aattttataa tttataattt gcagacttaa tgtataatct 180 

attttgtcct aacaattaca aatatatttt ttatttcaga ttrtatatat tcctaccaga 240 

tggagataat tacagcttta aaaattttta ttttttcatt ttatttcaca cattgacatt 300 

aaatttttat ggacacataa taactgtaca tatatatggg gtagaatgtg atgttttaat 360 

acatgtactc aatgtgtaat gatcaaatca gggtaatttg cataatgatt tttctgtagg 420 

gagaaaattc aaaatctact cttctggcta ttttcaaata tataatatgt tattgttaac 480 

tatactcatc ctactatgca ataggacacc agaacttatt cctgggttct acatccgtta 540 

aggcaaccaa ggattggaaa tattggaaaa aaaaattgcg tctgtactga acatgtacag 600 

acttttttct tgtccttatt ccttacacaa tatagtacaa taactatttg catgacattt 660 

acatcggata ttatgagtga tctagagttg atatgaagta tatgggagga tgtgcaaagg 720 

tgatgtgcaa atactatgtc attttatatc agggacttga gtatcctttg ttaycctcag 780 

gagatcctga aacyagtccc ccatggatac tgagggctga ctgtatagtc ctatcctcac 840 

ggaactttca ttctaatgrg ggaagactga ctataaacaa aatatatgta ataggtggtg 900 

gtaagtaccg tggagaagta acaaatgggg caaagtgagt tatacagctc catycttaga 960 

aaccttggag tacttttctt agtttatact cgtggtggtt tccttttgtc tcctttatta 1020 

catgggactc tgacatgtgc ccatagctag ggtggcagta ggatctaccc gaaaagcgtc 1080 

ctgctgatac aggaccaaag catcctgttg ttctcgagcc tataaaaaga gctaatggtc 1140 

ttgcttctct taactgtggc ctcctacact gtgttttgga tgattggtga tgtcttggat 1200 

attctgtttc tttggaactt tgaatataca acactttact agggaattag caatggaagc 1260 

agagcaaaga tgtacagagg aaacaatgcr taactctgat ggaattgaag tcatgaggca 1320 

gcagagagct taaattasag ctttaaaaat ttttattttt tagagggaat ttamttggga 1380 

gtaacagcag taatagttaa cggagccaga atgcttgagt catataattg caaagcagag 1440 

ttgggagcaa cagatgctaa agagtagttg ctgtagttcc tctttgggtc gtaggagcag 1500 

ttgtcatrtt mctatayagc tactgcatga agaagagttc ttagtgaggc ctgggtgaac 1560 

agctcttctt agtattctgt gtgaccccat tygacctttt aacaaatccc taagtaaata 1620 

aatagcccct maggwaaact aagtttttct ctgctgtttt tttgcttgag agagctataa 1680 

ctgtaataga cttatatttc tgaacatttt agtgcttgcc aatatttggt aatatttatg 1740 

tttcctatat ttgtaatgaa cattcttctt cmggtacatt tyttgttaaa ttattgttts 1800 

atgsataaaa gttcaccttt tattgtataa aattgactca gattaattta tacacattga 1860 

caatgggtaa atagagtttt tcagattatt aaaagctgaa ggatgcccat gtaagcaaaa 1920 

aaaaaaaaaa aaaactcga 1939 

<210> 139 

<211> 2410 

<212> DMA. 

<213> Homo sapiens 

<400> 139 

ccacgcgtcc gcttcgacga cgacacctgc agaagtgcgg agcccgccat gccgcgccac 60 

ctctcgggac tgctcctgct gctctggccg ctgctgctgc tgctgccgcc gacccccgcc 120 

gcccccggcc ccctggcccg cccgggtttg cggaggctgg gcacgcgggg cccagggggc 180 

agtcccgggc gccgccctgt ctctgctgtc cccacccgcg cgccctattc cggggccggc 240 

cagcccggcg gggcccgagg cgcaggtgtt tgcaggagca ggcccttgga tttggtgttc 300 

atcatcgata gttcccgcag tgtgcggccc ctggagttca ccaaagtgaa gacctttgtc 360 

tcccagataa ttgacactct ggacattggg gcggcagata cacgggtggc agtggtgaac 420 

tatgctagca ccgtgaagat tgagttccat ctccagaccc actcagataa acagtccttg 480 

aaacaggctg tggctcggat cacacccctg tctacaggca ccatgtccgg cctggctatc 540 

cagacagcaa tggatgaggc cttcacggtg gaggcaggag ctcgggggcc cacttccaac 600 

atccctaagg tggccatcat cgtgacagat gggaggcccc aggaccaggt gaatgaggtg 660 

gcggctcggg cccgggcatc tggtattgaa ctctacgccg tgggcgtgga ccgggcagac 720 

atggagtccc tcaagatgat ggccagcgag cccctagacg agcacgtttt ctatgtggag 780 

acctacgggg tcattgagaa actctcctct agattccagg aaaccttttg cgctctggac 840 

ccgtgtgtgc ttggcacaca ccggtgccag cacgtgtgtg tcagtgatgg ggaaggcaag 900 

caccactgtg agtgcagcca aggctactcc ttgaacgccg atcagaagac gtgttcagct 960 

atcgataagt gtgctctgaa cactcacggt tgtgaacaca tctgtgtgaa cgacagaact 1020 

ggctcttacc actgtgagtg ctacgaaggt tacaccctga accaagacag gaagacttgt 1080 

tcggctcaag accaatgtgc ctttggtaca catggctgcc agcacatttg tgtaaatgac 1140 

agagatgggt cccatcactg tgaatgctac gagggttata ctctgaatgc tgacaacaaa 1200 

acgtgttcag ttcgcagcga gtgtgctggg ggctcgcacg gctgccagca cctgtgtgtg 1260 
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gacgacgggc ccgcggccta tcactgcgat tgtttccccg gctacaccct gaccgaagac 1320 

cggaggacgt gcgcagccat tgaagaagca cgaagactcg tctctacaga agatgcttgt 1380 

gggtgtgaag ccaccctggc cttccaggag agggccagct catatctgca gagactgaat 1440 

gccaaactcg atgatatttt gggcaagttg caagcagatg cgtatggaca aatacatcgt 1500 

tgaattactc agatttttca cctggatata cggagagctt ggtctattta atatttttgc 1560 

atacttcaat gttcctgcta ataatttgcc attgcaaatg ctttaatatt actggataag 1620 

tagtatgagg atcttctaga gaatcagtag gacataaacg ttcacatcct taagagcaaa 1680 

ctttagtgtc tctaagctat gactgtgaaa tgattcatgg ggaatagaat gaaaagtttg 1740 

gtatctcttt atttaccaat tgagccattt aatttttaaa tgtttatatt agtaagataa 1800 

ccattcttac aatgggaact ttttatctat tttctcttga tagtatttat agtataaacc 1860 

agttttatta ttgagagtgt aaattataca agtatttaca cataaaaaag ttcatataat 1920 

tgaggtaaat ataatttaga actgtttctt taatgctttg ttttttgctc actttttgct 1980 

ggaatatcac tgaagctgtg atcaggggat tataacacat atcaagatca agtgaacact 2040 

acatgaaata ttgtaagaaa cacataacta aagactttag ttttgaatta agtgttataa 2100 

cttcttacca agttttggta aaaaatccta cattatcttt actgtttcac tttaggattc 2160 

aatcaagaaa attatatact tataaatatt gatctaaaaa gttaacaaca aacccaatgt 2220 

cgccatttta aagtttaagc ttaacttttc ttcacttaca tatttagtat atgtatttta 2280 

tttttccgct tgaaagctta tagctcttag gagaaaacca tcctttaaat tgtgactact 2340 

cattttttct gtttgtattg tctttagtat aataaaaagt tactatcttt ataaaaaaaa 2400 

aaaaaaaaaa 2410 

<210> 140 
<211> 1491 
<212> DNA 

<213> Homo sapiens 
<400> 140 

ccacgcgtcc gggagccatg gcgccgtccg ggccgctgct gctggtgctg ctcgtgccgc 60 

tggccgccgc gcgggccggg ccctacttcc gtcccggccg gggctgccgc ctgcccctgc 120 

ggggggacca gctgtcgggg ctggggcgca ggacctaccc ccggccgcac gagtacctgt 180 

ccccatctga cctgcccaag agctgggact ggcgcaacgt gaacggggtc aactatgcca 240 

gtgccaccag gaaccagcat atcccccagt actgtggctc ctgctgggcc cacggcagca 300 

ccagtgccat ggcgggaccg gatcaacatc aagagaaagg gggcgtggcc ctccaccctg 360 

ctgtccgtgc agcacgtcct cgactgcgcc aacgcgggct cctgtgaggg gggcaacgac 420 

ctgccggtgt ggaggtacgc ccatgagcac ggcatcccgg acgagacctg caacaactac 480 

caggctaagg accaggaatg caacaagttc aaccagtgtg gaacatgcac ggaattcaag 540 

gagtgccact acatccagaa ctacacgctc tggaaagtgg gtgactacgg ctccctctcc 600 

ggcagggaga agatgatggc ggaaatctat gccaacggcc ccatcagctg cggtatcatg 660 

gccacggaga agatggtgaa ctacacggga ggcatctacg cggagtacca ggatcaggcc 720 

tacataaacc acgtcatttc tgtggtcggc tggggcgtca gcgacggcac ggagtactgg 780 

gttgtccgga attcgtgggg ggaaccgtgg ggggagcacg gctggatgag gattgtgacc 840 

agcacctata aagacgggca gggcgccagt tacaacctcg ctgtcgagga cacctgtacg 900 

tttggggacc ccatcgttta agggacaggt ctccccagaa gagcagtgtt atcgtgaacc 960 

ataatcaggg ggtcctatcg ctctgggcac tgggttggtt ccaccatggt ctgaagggac 1020 

tggggactgg catcaaacgt gtctgatggc tgctcgcggc cccgtgcgcc cagaagggag 1080 

aaggggcgcc tgtcagcaca cagcctgccg cggcgccggc cgggagcgcg ctcctgggga 1140 

agagtctgca atgggacggc tgagagcccc gggccggcca ctgccctgcc ccagtgtctg 1200 

cctggccacc gtgtgatccg caaggcccaa acgatgtgac tgccaagctc ctctgtccct 1260 

gatttggtgt ttcctgtctg gcagctgtgg tccatgatgt ggtgcggaag cccaggcttc 1320 

tcaaagctct tacgttgcct gggattcggt gggggggagt cggggggtgg agggagaaga 1380 

cggccctgtg agattgccca agtgatgaat aaagtacgtg accccgcaaa aaaaaaaaaa 1440 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 1491 

<210> 141 1 
<211> 3530 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (30) . . (30) 
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<223> n equals a,t,g, or c 
<220> 

<221> misc_f eature 

<222> (59) . . (59) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (79).. (79) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (3465) . . (3465) 
<223> n equals a,t,g, or c 

<220> 

<221> mis cofeature 
<222> (3471) . . (3471) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (3485) .. (3485) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (3487) . . (3487) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (3505) (3505) 
<223> n equals a,t,g, or c 

<400> 141 

gacatgaagc caaccggcac ccggccccan atttgttata acagtactct. aatttgaanc 60 

catgataaaa ataaccatnc atttgtaaat ccacagagga ccctggtgaa ttcctaacac 120 

atattttctg tcttgttaag ctgttgaatt gcatcatggt ggagaaatgt gtaaatagaa 180 

gtaatgatct tggtgtttgt aaaaggggta tttgaaaggt tcccgaaagt gtgagggggc 240 

tgatagcaga tctgcagctt ctccccccgt gcactaaagc caatcataat tctttctacc 300 

tggtttgtgt catcctccat taggagaccc catcttcaga accaatggag gaagaggaag 360 

atgacgactt ggagctgttt ggtggctatg atagtttccg gagttataac agcagtgtgg 420 

gcagtgagag cage tec tat ctggaggagt caagtgaagc agaaaatgag gategggaag 480 

caggggaact gccgacctcc ccgctgcatt tgctcagccc tgggactcct cgctccttgg 540 

atggcagtgg ttctgageca gctgtctgtg agatgtgtgg tatcgtgggt acaagggaag 600 

ccttcttctc caagaccaag aggttctgea gcgtctcctg ctccaggagc tactcctcca 660 

actccaagaa agecagtate ttggctaggt tacagggaaa accaccgacc aaaaaageca 720 

aagtcctgea caaggctgcc tggtctgeca aaattggagc cttcctccac tctcaaggga 780 

caggacagct ggcagatggg acaccaacag gaeaagaege tctggtcttg ggcttcgact 840 

gggggaagtt cctgaaggat cacagttaca aggctgctcc cgtcagctgt ttcaagcacg 900 

tcccactcta tgaccagtgg gaggatgtga tgaaagggat gaaggtggag gtgetcaaca 960 

gtgatgctgt gctccccagc cgggtgtact ggatcgcctc tgtcatccag acagcagggt 1020 

ategggtget getteggtat gaaggctttg aaaatgaege cagccatgac ttctggtgca 1080 

acctgggaac agtggatgtc caccccattg gctggtgtgc catcaacagc aagatcctag 1140 

tgcccccacg gaccatccat gecaagttea ccgactggaa gggctacctc atgaaaegge 1200 

tggtgggctc caggacgett cccgtggatt tccacatcaa gatggtggag agcatgaagt 1260 

acccctttag gcagggcatg cggctggaag tggtggacaa gtcccaggtg tcacgcactc 1320 

gcatggctgt ggtggacaca gtaatcgggg gtcgcctacg gctcctctac gaggatggtg 1380 
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acagtgacga cgacttctgg tgccacatgt ggagccccct gat ccaccca gtgggttggt 1440 

cacgacgtgt gggccacggc atcaagatgt cagagaggcg aagtgacatg gcccatcacc 1500 

ccaccttccg gaagatctac tgtgatgccg ttccttacct cttcaagaag gtacgagcag 1560 

tctacacaga aggcggttgg tttgaggaag ggatgaagct ggaggccatt gaccccctga 1620 

atctgggcaa catctgcgtg gcaactgtct gtaaggttct cctggatgga tacctgatga 1680 

tctgtgtgga cggggggccc tccacagatg gcttggactg gttctgctac catgcctctt 1740 

cccacgccat cttcccggcc accttctgtc agaagaatga cattgagctc acaccgccaa 1800 

aaggttatga ggcacagact ttcaactggg agaactactt ggagaagacc aagtcgaaag 1860 

ccgctccatc gagactcttt aacatggatt gcccaaacca tggcttcaag gtgggcatga 1920 

agctggaggc cgtggacctg atggagcccc ggctcatctg tgtggccacg gtgaaacgag 1980 

tggtgcatcg gctcctcagc atccactttg acggctggga cagcgagtac gaccagtggg 2040 

tggactgcga gtccccagac atctaccccg tcggctggtg tgagctcacc ggctaccagc 2100 

tccagcctcc tgtggccgca gaaccggcca caccgctgaa ggccaaagag gccacaaaga 2160 

agaaaaagaa acagtttggg aagaaaagga aaagaatccc gcccactaag acgcgacccc 2220 

tcagacaggg gtccaagaag cccctgctgg aggacgaccc tcagggtgcc aggaagatct 2280 

cgtcggagcc tgttcctggc gagatcattg ctgtgcgtgt gaaggaagag catctagacg 2340" 

tggcctcgcc cgacaaggct tcaagtccag agctgcctgt ctccgtcgag aacatcaagc 2400 

aggaaacaga cgactgagcc ttcctgcctc cagcctggct tctagctgga agccagccca 2460 

gcgtttctct accaccacca ccatgcctcc acctgacttt ggcttggaga ctgatcctct 2520 

ctgtgtaaat tctgcccggt gctgtgaagg ctggacggtg gaggacctgc tggggtctcc 2580 

tgggacccgc ctgttgcttc tgccctcccc tgtggaaagg tctatatgac gggccgcctg 2640 

aggccccaga actcgtctgt gaaccacctt ttccagccag agttcccaaa gctggaacgc 2700 

tagctgcctg ctcttcctta agatggcctc cccccgaccc gccacggccc tcagttgcca 2760 

gggatggggc caccactgtc acactgtgga atacaagaca gtgaactctg tctgcctgaa 2820 

cgagtcatgt aaattaagtt ctagagcagc tctctgagca ggataaggtc ccctgacagt 2880 

gagttgtgtg gtgggggcag cctctgcctc aaaaattcac caagcagaat gcctctcagc 2940 

ctcatgtgtt ggtcctctgc tcctcctagc tccccaggga tgttggggac ccagcttgtc 3000 

tcggcagcta agaagcagtg accaggatgt ggattttggc gacctgtgtg gtggccttga 3060 

gctgctttct gtgtttgtga ggactgactc ccatttccta aaggaaatgc ccccggggag 3120 

gacattggga ggaagatggc ctgagtgtgc actttggctc tgctacctgc tcctgaagcc 3180 

ccgctaaaaa taattcatcc aagattcctt tgtagttaaa gggtccagtt ctgactggag 3240 

cctctagaga gctgggcttg tatgttcttt tggccttttg ttcctaccta aatgaagaaa 3300 

ccatgcctgg aggggccgtg aacacagaac cctcaagaca aggatgacag agctggagga 3360 

cacatctagc tgccattgca acctcactgg gctccccaga ctctgtgtgt gagaaattaa 3420 

accccctgct tgcttgagtc ccgtttgtta aggattcttc tattnccttg ncagatttgt 3480 

gttcngnacc ccaccccccc acctngtggt attcggttca gaaagggcca 3530 

<210> 142 

<211> 1145 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (386) . . (386) 
<223> n equals a,t,g, or c 

<400> 142 

aattcggcac gaggacatat tggccattta ctctactaat aaaagagtac tatctactca 60 

gtgtcattta ctgttactgt agtaattaat gcctttggaa gaatcttttg aaatagttct 120 

caaattggta ccactacttg gtttggaatt attttttttt cttttcataa tcaatggtta 180 

tataaatgta tattgtccta gtcagtattt tatatatgct aaggactcac tagctggctt 240 

ggcactaata cctcaataaa aggaatactt cttttggaat catgaaacaa aagtgartaa 300 

acctccaagt tatttttcca accaaccttc tttgaaaaat cttggatgag tcactcaaat 360 

caagacatgt tataaaatta tctgtnattt tggtagaaca tatacattgt yctaataata 420 

atttycaaat attcagtgka acygtaagka tgagaataca ggttgaatat cycttatcca 480 

aaatgcttgg gaccagaagt cttttggatt ycaaattttt aaatatttac atcatactta 540 

ccagtttaac atccctaatt caaaaattca aaattcagaa tgctccaata atcgtttcct 600 

ttgascatca tgtcagtgct caaaaagttg cagattttga ggtatttcag atttttgaat 660 

taggaagact caacttgtac tatcattcta tagactttat gattgggtag actacatgag 720 

tattgaaccc agaaatcatt gtctagcaaa agccagtata gtgattaatt accctgtgac 780 
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tattatataa tgttcaaaaa agctaacata ttagaatgtc cttagcgtgc agagagcaaa 840 

cagagacaaa aagaaaagtt accctgaaaa gtttgtcaga aaaatagaat atcagacgct 900 

raactactca tccagaattt tgtcraaaaa gaaaaataag ataaaattca ctggtagaca 960 

aaaagtagta acataccagt ttgtaatttc tcagtttcaa accatgaata tgtatttgta 1020 

taccaaaaat catttcagga gtcagagaag gaggatatgc cttttatgtg gagactttaa 1080 

acataaaatt ggaaaaaaaa aaaaaaaaaa actcgtaggg ggggtcccgt acccaatcgt 1140 

cctgt 1145 

<210> 143 
<211> 2214 
<212> DNA 

<213> Homo sapiens 
<400> 143 

gcagtcgcag catgctttcc gaggaagccg gtgttgccga gattgccaaa atgctttgga 60 

gtttttaact gaatctaaga aaagtccaaa atagatttga gactgtaaaa acagaaactg 120 

cagcaagggg gattcagtgc caatgcatca acaaaaaaga caaccagagt tagtggaagg 180 

aaatcttcct gttttcgtgt tccccacgga gctcatattt tatgcagatg atcagtcaac 240 

acataagcaa gtgttgacac tgtacaatcc ctatgagttt gccttaaagt tcaaagtttt 300 

gtgtactact ccaaataagt atgttgtcgt tgatgctgca ggtgcagtaa agcctcagtg 360 

ttgtgtggat attgtgattc gtcatcgaga tgttcgatcc tgtcactatg gtgtaataga 420 

caaattccgt ctccaagttt ccgagcaaag ccaaaggaag gctttgggga agaaaagagg 480 

ttgttgctac tcttctccca tcagcaaaag aacaacaaaa ggaagaagag gaaaaaagat 540 

taaaggraca tttaackgaa aktttatttt ttgagcagtc gtttcaacca ggtcttatca 600 

caatggccat acttagaaca tgagcaagga tttcaattga cttctgaagt aaatctgtct 660 

tgaaaatatg aatgtggact gccttttatc tctatttcac tccattaaca tgcaacaaac 720 

tattgaatga tttcaaataa ttgcaaatgt ataatatata ttttaaatta taatttaatt 780 

tgaaggactg cagaacatta ttttacagac agcaaggatg cttctgagtg acacctagga 840 

aattatttga agaaattctt tttatatcta yacctgttgt gtaagaaact ttaaaacatt 900 

kgttattttc tcaccttttt ttctaattca ctttgattgc taggggtcat gtatgcttcg 960 

aagttacagg actaaaagag caaactgacc ggcctaaaac taaaatgaca tttattccct 1020 

agctacaaac atcagcgtta ttatgttaat tataccttgc cctctatcat tataaatggt 1080 

tgccatggtg tttctaaaaa taagtgtttt accattaatg tgtagagggc aaacaaagca 1140 

taaagtacta agggatcatg cttatcctag ggtctcacag aagagaggac atatttaatt 1200 

aatcttgtga attacagaac aggttgtggt ccagacacca agaatcatag gggttttttt 1260 

ttaaaaaacc taatagaagt agggtgacct ctctctttgg tctaagagtt ctaaaggaag 1320 

gtaggcatct gtttaattag ttggttcacc ctggctttac ctctggttaa tgctttgtgt 1380 

taataggaag gaaaaatcac tttatctttt cttccaagcc cctccctgcc tgacttaccc 1440 

agactgggat taccagatac caggtgattt atgtggagat gatttttcac ctttaaactc 1500 

taagccaagt gtaagaaact cttgatagct atgtctattt tatatcagtc actgagactt 1560 

ttttttaagt ttttatttat tattaagaca actttgccaa aaaagtcccc taagcacaac 1620 

tatttacatt tctttatagc ctcttctgat ctctaacaca tatgcagttt taactgttat 1680 

tttcatagta actgatcttt tgtctaagga tttttacctg aaagcacaat gtattgagtc 1740 

tcttgaaaat catctttcag atctttttac agaatgaact tatgcactgc tactgtagta 1800 

ttctcaagga atatatgtaa acacaaatgt atgcctgagg ttggtttttg cagaaaacag 1860 

tctctgcttc taaaaacttc tatgtctagt cttccatagg aaatcctcac tgtttaacca 1920 

tgtgaggagc ctaagtcatt aaacggatca tgtctgtaca ttgtgtaatg aatgaaaagc 1980 

acataaatgt aatctacttt gaactttgta aaaatgatgt gtggaggcta ttcttgtttc 2040 

tccatctcaa gtcctgtgtg tgcacgtgtg tgcaagtgca catgtgtgtg tgtaataaca 2100 

cattgtaaag aacagaaatt actttaaaaa ataaacagaa atggagacct gaaaaaaaaa 2160 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaac tcgagggggg gtcccgtacc caat 2214 

<210> 144 
<211> 813 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (283) . . (283) 

<223> n equals a,t,g, or c 
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<220> 

<221> misc__f eature 
<222> (691) . . (691) 
<223> n equals a,t,g, or c 

<400> 144 

gcaggaattt ctacctatat ttccttcctt actgtgttgt gtgtgtttgt atgtgtgtgt 60 

gtttaatttg tagcatttgc cagtttctgt ggtgtaaata ctcccactac agctgttttc 120 

aagctaacat tgtgatacca caaaaaatgg aattgggaag gcacaatcaa gattaacaag 180 

ccagtttaga ccaagacaat ttttctgccc tattagttgg gcacaagtta gaaaggctga 240 

tagtatctca tgttggaaag gtgagagaga aggccctcat gcnttattaa tggcatatgc 300 

attgaagtgc cctgtttgag ggcatgctgt tagtaacttt taaaatatga gatgtcatac 360 

tttttgactg aaaatacaac tcttgtagga ttctatttta tagaactact taggtgcata 420 

aatatacaaa aataactgtc attgcagcat tatttgtaat agtggaaaca gaagactttt 480 

cattaataag agaatggtta ggccagatcc agtggctcac acctgtaatc ccaacatttt 540 

gggaatccaa ggcaggagga tcgctttagc ctaggagttg gagaccagca tgggcaacat 600 

aacaagacct tgtctctact aaaaaaaaat aataataatt agtctggcat ggtggcacac 660 

ctgtactccc agctacttgg gaggctgggg ncaaggagga ttgcttgagc ccaggagatt 720 

gaggctgcag tgaactgtga tcacaccact gcacaccagc ctgggtgaca cagcaaaact 780 

ctgtctcaaa aaaaaaaaaa aaaaaaactc gta 813 

<210> 145 

<211> 1739 

<212> DNA 

<213> Homo sapiens 

<400> 145 

ggcacgagag atcctcagga tatctttagc caaaggaaaa gctccgcatt cccacctggg 60 

gggaaagctg gattgccatg ggcacgaagt agtggtgcag agtccctggc catcctgaat 120 
atccagaatg gtgtttctga agttcttctg catgagtttc ttctgccacc tgtgtcaagg . 180 

ctacttcgat ggccccctct acccagagat gtccaatggg actctgcacc actacttcgt 240 

gcccgatggg gactatgagg agaacgatga ccccgagaag tgccagctgc tcttcagggt 300 

gagtgaccac aggcgctgct cccaggggga ggggagccag gttggcagcc tgctgagcct 360 

caccctgcgg gaggagttca ccgtgctggg ccaccaggtg gaggatgctg ggcgcgtgct 420 

ggagggcatc agcaaaagca tctcctacga cctagacggg gaagagagct atggcaagta 480 

cctgcggcgg gagtcccacc agatcgggga tgcctactcc aactcggaca aatccctcac 540 

tgagctggag agcaagttca agcagggcca ggaacaggac agccggcagg agagcaggct 600 

caacgaggac tttctgggaa tgctggtcca caccaggtcc ctgctgaagg agacactgga 660 

catctctgtg gggctcaggg acaaatacga gctgctggcc ctcaccatta ggagccatgg 720 

gacccgacta ggtcggctga aaaatgatta tcttaaagta taggtggaag gatacaaatg 780 

ctagaaagag ggaatcaaat cagccccgtt ttggagggtg ggggacagaa gatggggcta 840 

catttccccc atacctacta tttttttata tcccgatttg cactttgaga atacatctaa 900 

ggtcatcttt caaaagagaa aaattggaca cttgagtgac tttgttttta gttttgtttt 960 

tgtacattat ttatgtgatt gttatggaat tgtcacctgg aaagaacaat tttaagcaat 1020 

gtcatttcta gatgggtttc taattctgca gagacacccg tttcagccac atctaaaaga 1080 

gcacagttta tgtggtgcgg aattaaactt ccccatcctg cagattatgt ggaaataccc 1140 

aaagataata gtgcatagct cctttcagcc tctagccttc actcctgggc tccaaaagct 1200 

atcccagttg cctgtttttc aaatgaggtt caaggtgctg ctttgcatgc ctgccaaccc 1260 

atggaagttg tttcttactt cttttctctc ttatttatta accatggtct gagagttgtt 1320 

tttgttctat gtaacagtat tgccacaaaa ctataggcaa atcgtgtttg cagggagatt 1380 

tctgatgcct ctgtgggtgt gtgtaagtta aagtggccac atttaagaag gccaagcttt 1440 

gtagtggttg cacagtcaca ctgatatgct gatttgctct ttctcattgt atgtctatgc 1500 

tttgtcatca gtgctatagt aaattacaaa gaaataggta gattgtatga acatacccac 1560 

aaatgcctat gatttaggtt accaatgtat tctttctcat ttggggtttt gcttctgtct 1620 

gtctgtttat tggaaacttg tacttcaagt agggggaatc ctaattctaa taactcctta 1680 

gctaagtttt attattcagg caataaacat gttttcatgt aaaaaaaaaa aaaaaaaaa 1739 

<210> 146 
<211> 1677 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> misc__feature 

<222> (1012) . . (1012) 

<223> n equals a,t,g, or c 

<400> 146 

ggtgcagtgg tgccatcaca gttcactgca gccttgacct cccgggctca agcaatcctc 60 

ccacctcagc cacttgagta gctgagacct cagatatgtg ccatcacacc cagctgattt 120 

tttaaaatta attttttgta gagatagggt ctcatatgtt gcccatgctg gtctcaaact 180 

actgggttca aatgatcctc ctgcctcagc cttccaaagt actgggatta caggcatgag 240 

ccaccatgcc gggctgggag gcggaatttt gttcagtcta aagataagct ttttcatagc 300 

tctggctgta gtgggaggga gcagaggagt gaatgattgt cagttgggag ggtgcagagt 360 

gggctcctgc cctagggtgr aggtragggt ggcttaggtg asmcamcaca gaggccctgt 420 

tcagccccac gtcccctccc tgtgctccct cctcctctct cctctcctgc aggcgtggra 480 

ggtatcatca ttcagcagat ttcaccagag gcagtggagg aggcaggtac ctgagccaga 540 

attcagaatg tcttattctc cacttgactc tgccactaac ttgttgtgca actttgggcc 600 

tttccccagg ccttcatttt cttttctttt ctttttcttt yttttttttt gaggcggagt 660 

ctcgctatgt tgcccaggct ggagtgcagt ggcgcagcat catctcggct cactgcaagc 720 

tccaccttct gagttcacgc cattctactg cctcagcctc ccgagtagcc gggactgcag 780 

gcrcccacca ccacgcccgg cttatttttt gtatttttag tagagacagg gtttcaccac 840 

gttagccaag atggtctcga tctcctgacc tcgtgatcca cccgcctggg cttcccaaag 900 

tgctgggatt acaggcgtga gccactgcgc ccggccattt tcttaaatat ctaataaaaa 960 

atatatagca aatgcagttt ttaaactacg acaatatgac cacgcaaaag antattatct 1020 

tccaagactg ctggtccaag gaaaagtcag taataaagtg gaagcattgt agcttatgga 1080 

atgactggtt asatttggga gaagccttag caataatcta gaatctgcat agataataca 1140 

tctgaggatt gggctttgtg gtttacaaag catttttttt tcctcttttg atcccagccg 1200 

tttgtctgga ctgatacaaa gcatttttat tagtttgtct tattcaatcc tcacaccacc 1260 

tcaaatttac agaggatatg gatctggtta atttgtatga ctatgtaacc tcatgtcagt 1320 

ccacagcact gcctggaggt gggtagaggt ggtcctgggc tggaatccca gccccagtgg 1380 

gaccttgagc aagttacttt agctgtctgc acctaaattt cctcactggc aaaacaggaa 1440 

tactggtggt tcacacctgc aattccagca ctttgggagg ctgaggtggg aggattgctt 1500 

gagtccagaa gttcaaaacc agactgggca acatagcaag accatctcta caaaaattaa 1560 

ataaataaaa catttacaag ggttgtggtg aagattaaat gagatcactc acgaaaaagc 1620 

tcagcagacc ctgatgtgca gtaggtgctc aataaatgtt agccagcaaa aaaaaag 1677 

<210> 147 
<211> 2648 
<212> DNA 

<213> Homo sapiens 
<400> 147 

ggcacgagct tgtaggtact cattgaggtt tattgtgtaa gatgaatgaa tgttgcaaat 60 

tcctaaacat gtgattcaga tgcccaatct tactctgtta ctttatgaaa attttttaaa 120 

gctatatgat gttatatcaa aatatgttgt tatactttag gataatcggt gtgttagccc 180 

tgaatttcag cataagtccc atttttttcc atgggagtct aggaaagcta tatgtttatt 240 

cagcagcaaa atacagtttg gaacttaaat aaactattga tcaatttctg gtcttatgct 300 

agaaggaata aagcatcaag aaaaagaaaa gatttgctgt caagaccagg aaaatttgac 360 

aatagagtat tagaatgcag gaaatgaggg gaagtggaaa ggcagcaagt aggagagaaa 420 

aagtgcaggg acagtagaaa gtgaatgtag gagctttctg acccatgcac ttcaggaacg 480 

caattcatcc ctaaaatgct gtttgctgtc ttaggttgca agtaaccaaa ttaaaaccag 540 

tttgaaagta gagtgagaca gctgtcatca taagagtcat ttgatctgtt taaaggtggc 600 

tgcttgtatg cagggaccaa cagtcatgtc cagggcagca gctggtgcac acttcaagca 660 

cagaccataa gagctacccc aggcagcacc tgctaccaat agtgcaaaca actcagagag 720 

acctcgttgg cataagggaa tact etc tec tttctgagta aagagcaagt agaactaaag 780 

gtttcacatt ttaaacatac tttacattcc tcctcttctg gggctcaagc ctacttttgg 840 

gecaaagegg atgttatatc tgacatagag tcctcggagc agcagttgtt cctgaaagtt 900 

ectttttgea tctttgtgcc teatgeagtg gcttacaggt caaccagact tctcccctga 960 

cttttgatgt gtaagagctt gtgtttcaaa tgggtttggt tttcttaatg tcaccctagg 1020 

ttggtggaaa ggagagtaaa tggaaatggg gggagcaggg tcccctgggg aggtttaaac 1080 
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agatggaagt caattgtctc ttgagaatag aggaggctat tgagttttca ttccacactc 1140 

tgctcctgtt ctgtcagcaa agaacaagga ctactctcca gcaattgctt tccactggac 1200 

tcccccaccc cggctcccac aaaaacctag ggatcaactt agttcactcc aaattagaaa 1260 

atttaatagt catttgtttc ttcttgtcca cagggagaac cattttcttt ccttctttca 1320 

aaattgccca ggtcttgtga agggttatta acaccagaaa gaaatacatt ttaataagct 1380 

taaatctcat ttctacatga aaccatcaga ttttagtact gcaatatttt gatccctctg 1440 

tcttttaggc tctgacacca aaattgccat aatgaaggtg tttcacttct tctcatttat 1500 

ttttatggga tcttttattc ccaaatgcct tttcatccca gccaaaggga gaaatgttga 1560 

tagatctgcc atcaagaagg ttccaaagct ggcctgtcag gttttctgtt tccttgttta 1620 

ttatctttga acttttgttt taaatgtttt aaacacttat ttaccatgta actaaatgcc 1680 

tgatagcatt gaaagtactt tatgggtttt aatttattta atgctcatga aaccctatga 1740 

ggtaggtact gatattattt ttattttact gatgaggaaa gtgaagcaaa gagaagtgaa 1800 

atgaaaggta gtgagtgatg ggaccagggt ttggacatgg gcagtctggc tctaaaatgt 1860 

atgcttttaa ctactatgta atgctgcctc accaacaact tgtctcacaa attgatattc 1920 

tggatcagag gatgtcgact ggcctgcaaa tgtattttgt atggctcata cacagttcag 1980 

aagttttaaa aatttacata gaaatctgca tttcctgact tcttttgaaa atgggaatac 2040 

caaacatcat taggcttgaa ttcccaatac ggcaacaaca gctgagcaac aagcagctgt 2100 

ttagactagg caccttccgt tcattccagc ccacaatgca gatcatagta tcgacttaaa 2160 

tttcctgcct gccttagaga agcttctgag cttgtgacct ctattctagc tgctctatga 2220 

atggacgctg ccccagtaca gcgaggacct gctgcaaaat gcatttctta gtcttcaata 2280 

cttattcctc cttgtaactg gatttctggt aagttatgtc tcatggtgga tctgccccaa 2340 

agatggagac tgaatggcag tgagtcactc gccctggcct ccattgttct ggagaaggtt 2400 

ccagccacat ggttgatgtc agctggtttt ccagagccag agctgggttg caggacagac 2460 

acacctgcat ctaatagtga aaggcaaagt tgaaaggcca agaccagcct gaggtctgag 2520 

ggaccaaggg cttcacagag gccagaagtt cagaggtgga cataaaaggt gttaggagaa 2580 

taaggaagtg aaaagaacat agtacagtgt atcagaggag gagctccagg ctggcaaata 2640 

tcactccc 2648 

<210> 148 
<211> 1084 
<212> DNA 

<213> Homo sapiens 
<400> 148 

ggcacgagcc accatgcccg gcctagatta aaaatttgaa gacatattct ctactatgag 60 

ccaatgaaat tactcatttt gtttctatcc catttgctgt cccttgcttt tggaattttg 120 

tgtcttagtg tgactgtgat tctttctctc cttttgtctt tcagcaaacg gggattcagc 180 

gtccgatcct ttggaacagg gactcacgtg aagcttccag gaccagctcc cgacaagccc 240 

aatgtttatg atttcaaaac cacatatgac cagatgtaca atgatcttct taggaaagac 300 

aaagaactct atacacagaa tgggatttta catatgctgg acagaaataa gagaatcaag 360 

ccccggccag aaagattcca gaactgcaaa gacctgtttg atctgatcct cacttgcgaa 420 

gagagagtgt atgaccaggt ggtggaagat ctgaattcca gagaacagga gacctgccag 480 

cccgtgcacg tggtcaatgt ggacatccag gacaaccacg aggaggccac cctgggggcg 540 

tttctcatct gtgagctctg ccagtgtatc cagcacacgg aagacatgga gaacgagatc 600 

gacgagctgc tgcaggagtt cgaggagaag agtggccgca cctttctgca caccgtctgc 660 

ttctactgag cccagcgccc gcatggagcc gcctctggag cttcctgttg ttcatacttt 720 

ttccttcctg acatttgttt ttacttacag gtgttctgct ggtgacggta gcattaccca 780 

aataaactgt gcatatgaaa tgggagagga gatgccaaaa cgccagatga aagcaatcaa 840 

gtttcttctt ttccactttt acttatgagc gggatattga ttacaaagtt tttcttcttt 900 

aaccaaaaag gaaagacaac ggtttgtgtg cacttcccga catacctgtg tcttcgtgtg 960 

cctgccttcc ctccctcctc cccaccgggc cggactgtac agagccctgc tgcggcgtgt 1020 

taggaatgac ctggaattgt caataaacag atgctgctgt caaaaaaaaa aaaaaaaaaa 1080 

aaaa U 1084 

<210> 149 
<211> 2072 
<212> DNA 

<213> Homo sapiens 
<400> 149 

cccacgcgtc cggcgtttta cgcaggctgt ggcagcgacg cggtccccag cctgggtaaa 60 
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gatggcccca tggcccccga agggcctagt cccagctgtg ctctggggcc tcagcctctt 120 

cctcaacctc ccaggaccta tctggctcca gccctctcca cctccccagt cttctccccc 180 

gcctcagccc catccgtgtc atacctgccg gggactggtt gacagcttta acaagggcct 240 

ggagagaacc atccgggaca actttggagg tggaaacact gcctgggagg aagagaattt 300 

gtccaaatac aaagacagtg agacccgcct ggtagaggtg ctggagggtg tgtgcagcaa 360 

gtcagacttc gagtgccacc gcctgctgga gctgagtgag gagctggtgg agagctggtg 420 

gtttcacaag cagcaggagg ccccggacct cttccagtgg ctgtgctcag attccctgaa 480 

gctctgctgc cccgcaggca ccttcgggcc ctcctgcctt ccctgtcctg ggggaacaga 540 

gaggccctgc ggtggctacg ggcagtgtga aggagaaggg acacgagggg gcagcgggca 600 

ctgtgactgc caagccggct acgggggtga ggcctgtggc cagtgtggcc ttggctactt 660 

tgaggcagaa cgcaacgcca gccatctggt atgttcggct tgttttggcc cctgtgcccg 720 

atgctcagga cctgaggaat caaactgttt gcaatgcaag aagggctggg ccctgcatca 780 

cctcaagtgt gtagacattg atgagtgtgg cacagaggga gccaactgtg gagctgacca 840 

attctgcgtg aacactgagg gctcctatga gtgccgagac tgtgccaagg cctgcctagg 900 

ctgcatgggg gcagggccag gtcgctgtaa gaagtgtagc cctggctatc agcaggtggg 960 

ctccaagtgt ctcgatgtgg atgagtgtga gacagaggtg tgtccgggag agaacaagca 1020 

gtgtgaaaac accgagggcg gttatcgctg catctgtgcc gagggctaca agcagatgga 1080 

aggcatctgt gtgaaggagc agatcccaga gtcagcaggc ttcttctcag agatgacaga 1140 

agacgagttg gtggtgctgc agcagatgtt ctttggcatc atcatctgtg cactggccac 1200 

gctggctgct aagggcgact tggtgttcac cgccatcttc attggggctg tggcggccat 1260 

gactggctac tggttgtcag agcgcagtga ccgtgtgctg gagggcttca tcaagggcag 1320 

ataatcgcgg ccaccacctg taggacctcc tcccacccac gctgccccca gagcttgggc 1380 

tgccctcctg ctggacactc aggacagctt ggtttatttt tgagagtggg gtaagcaccc 1440 

ctacctgcct tacagagcag cccaggtacc caggcccggg cagacaaggc ccctggggta 1500 

aaaagtagcc ctgaaggtgg ataccatgag ctcttcacct ggcggggact ggcaggcttc 1560 

acaatgtgtg aatttcaaaa gtttttcctt aatggtggct gctagagctt tggcccctgc 1620 

ttaggattag gtggtcctca caggggtggg gccatcacag ctccctcctg ccagctgcat 1680 

gctgccagtt cctgttctgt gttcaccaca tccccacacc ccattgccac ttatttattc 1740 

atctcaggaa ataaagaaag gtcttggaaa gttaaaaggc atcagtctta ctacctgtcc 1800 

caccaccccc accttaggga aatgtcctag aatcctggga aattgagggc ttctttgatg 1860 

gtgagtggag aaaagataga ggagaaggtt gcccctgaag tgctgttagg agaaggagga 1920 

tagaggaatc agccttagga gggttccatg ccagctgtca tttggcaaag gaccctggac 1980 

agatgacttt tgcctctgaa cttcactctt ctctttcctc aaatgggctt cataatgctt 2040 

tccactcagg cttaacatga gaattaaatg ag 2072 

<210> 150 

<211> 1251 

<212> DNA 

<213> Homo sapiens 

<400> 150 

gacccacgcg tccgagcaaa cccaggaagg tgtggcgtcc ccgcttcgcg ccaagatggt 60 

gctggtgctg cgccatcctt tgtgtgcccg ggaaagggcg ttccgggagc cgggtcgggg 120 

gctcctgact cgcactgggc agcatgacgg tgcgccggct gtcactgctg tgccgggacc 180 

tctgggcgct gtggctgctg ctgaaggccg gcgcagtacg tggggcgcgg gcaggtcctc 240 

gcctccccgg aaggtgttgt ggggcgacat gcggggacgc cgggcggggg tggacgttct 300 

gggcccagcc ctgtcctcag aggctgctgg ggcagaagcc cggggctggg ggatgccggg 360 

gatgggtgtt ggggtgggtg cctccgagac cagaggagcc ctgttccttg gcagggaagg 420 

tgtgcacggg ccttgcccga tggatggttt agggccatgg ccctggggtc cctggtgagc 480 

agtggggccg cctctgccct tggcctgtga gggactgtct gtgctggtcc cagaaggctg 540 

ggatcacctt tccactggct cctttgttcg aggtttttca tagacaggct atgtggacaa 600 

atgagggcag cgcccacgtc tggctggtgg aggggctgcg gctcctcctt ggaggggacg 660 

cctggccact gctgtcccca caatggggcc acccgtggtg caaggcgtga caagctgccc 720 

tctctaggta agcaggactt gggaggcccc tggccaagcc tgtggacccg gctgggcggc 780 

ctctgtggtc tcaggtttgg gtgtgtttgg tctggtcagg gctcaggggc tgctggtcca 840 

cactggcccc atcctgacaa ttggagcttt ggggcaaggt ccctggagaa ggggtcacgt 900 

cgggaggaaa cagcctgggt tttgttgatg cttttctaag aatggagtac tcgttttcaa 960 

gagatttgtc ctaattatat tttccagcgg gtacttatgc caagtattga tgaataattc 1020 

ataaaataag catctttgtg aattttagtg aatcagacct taactatcaa cggcaatgaa 1080 

tgaacatcta aagtttccaa ttttaaagta aagaactggc tgggtacagc agttcacgcc 1140 

tgtaatccca gcactttggg aggccaaggc tagaggatcg cttgagccca ggagtttgag 1200 
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atcagcctgg gcaacatacc aagacctcat ctgttaaaaa aaaaaaaaaa a 1251 

<210> 151 
<211> 1539 
<212> DNA 

<213> Homo sapiens 
<400> 151 

cgatggcccc gcggccgctc tagaaagtcc cgtttttttt tttttttttt tttttttttt 60 

ttttagagta cgttctgcat tttatttytg caggcaacac tttgctcacc agcaagaaca 120 

cagcccragg aagggaccca ataacctttc aaaacscaaa ctgctkcctg cggtgagggc 180 

ccagggtcct ccacggagag gacaggcatc ttcctttccc accaggaagg agtcagcccg 240 

gagcctctgc tatgtgcaag gcggtgtgca agcaccggct gcrgctyttt gctgtctctt 300 

ctttctcttt ggggctgggc tgggtgtgcg ttctggtgct gatgctttgg cctgtgaggc 360 

tgagcttggc acctcgaccc gttcaattac agcaacgaag aagccactgc tgagtgtggt 420 

ctcaggggag gcccggaggc agtgctcggc acccgggaac gtgctcaggc ctcggtgggg 480 

ccaggcaggc agggcgggag ctagcctgaa ggcgcccggg ttctgctgca gcgcatctcg 540 

caccacgtct tcattctcct cctggcagag ggagcacgtg gagtagacga gccgctgcag 600 

ggaagggaaa gtgagcgcgt ggcacagggc tcgctgctgg aaccctgcca gggcatgcag 660 

acgcaccggg ctaggtgtgc ctgccccggg ctcctccagc tgtctgctcg gcatacccga 720 

gccactgcag gaaggatcca gcaggayrta gtggacctca ygrtagcgyg gatcyraggg 780 

ggagaccgcc aggaagtcct cctcagccag ytcacagcar gagacgccag cccrggccag 840 

cagcgtggcc atggatgcca gccgcttggc atccaggtca aaggcaaaga tcttcccttg 900 

gttcttcaga agagcagcca agtgactggt cttattgcct ggggcggcac aggcatcgat 960 

gacatgggag cctggcgggg ggtccagcag catggctggg agacagctgg ccctgtcctg 1020 

cagaatgagg tgtccggccc ggtacagtgg gtgttcatgc agatctgtct gggcgggaaa 1080 

caccagcagc tccggcatca aggggtccag gagaaaatgc ttccccttga gggctcgtaa 1140 

gtcatcgagg ctggaagccc gaccctgata ggagaaacct tgtctcttga aataatcaac 1200 

tacatcatcg gagcaggtct tgagagtgtt cacacgcaca aatcgaggca gctgggaggc 1260 

tggaccaggc ctggatccca cttccaacag gtcctcattc cggctcacac cccgatgaac 1320 

cttgagccga gccaactcag ccttgagcct cgcctggtgc cggcccaaca gagccttcca 1380 

tcggccccca ccccctcgaa agccctttcc caacaacaac tcatacacta gcaccttggc 1440 

caggtgcggc cgctctagag gatccctcga ggggcccaag cttacgcgtg catgcgacgt 1500 

catagctctc tccctagagt gagtcgaatg aggttcata 1539 

<210> 152 
<211> 2077 
<212> DNA 

<213> Homo sapiens 
<400> 152 

cccacgcgtc cgcggacgcg tgggcggacg cgtgggtagg ccgcgagctt agtcctggga 60 

gccgcctccg tcgccgccgt cagagccgcc ctatcagatt atcttaacaa gaaaaccaac 120 

tggaaaaaaa aatgaaattc cttatcttcg catttttcgg tggtgttcac cttttatccc 180 

tgtgctctgg gaaagctata tgcaagaatg gcatctctaa gaggactttt gaagaaataa 240 

aagaagaaat agccagctgt ggagatgttg ctaaagcaat catcaaccta gctgtttatg 300 

gtaaagccca gaacagatcc tatgagcgat tggcacttct ggttgatact gttggaccca 360 

gactgagtgg ctccaagaac ctagaaaaag ccatccaaat tatgtaccaa aacctgcagc 420 

aagatgggct ggagaaagtt cacctggagc cagtgagaat accccactgg gagaggggag 480 

aagaatcagc tgtgatgctg gagccaagaa ttcataagat agccatcctg ggtcttggca 540 

gcagcattgg gactcctcca gaaggcatta cagcagaagt tctggtggtg acctctttcg 600 

atgaactgca gagaagggcc tcagaagcaa gagggaagat tgttgtttat aaccaacctt 660 

acatcaacta ctcaaggacg gtgcaatacc gaacgcaggg ggcggtggaa gctgccaagg 720 

ttggggcttt ggcatctctc attcgatccg tggcctcctt ctccatctac agtcctcaca 780 

caggtattca ggaataccag gatggcgtgc ccaagattcc aacagcctgt attacggtgg 840 

aagatgcaga aatgatgtca agaatggctt ctcatgggat caaaattgtc attcagctaa 900 

agatgggggc aaagacctac ccagatactg attccttcaa cactgtagca gagatcactg 960 

ggagcaaata tccagaacag gttgtactgg tcagtggaca tctggacagc tgggatgttg 1020 

ggcagggtgc catggatgat ggcggtggag cctttatatc atgggaagca ctctcactta 1080 

ttaaagatct tgggctgcgt ccaaagagga ctctgcggct ggtgctctgg actgcagaag 1140 

aacaaggtgg agttggtgcc ttccagtatt atcagttaca caaggtaaat atttccaact 1200 
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acagtctggt 
gcagtgaaaa 
tcactcaggt 
tgcctggagc 
gagacaccat 
ttgtttctta 
aaagaaacgt 
cttc'acataa 
ttattatctt 
ctccctccca 
atcacctctt 
atctctgatt 
tatttattga 
aggcatttga 
ggaatttttt 



gatggagtct 
ggccagggcc 
cctgagccat 
cagtctactt 
gactgtcatg 
tgttgttgca 
tttcatgctt 
caatttcatc 
tcttgatact 
ccacatagaa 
aaaaacattg 
tttatgtgtt 
aggagtagag 
ctttctaatt 
aatttataga 



gacgcaggaa 
atcatggagg 
ggagaaggga 
gatgacttat 
gatccaaagc 
gacatggaag 
ctggccagga 
caattcatct 
ttccaaattc 
tcaacatatg 
tttccacttt 
catttatgaa 
gaattactca 
tgtatatttt 
aaaaaaaaaa 



ccttcttacc 
aggttatgag 
cagacatcaa 
acaagtattt 
agatgaatgt 
aaatgctgcc 
atcctgggtc 
tcaaagcaca 
tctgattcta 
gtagggatta 
aaaagtaaac 
cattaaatat 
actttagtat 
tatattatgt 
aaaaaaa 



cactgggctg 
cctgctgcag 
cttttggatc 
cttcttccat 
tgctgctgct 
taggtcctag 
tgcaactttg 
actctatttc 
gaaaaaggaa 
cagtgggggc 
acttaataaa 
gaaaatatta 
gctcttaatt 
gaattttaaa 



caattcactg 
cccctcaata 
caagctggag 
cactcccacg 
gtttgggctg 
aaacagtaag 
gaaaactcct 
atgctttctg 
tcattctccc 
atttctttat 
tttttggaag 
tggtttatat 
gaatatatgg 
aatgagcttt 



<210> 153 

<211> 2108 

<212> DNA 

<213> Homo sapiens 



<400> 153 

ggcacgaggg 

ccctctacct 

agacgctgca 

acttgactga 

tccacctgat 

aaggggacct 

ccttctaccg 

ccgtggtaca 

gtcctgggat 

cgccaattct 

gttgtcagac 

aggatggaag 

cagaagatca 

aacagagccc 

cattgaatcc 

ggcctctgcc 

ggatgccaga 

tgagagtcct 

ctgggaccac 

ccaccccaat 

ttttacaagt 

tatataaagt 

ttctgctgtc 

gagaactaat 

gaggacattg 

gtgggatgtg 

tataatgcac 

ctgagtttga 

tgtcccttct 

tctttagctg 

atgttgaaat 

agate aggag 

agagctaact 

caaaaaacag 

tccagaacta 

aaaaaaaa 



agacctaaac 
ttcccttggt 
gtgtgaggga 
tgcaagggaa 
tgtttcctat 
gctggttctg 
agatggctca 
aaaggcagac 
cccagaaaca 
cagagctgta 
aaagttgccc 
gatagtgcaa 
ctccgggtca 
ccagctagag 
agctcctcag 
tccgccacca 
tcctcatctg 
cctcggtcac 
aaaggctact 
aaatctgatt 
tgtcccagtg 
gagaattaga 
tagatcagga 
ggaagtggat 
ggcaatgttt 
tgtgctactg 
agggcagtac 
gaaaccccag 
aattattatc 
gtgctatggt 
cctaatcccc 
tgcagagccc 
cacccttcca 
ctgtcgccaa 
tgagaaataa 



acagt caeca 
gtgctctggg 
cctgtctgca 
gctggcttcc 
gactggctga 
cgctgccagg 
gctctgggtc 
agegggcact 
gcatctgttg 
ccctcagctg 
ctgcagaggt 
agcagggggc 
tactggtgtg 
atcagagtgc 
aaatcagctg 
accccatctt 
tatcaccaga 
ctgctcatgg 
gctgaataga 
ctttattttc 
ttttgttaga 
gtttagctat 
atttctatct 
tgaatacagc 
ggagacattt 
gcatccagta 
cccacaacga 
cctaatgaaa 
tctttccagc 
ctgttcttta 
aaggtgatgg 
tcatgattag 
ccatatgagg 
acaccgactc 
aattctgttg 



tgaagctggg 
tggeccagat 
ctgaggagag 
aggtcaaggc 
tcctccaagg 
cctggcaaga 
cccccgggcc 
accactgcag 
tggctatcac 
aaccccaagc 
cagctgcccg 
tctcctcaga 
aggcagccac 
agggtgette 
ctccaggaac 
ctgaggatcc 
tgggecttet 
agttgaggga 
agtaaacagt 
tcttcctgtc 
ataatgtagt 
aattgtgtat 
gttatatcga 
agtctcaact 
tggtcattat 
aatagaagee 
aaaataatct 
ccctaggtgt 
ctcattcagc 
gttctagttt 
cattaagaag 
gattagtgee 
acgtggcaag 
tgtcgttgcc 
tttgtaagct 



ctgtgtcctc 
gctactggct 
cagctgccac 
ctacactttc 
tccagccaag 
ctggccactg 
taacagggaa 
tggcatcttc 
agtccaagaa 
aggaggcccc 
cctcctcttc 
attccagatc 
tgaggacaac 
cagctctgct 
tgctcctgag 
aggcttttct 
tctcaaacac 
attatctggc 
tcatccatga 
ctgeacatat 
taggtgagtg 
tctctcttaa 
ccagaatgtt 

gggggcaatt 

acttgggggg 
aggggtgccg 
ggcccaaaat 
tgggctctgg 
tattcttact 
gtatcccctc 
tgggcctttg 
cttatttaaa 
aagatgacat 
ttgatcttga 
aaaaaaaaaa 



atggcctggg 
gccagttttg 
aeggaggatg 
agtgaaccct 
ccagtttttg 
actcaggtga 
ttctccatca 
cagagccctg 
ctgtttccag 
atgaccctga 
tccttctaca 
cccacagctt 
caagtttgga 
gcacctccca 
gaggcccctg 
tctcctctgg 
atgcaggatg 
caccggaagc 
tctcacttaa 
gcataagtac 
taaataaatt 
cacaacagaa 
gtgatttaaa 
ttgcccccca 
ttgggggatg 
ctaaacatcc 
gtcagttgta 
aatgggactt 
gacataccag 
aaaagecatt 
ggaagtgatt 
aaggccccag 
gtatgagaac 
acttccagcc 
aaaaaaaaaa 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2077 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2108 



<210> 154 
<211> 1146 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> (5) . . (5) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (857) . . (857) 
<223> n equals a,t,g, or c 

<400> 154 

tccancatta tgggatacat tgatgatcca gacaaatatc atcagggttt tgaattgttg 60 

ctgtcagcct tgggtgatcc ctcagaaaga gtagttagtg ctacacatca agtattttta 120 

ccagcttacg ctgcgtggac tacagaactt ggaaatttac agtctcatct tatacttaca 180 

ctactgaaca agattgaaaa acttctcagg gaaggagaac atggactgga tgaacacaaa 240 

ctccacatgt atctttctgc cttgcagtcc ttgatcccat ctctctttgc attagtgcta 300 

cagaatgcac ctttctccag caaagccaag cttcatggtg aagtgccaca gatagaagtg 360 

actaggtttc ctcggcctat gtcgcctctt caagatgtgt ccactattat cggaagtcgt 420 

gagcaattgg cagtgctgct gcaactttat gactaccagc tagaacaaga gggtacaaca 480 

ggctgggaga gtttactgtg ggttgtcaat caattgttgc cacaacttat agaaatagtt 540 

ggcaaaatta atgttacttc aactgcctgt gtccatgaat tctccagatt tttctggcgc 600 

ctttgccgga catttggcaa aatttttaca aacactaagg taaaacctca gttccaggag 660 

attttaagac tatctgaaga aaacattgat tcctcagcag gaaatggggt cctcactaaa 720 

gctacagtcc ccatttatgc aacaggagtc cttacgtgtt atattcagga. agaagaccga 780 

aaactgttag ttggattctt agaagatgta atgacgctgc tttcattatc tcatgctcct 840 

cttgatagcc tgaaggnttc ttttgtggaa ttgggtgcaa accaggccta ccatgagtta 900 

ctattaactg ttttgkggta tggkgtkgkc catacttcag cactcgtgag gtgtactgct 960 

gctagaatgt ttgagctgtt ggtgaagggg gtgaatgaaa ctctggtagc tcagagggtt 1020 

gttcctgctc tcattactct ctccagtgac cctgaaatct ctgtcaggat tgccacaatt 1080 

ccagcctttg gsactattat ggaaacagta attcaaagag agttgctgga aagagtgaaa 1140 

atgcag 1146 

<210> 155 

<211> 1998 

<212> DNA 

<213> Homo sapiens 

<400> 155 

ggcacgagaa caatttccct tgtacataat atacttatgt acttatacca ttgactctgt 60 

aagataaaag tcttagaaat ggggttgcca agtcaaaggg tctatgcatt taacacaggg 120 

aatgagtact gtcacgtggc ttctgaaact gtttacccag tttatgttcc caccaacagt 180 

gtctaattcc catacctgtg ctaggtatta tgtctttaat ttttgtctga ttatttcatt 240 

taattttaat ttccattatc actggtgagg ttgggcttct gttcagtttt tttgtcattt 300 

atgtttcttt tgtgaattgc cttttcctat gctttgtgca tttttctctt ggggtttgtc 360 

tttttaaaat tgatatacag gtgttctcta taatatagat attctgccac tatatgcaaa 420 

tgatcttcca atttatttat ttatttgaaa cagagtttca ctcttgtcac caggttggag 480 

tgcagaggtg cgatcttggc tcactgcaac ctccacctcc caagttcaag cgattttcct 540 

gcctcagcct ccctagtaac tgggattaca ggtgccctgc caccatgccc ggctaatttt 600 

tgtatttttg gtagagacgg ggttttgcca tgttggccag gctggcctcg aactcctgac 660 

ttcaggtgat ctgcccacct cagcctccca aagtgctggg attacaggcc tgagctaccg 720 

caccggcccc aatttatttc ttttaactta gtttatggtg cctttagctg tacaaaagtt 780 

actaattttt agtcaaatat ccagcttttc ctttagggct tcttatatgt cctgatccaa 840 

ggattatttt ttaaaaattc ttccgtattt tctcctataa tattcatagg ttactttaca 900 

attagatttt taatacatct cctaactttt tcatatagta tgaggtatcc atttttaaaa 960 

aatggatatc cagttgtgcc catatgtctt ttccactcat ctgaaatgcc accattatca 1020 

tattttcaat tttcatgtgc acgtgggtct gttttcaaac ttggagattc tcttccactg 1080 

atatatgtct attcttgtgc cagtggttta actgctattg ctttatagta tattttgata 1140 

tcttcttttc aaaaggatac tcttgtgtat tctttttttc atcttgtgta ttctcatcca 1200 
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ttttccagaa aaccacagaa tcaactttaa gtttcatttc attaagtcag attagtgaga 1260 

gctgattttc tgtgtatcat gttcattcag aaacatagtc tatctctcca tttaggggca 1320 

cttgttttat attttctttc taacacgtat tttgttgggt ttattgtacc tttttttttt 1380 

ttaacttcag attcaggggg gtgcacgtgc aggtttgtta cctgagtata cgatatgata 1440 

ctgaggttgg agtatgaatg atgccattac ccaggtactg ggcataatac ccaatagtta 1500 

gtttttcaac ccttgccctt ctccctctct cctccctcta gtagtcccca gtttctaata 1560 

ctgccatctc catgttcatg agaacccagt gtttagctcc cacttataag tgagagcatg 1620 

ttgcatttgg ttttctgttc ctgccatatc ctttgtaggg acatggatga aattggaaaa 1680 

catcattctc agtaaactat cgcaagaaca aaaaaccaaa caccacatat tctcactcat 1740 

aggtgggaat tgacaatggg aacacatgga cacaggaagg ggaacatcac actctgggga 1800 

ctgttgtggg gttggcggag gggggaggga tagcattggg agatatacct aatgctagat 1860 

gacgagttag tgggtgcagt gcagcagcat ggcacatgta tacatatgta actaacctgc 1920 

acaatgtgca catgtaccct aaaacttaaa gtataatata aaaaaaaaaa agaacaaaac 1980 

ttaaaaaaaa aaaaaaaa 1998 

<210> 156 
<211> 970 
<212> DMA 
<213> Homo sapiens 

<400> 156 

ctcgtgccga attcggcacg aggtgcccag gctctcaggg cagagggtcc agtgtgatca 60 

ctttgcatgg cctctctccc ctcctgagct tgtgccaggg ccccagggct gacctggaga 120 

ggaaaawggc agagggtgaa gatggggtgt ctggtttggg gaccatcctg gccccccttg 180 

tcactgttgg catctcttct gcacagtggc attgctggga ggtgcttact gtgcctattc 240 

aaggggctgg cagccgcagc ctcactgcag atcagggact tggcttcccg gttgaccaca 300 

ggtccaagaa cctgcagggt ccagcctccc ccccatcccc agtcttcccc accctggccc 360 

ggccctccag gtgcagaaac atgcaggccc ctctccagga ctgtgggagg agtgtgtccc 420 

tcagactggc ctgtgtcctg gctcctctta ccacctcttc cagaggttgt cacctgcagc 480 

tgccccagga taaaggcaag gccagagagg actcctgaac tcctgtgtgc ctggggtggc 540 

aggggcaaac atagccaact ggtggcctga gcggggccat ggtgargaca cccttggtgg 600 

cttgtcccac atcaagctgg gargtgacac tgaggatgca ttagtctgca gcgtatgata 660 

aaaacggcat ttcaggccag gcgtggtggc tcatgcctgt caccccagca ccttgggagg 720 

ccgaggtggg cagatcacat gaggtcagga ctttgagacc agcctggcca acatggtgaa 780 

aactcatctg tactaaaaaa acaaaaatta tgtgggttgg tggtgtgtgc ctgtaatccc 840 

agctacttgg gaggctgagg caggagaatc acttgaacct gggaggcgga ggctacaacg 900 

agccgagatt gcaccactgc actccagcct gatccgtctc aaaaaaaaaa aaaaaaaaaa 960 

aaaaactcga 970 

<210> 157 

<211> 1782 

<212> DNA 

<213> Homo sapiens 



<400> 157 

tgccgagcct 

tggggatctt 

ccatcctgga 

gcacctatga 

gctcagaccc 

agtaccaggg 

gcaccctgga 

atggcaacca 

ccaagcccac 

ttagccttca 

agactaataa 

cggtgatagc 

agcacagcga 

ctgaggcacc 

gggactggac 

gcctgcctgt 



ctttggtagc 
actgggcctg 
agtgccagag 
ccccctgcaa 
tgtcaccatc 
ccgcctgcat 
gatggatgac 
agtcgtgaga 
agtgacaact 
atgccaggct 
ccaggaaccc 
cgactcaggc 
cattgtgaag 
tacaaccatg 
cactgacatg 
ctttgccatc 



aggaggctgg aagaaaggac agaagtagct 
ctactcctgg ggcacctaac agtggacact 
agtgtaacag gaccttggaa aggggatgtg 
ggctacaccc aagtcttggt gaagtggctg 
tttctacgtg actcttctgg agaccatatc 
gtgagccaca aggttccagg agatgtatcc 
cggagccact acacgtgtga agtcacctgg 
gataagatta ctgagctccg tgtccagaaa 
ggcagcggtt atggcttcac ggtgccccag 
cggggttctc ctcccatcag ttatatttgg 
atcaaagtag caaccctaag taccttactc 
tcctatttct gcactgccaa gggccaggtt 
tttgtggtca aagactcctc aaagctactc 
acatacccct tgaaagcaac atctacagtg 
gatggctacc ttggagagac cagtgctggg 
atcctcatca tctccttgtg ctgtatggtg 



ctggctgtga 
tatggccgtc 
aatcttccct 
gtacaacgtg 
cagcaggcaa 
ctccaattga 
cagactcctg 
ctctctgtct 
ggaatgagga 
tataagcaac 
ttcaagcctg 
ggctctgagc 
aagaccaaga 
aagcagtcct 
ccaggaaaga 
gtttttacca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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tggcctatat catgctctgt cggaagacat cccaacaaga gcatgtctac gaagcagcca 1020 

gggcacatgc cagagaggcc aacgactctg gagaaaccat gagggtggcc atcttcgcaa 1080 

gtggctgctc cagtgatgag ccaacttccc agaatctggg caacaactac tctgatgagc 1140 

cctgcatagg acaggagtac cagatcatcg cccagatcaa tggcaactac gcccgcctgc 1200 

tggacacagt tcctctggat tatgagtttc tggccactga gggcaaaagt gtctgttaaa 1260 

aatgccccat taggccagga tctgctgaca taattgccta gtcagtcctt gccttctgca 1320 

tggccttctt ccctgctacc tctcttcctg gatagcccaa agtgtccgcc taccaacact 1380 

ggagccgctg ggagtcactg gctttgccct ggaatttgcc agatgcatct caagtaagcc 1440 

agctgctgga tttggctctg ggcccttcta gtatctctgc cgggggcttc tggtactcct 1500 

ctctaaatac cagagggaag atgcccatag cactaggact tggtcatcat gcctacagac 1560 

actattcaac tttggcatct tgccaccaga agacccgagg gaggctcagc tctgccagct 1620 

cagaggacca gctatatcca ggatcatttc tctttcttca gggccagaca gcttttaatt 1680 

gaaattgtta tttcacaggc cagggttcag ttctgctcct ccactataag tctaatgttc 1740 

tgactctctc ctggtgctca ataaatatct aatcataaca gc 1782 

<210> 158 
<211> 1205 
<212> DNA 

<213> Homo sapiens 
<400> 158 

ggcagagctt ttgtgcagca ccctttaaag ggtgactcgt cccacttgtg ttctctctcc 60 

tggtgcagag ttgcaagcaa gtttatcgga gtatcgccat gaagttcgtc ccctgcctcc 120 

tgctggtgac cttgtcctgc ctggggactt tgggtcaggc cccgaggcaa aagcaaggaa 180 

gcactgggga ggaattccat ttccagactg gagggagaga ttcctgcact atgcgtccca 240 

gcagcttggg gcaaggtgct ggagaagtct ggcttcgcgt cgactgccgc aacacagacc 300 

agacctactg gtgtgagtac agggggcagc ccagcatgtg ccaggctttc gctgctgacc 360 

ccaaatctta ctggaatcaa gccctgcagg agctgaggcg ccttcaccat gcgtgccagg 420 

gggccccggt gcttaggcca tccgtgtgca gggaggctgg accccaggcc catatgcagc 480 

aggtgacttc cagcctcaag ggcagcccag agcccaacca gcagcctgag gctgggacgc 540 

catctctgag gcccaaggcc acagtgaaac tcacagaagc aacacagctg ggaaaggact 600 

cgatggaaga gctgggaaaa gccaaaccca ccacccgacc cacagccaaa cctacccagc 660 

ctggacccag gcccggaggg aatgaggaag caaagaagaa ggcctgggaa cattgttgga 720 

aacccttcca ggccctgtgc gcctttctca tcagcttctt ccgagggtga caggtgaaag 780 

acccctacag atctgacctc tccctgacag acaaccatct ctttttatat tatgccgctt 840 

tcaatccaac gttctcacac tggaagaaga gagtttctaa tcagatgcaa cggcccaaat 900 

tcttgatctg cagcttctct gaagtttgga aaagaaacct tcctttctgg agtttgcaga 960 

gttcagcaat atgataggga acaggtgctg atgggcccaa gagtgacaag catacacaac 1020 

tacttattat ctgtagaagt tttgctttgt tgatctgagc cttctatgaa agtttaaata 1080 

tgtaacgcat tcatgaattt ccagtgttca gtaaatagca gctatgtgtg tgcaaaataa 1140 

aagaatgatt tcagaaaaaa aaaaaaaaaa aaactcgggg ggggccggta cccattygcc 1200 

ccaag 1205 

<210> 159 
<211> 809 
<212> DNA 

<213> Homo sapiens 
<400> 159 

ggcacgagga gaatcatggg cctctggctg ggcatgctgg cctgtgtctt cctggcaact 60 

gctgcctttg ttgcttatac tgcccggctg gactggaagc ttgctgcaga ggaggctaag 120 

aaacattcag gccggcagca gcagcagaga gcagagagca ctgcaaccag acctgggcct 180 

gagaaagcag tcctatcttc agtggctaca ggcagttccc ctggcattac cttgacaacg 240 

tattcaaggt ctgagtgcca cgtggacttc ttcaggactc cagaggaggc ccacgccctt 300 

tcagctccta ccagcagact atcagtgaaa cagctggtca tccgccgtgg ggctgctctg 360 

ggggcggcgt cagccacact gatggtgggg ctcacggtca ggatcctagc caccaggcac 420 

tagcaaagaa gcttggaaat agaaagccag gagtggctgt ccccagtatg caaacacacc 480 

acggtctgcc ctgcaaaaac accaatgggg tctagtgcag gtggacactt tgaaccactc 540 

ctcaaaaaaa gaactttggc tgattccttg tggtgacact cagaggggtc tgaacagact 600 

tgacaattct gttctggtca agctggagtt ttcttctgtg acttggactg ctctacagaa 660 

gacatcagcc aactgcacga gtcagagtcc agggattgtc actattatta ataatgtaaa 720 
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tggcttcaaa tgggacactg cagataaaat cacaaaaacc actgttatat taaagattac 780 

acatttcctg gaaaaaaaaa aaaaaaaaa 809 

<210> 160 
<211> 1151 
<212> DWA 

<213> Homo sapiens 
<400> 160 

ggcacgagtg tcaatgaaag tgtttctaat gcaactgcga ttgactccca gatagctaga 60 

agtttgcaca tcccactcac ccaggatata gctggtgacc caagctatga aattagcaaa 120 

cagagactca gtattgtcat tggcgtggtt gctggcatta tgacggtgat tctaatcatc 180 

ttaattgtag tgatggcaag gtactgcagg tccaaaaata aaaatggcta tgaagccggc 240 

aaaaaagatc acgaagactt ttttacaccc caacagcatg acaaatctaa aaagcctaaa 300 

aaggacaaga aaaacaaaaa atctaagcag cctctctaca gcagcattgt cactgtggag 360 

gcttctaagc caaatggaca gaggtatgat agtgtcaatg agaagctgtc agacagccca 420 

agcatggggc gatacaggtc cgttaatggt gggcccggca gtcctgacct ggcaaggcat 480 

tacaaatcta gttccccatt gcctactgtt cagcttcatc cccagtcacc aactgcagga 540 

aaaaaacacc aggccgtaca agatctacca ccagccaaca catttgtggg agcaggagac 600 

aacatttcaa ttggatcaga tcactgctct gagtacagct gtcaaaccaa taacaagtac 660 

agcaaacaga tgcgtctaca tccatacatt actgtgtttg gctgaattcc actctaatat 720 

gatgctccat tatgcaccat actgtgatga cctttctact ccgaaacctg ctggagcctg 780 

cccttggccg tggggtgtca gccaatcact gcttgttcca cttgttgtac attttatttt 840 

tgagtctttt tctttctcat atacagaaaa atagtatgaa aataaaataa atgtatgaaa 900 

cagtattaat gcagaaatgt gctactaatg gatgtctgag tcaccagaaa ttccattctt 960 

aaagaggcgg ttagcaccta ttagacgtaa cagtgatgtc ttttaaaaaa tccaaaagca 1020 

tattgcaaca ataagtttga gactttgtgt gaacaaaggg aaattcagcc tcttatgtct 1080 

ttgtctttaa tacattaaat actgattttg aataaaaatc taaattgatc aataaaaaaa 1140 

aaaaaaaaaa a 1151 

<210> 161 
<211> 1303 
<212> DNA 

<213> Homo sapiens 
<400> 161 

aggttaaatg cgtacttttc taacctttgt tattttgaaa gttattctga tattcctatc 60 

cagttgtgcc tcatttacta gaaatttgct cacatggcca aatgatgtat ccacagaaca 120 

atttgaaact agaccttttg gaagcgaact cctacaaact gtcatcaatg ttagcagaac 180 

ttgagcaaag acctcaaccc agccatcctt gtagtaattc catcttcagg tggagggaaa 240 

aggtaacatt taaggagact ggttgtaatt tcttgattgg gcctgctggg tggagtggct 300 

taaagtagca tcagggcaaa aaaggtgtta ggaattctat gtgatattaa tattcatgca 360 

gttagttaag aagataaatg ttttwatttt tcttttgagc acaataacaa gagctagaca 420 

aaaccgaata cattctgtgt acaccaaact tctatgagaa gctaaaaaac acttttgatt 480 

tcttctttct catcatacct gaatttcatc ctttggatgt gcttttacag taaaatttct 540 

attaaattga aattttaata ttcgttcaga cctaaattat aagattttgt ggtatgtatt 600 

agtctcatct gtttaagatg gtgcctaatg cagataatgc atcagtacag ctctgaaatg 660 

cttgtagcta tttttattac tgatcagaag ggggaactgt aatcatcttg tgaagggaca 720 

gttttctaag gctcaagagc tcgaaaacaa tctcaatcat ttacagggtt gtgatcattt 780 

cacttgcatt aagccaacta aagttgtatt tgtaaaagta atgctatgaa tattactatt 840 

tgacctagac acataggtta gaattggaaa cacaggctat aaagtatagt aattgtgtaa 900 

ttgtgaaaat attaaggctt caactcaaaa ctgaaacaca gtagggctta gaaatctttg 960 

aattatttat acccctcagt ttaaaaactt ccagtccagg cgcagtggct catgcctgta 1020 

atcccagaac tttgggaggc caaggcaggc ggatcacctg aggtcaggag ttcgagagca 1080 

gcctggctga cacggtgaaa ccccgtctct actaagaata caaaaattag ccaggcatgg 1140 

tggtgggcac ctgtaatccc agctacgggg gaggctgagg caggagaatc acttgaaccc 1200 

gggaggtgga ggttgtagtg ggccaagatc atgccactgc actccagcct gggtgaacag 1260 

ggcaagactc tgtctaaaaa aaaaaaaaaa aaaaactcgt agg 1303 

<210> 162 
<211> 4412 
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<212> DNA 

<213> Homo sapiens 

<400> 162 

aacattagat ctcaatgaaa accagaatgg aaccctttca ctatcataaa ctcatttata 60 

aaagtgccca tgatgaatag caagaagtac ccagtggccc atctcattga ccaaacttag 120 

aaagccaagg tggggcatct gcagctctcc cacaatctga gtttgtgttg atccttgtac 180 

cccacaacct gaaacatctt cttatatata tcgagcgagc tctcagccct tctgttttca 240 

aggccatcat ggagaaactg gagatgtcca agttccagcc cactctccta acactacccc 300 

gcatcaaaga gactaagcca gactatgggg gaaagggaga taagaaggat cctggaactt 360 

taaagaggga aagagtgaga ttcagaaatc gccaggactg gactttaagg gacgtcctgt 420 

gtcagcacaa gggactggca cacacagaca cacgagaccg aggagaaact gcagacaaat 480 

ggagatacaa agacttagaa ggacagctcc tttcacctca tcctacttgt ccagaaggta 540 

aaaagacaca gccagaaaga aaaggcatcg gctcagctct cagatcagga caggctgtgg 600 

atctgtggcg gtactctgaa agctggagct gcagcacacc ccttttgtat tgctcaccct 660 

cggtaaagag agagagggct gggaggaaaa gtagttcatc taggaaactg tcctgggaac 720 

caaacttctg atttcttttg caaccctctg cattccatct ctatgagcca ccattggatt 780 

acacaatgac atggagaatg ggaccccgtt tcactatgct gttggccatg tggctagtgt 840 

gtggatcaga accccacccc catgccacta ttagaggcag ccacggagga cggaaagtgc 900 

ctttggtttc tccggacagc agtaggccag ctcggtttct gaggcacact gggaggtctc 960 

gcggaattga gagatccact ctggaggaac caaaccttca gcctctccag agaaggagga 1020 

gtgtgcccgt gttgagacta gctcgcccaa cagagccgcc agcccgctcg gacatcaatg 1080 

gggccgccgt gagacctgag caaagaccag cagccagggg ctctccgcgt gagatgatca 1140 

gagatgaggg gtcctcagct cggtcaagaa tgttgcgttt cccttcgggg tccagctctc 1200 

ccaacatcct tgccagcttt gcagggaaga acagagtatg ggtcatctca gcccctcatg 1260 

cctcggaagg ctactaccgc ctcatgatga gcctgctgaa ggacgatgtg tactgtgagc 1320 

tggcggagag gcacatccaa cagattgtgc tcttccacca ggcaggtgag gaaggaggca 1380 

aggtgagaag gatcaccagc gagggccaga tcctggagca gcccctggac cctagcctca 1440 

tccctaagct gatgagcttc ctgaagctgg agaagggcaa gtttggcatg gtgctgctga 1500 

agaagacgct gcaggtggag gagcgctatc catatcccgt fcaggctggaa gccatgtacg 1560 

aggtcatcga ccaaggcccc atccgtagga tcgagaagat caggcagaag ggctttgtcc 1620 

agaaatgtaa ggcctctggt gtagagggcc aggtggtggc ggaggggaat gacggtggag 1680 

ggggagcagg aaggccaagc ctgggcagcg agaagaagaa agaggaccca aggagagcac 1740 

aagtcccacc aaccagagag agtcgggtga aggtcctgag aaaactggcc gccactgcac 1800 

cagctttgcc ccaacctccc tcaaccccca gagccaccac ccttcctcct gccccagcca 1860 

caacagtgac tcggtccacg tcccgggcgg taacagttgc tgcaagacct atgaccacca 1920 

ctgcctttcc caccacgcag aggccctgga ccccctcacc ctcccacagg ccccctacaa 1980 

ccactgaggt gatcactgcc aggagaccct cagtttcaga gaatctttac cctccatccc 2040 

ggaaggatca gcacagggag aggccacaga caaccaggag gcccagcaag gccaccagct 2100 

tggagagctt cacaaatgcc cctcccacca ccatctcaga acccagcaca agggctgctg 2160 

gcccaggccg tttccgggac aaccgcatgg acaggcggga acatggccac cgagacccaa 2220 

atgtggtgcc aggtcctccc aagccagcaa aggagaaacc tcccaaaaag aaggcccagg 2280 

acaaaattct tagtaatgag tatgaggaga agtatgacct cagccggcct actgcctctc 2340 

agctggagga cgagctgcag gtggggaatg ttccccttaa aaaagcaaag gagtctaaaa 2400 

agcatgaaaa gcttgagaaa ccagagaagg agaagaaaaa aaagatgaag aatgagaacg 2460 

cagacaagtt acttaagagt gaaaagcaaa tgaagaagtc tgagaaaaag agcaagcaag 2520 

agaaagagaa gagcaagaag aaaaaaggag gtaaaacaga acaggatggc tatcagaaac 2580 

ccaccaacaa acacttcacg cagagtccca agaagtcagt ggccgacctg ctggggtcct 2640 

ttgaaggcaa acgaagactc cttctgatca ctgctcccaa ggctgagaac aatatgtatg 2700 

tgcaacaacg tgatgaatat ctggaaagtt tctgcaagat ggctaccagg aaaatctctg 2760 

tgatcaccat cttcggccct gtcaacaaca gcaccatgaa aatcgaccac tttcagctag 2820 

ataatgagaa gcccatgcga gtggtggatg atgaagactt ggtagaccag cgtctcatca 2880 

gcgagctgag gaaagagtac ggaatgacct acaatgactt cttcatggtg ctaacagatg 2940 

tggatctgag agtcaagcaa tactatgagg taccaataac aatgaagtct gtgtttgatc 3000 

tgatcgatac tttccagtcc cgaatcaaag atatggagaa gcagaagaag gagggcattg 3060 

tttgcaaaga ggacaaaaag cagtccctgg agaacttcct atccaggttc cggfcggagga 3120 

ggaggttgct ggtgatctct gctcctaacg atgaagactg ggcctattca cagcagctct 3180 

ctgccctcag tggtcaggcg tgcaattttg gtctgcgcca cataaccatt ctgaagcttt 3240 

taggcgttgg agaggaagtt gggggagtgt tagaactgtt cccaattaat gggagctctg 3300 

ttgttgagcg agaagacgta ccagcccatt tggtgaaaga cattcgtaac tattttcaag 3360 

tgagcccgga gtacttctcc atgcttctag tcggaaaaga cggaaatgtc aaatcctggt 3420 
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atccttcccc aatgtggtcc atggtgattg tgtacgattt aattgattcg atgcaacttc 3480 

ggagacagga aatggcgatt cagcagtcac tggggatgcg ctgcccagaa gatgagtatg 3540 

caggctatgg ttaccatagt taccaccaag gataccagga tggttaccag gatgactacc 3600 

gtcatcatga gagttatcac catggatacc cttactgagc agaaatatgt aaccttagac 3660 

tcagccagtt tcctctgcag ctgctaaaac tacatgtggc cagctccatt cttccacact 3720 

gcgtactaca tttcctgcct ttttctttca gtgtttttct aagactaaat aaatagcaaa 3780 

ctttcaccta ttcatgagtt attattgaaa cctcaaatca taaagacatt taaaagaatt 3840 

gtttttctaa ctggaggggc tctagtgcta aataatagta ctgaaaattg atattatttt 3900 

ccttttctta tatgaaggac cttatttggc atataaaatt ttataaaata tgtatttaaa 3960 

gctttttctt attttttgta ttaattggta agtgaaaact ctgttaaaga tcacaccaca 4020 

atgttttcaa gaaacatctg aaaagataaa acaaagaaca aataacttat aatacttact 4080 

taaattgaca ctttttgaaa tgccagtctg aaaataatta agatatctct gctttgtatg 4140 

agtttctttt atgaaacttg ataccacggg agtccagtaa tattggccac aaaagccaga 4200 

gaaagtacca agcccagctt tgttatcata gccacttcct gccctgcttc tgttattttt 4260 

agtgtttttt cagatataaa tcggggtcca ggaaatcctc accagaatct ggcactgcag 4320 

ccaaaggcga tacttccaga gttctagtag gctgctatgg aatttctggc atgaaaattc 4380 

ttgacccctc acactttacc ccctgtacag ca 4412 

<210> 163 

<211> 1907 

<212> DNA 

<213> Homo sapiens 

<400> 163 

ggcacagggg aatcatcgtg tgatgtgtgt gctgcctttg tgagtgtgtg gagtcctgct 60 

caggtgttag gtacagtgtg tttgatcgtg gtggcttgag gggaaccctt gttcagagct 120 

gtgactgcgg ctgcactcag agaagctgcc cttggctgct cgtagcgccg ggccttctct 180 

cctcgtcatc atccagagca gccagtgtcc gggaggcaga aggtaccggg gcagctactg 240 

gaggactgtg cgggcctgcc tgggctgccc cctccgccgt ggggccctgt tgctgctgtc 300 

catctatttc tactactccc tcccaaatgc ggtcggcccg cccttcactt ggatgcttgc 360 

cctcctgggc ctctcgcagg cactgaacat cctcctgggc ctcaagggcc tggccccagc 420 

tgagatctct gcagtgtgtg aaaaagggaa tttcaacgtg gcccatgggc tggcatggtc 480 

atattacatc ggatatctgc ggctgatcct gccagagctc caggcccgga ttcgaactta 540 

caatcagcat tacaacaacc tgctacgggg tgcagtgagc cagcggctgt atattctcct 600 

cccattggac tgtggggtgc ctgataacct gagtatggct gaccccaaca ttcgcttcct 660 

ggataaactg ccccagcaga ccggtgaccg tgctggcatc aaggatcggg tttacagcaa 720 

cagcatctat gagcttctgg agaacgggca gcgggcgggc acctgtgtcc tggagtacgc 780 

cacccccttg cagactttgt ttgccatgtc acaatacagt caagctggct ttagcgggga 840 

ggataggctt gagcaggcca aactcttctg ccggacactt gaggacatcc tggcagatgc 900 

ccctgagtct cagaacaact gccgcctcat tgcctaccag gaacctgcag atgacagcag 960 

cttctcgctg tcccaggagg ttctccggca cctgcggcag gaggaaaagg aagaggttac 1020 

tgtgggcagc ttgaagacct cagcggtgcc cagtacctcc acgatgtccc aagagcctga 1080 

gctcctcatc agtggaatgg aaaagcccct ccctctccgc acggatttct cttgagaccc 1140 

agggt caeca ggccagagcc tccagtggtc tccaagcctc tggactgggg gctctcttca 1200 

gtggctgaat gtccagcaga gctatttcct tccacagggg gecttgeagg gaagggtcca 1260 

ggacttgaca tcttaagatg cgtcttgtcc ccttgggcca gtcatttccc ctctctgagc 1320 

ctcggtgtct tcaacctgtg aaatgggatc ataatcactg ccttacctcc ctcacggttg 1380 

ttgtgaggac tgagtgtgtg gaagtttttc ataaactttg gatgetagtg tacttagggg 1440 

gtgtgccagg tgtctttcat ggggccttcc agacccactc cccacccttc tccccttcct 1500 

ttgcccgggg acgccgaact ctctcaatgg tatcaacagg ctccttcgcc ctctggctcc 1560 

tggtcatgtt ccattattgg ggagccccag cagaagaatg gagaggagga ggaggctgag 1620 

tttggggtat tgaatccccc ggctcccacc ctgcagcatc aaggttgcta tggactctcc 1680 

tgccgggcaa etcttgegta atcatgacta tctctaggat tctggcacca cttccttccc 1740 

tggcccctta agectagctg tgtateggea cccccacccc actagagtac tccctctcac 1800 

ttgcggtttc cttatactcc acccctttct caacggtcct tttttaaagc acatctcaga 1860 

ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaggg cggccgc 1907 

<210> 164 
<211> 861 
<212> DNA 
<213> Homo sapiens 
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<400> 164 



gaattcggca cgagcctgag 
agattacata gactgtcaaa 
accacaggac taactgactg 
gtcaccctgt agtttctaat 
cccctccctg ttttcctttc 
atcttggtct ttaatgttca 
tcttgcattt tctttcaact 
tcatcatcat ccaatttatt 
gtctattatt aagtctaaca 
tgaaatgtgc aagcaagagt 
cactttggga ggccaaggtg 
tatggtgaag ccatgtctmw 
tgtcatccca gcaaggcagg 
gggatcgtgc cattgcactc 
aaaaaaaaaa aaaactcgta 



tcaacttgat atccaagctt tttacttcaa ttatctggca 
gtttgtgaaa gtttagcaag aaaactgtct tactcacaga 
aaccacactc caccatttgc ccctatttcc aggcgttatg 
ctgtatagat gtgtagagca tgcctcttcc ctcttccttt 
ctcttgccct ttcttaatgt ctgtytctat tggcttcttg 
tccttaagct tgcttctctc ttcagactac tgattcagcc 
tgggccaaaa aaacaggcaa cattttcttc ctccactacc 
cctttagttt atattaccac aactctccta aacgtcccaa 
acttagcttc gaacctcaat ccaagcatct gacaacacac 
cccwatggcc gggtgcagtg gctcatgcct gtaatcccag 
ggatcacctg aggtcgggag ttcgggacca gcctggccag 
actaaaaata caaaattagc cggacattgt ggtgcacgtc 
cgaatcgctt gaacccggga ggcggaggtt gcggtgagcc 
cagcctggtc aacagagcga gactccgcct cattaaaaaa 

9 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
861 



<210> 165 
<211> 587 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<222> (1) . . (1) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (587) . . (587) 
<223> n equals a,t,g, or c 

<40D> 165 

natttctcac aatgacaaat tctcaaatat tgctaatagt actgtggatt ttcctacatt 60 

ggtaaattga aggaattgct aaatgctgaa ttcagcaacc agtttgagat tgttgaaaat 120 

aaagattgtt tctttttcaa tgcaagttca cagatcactg gagttctagc tacagtttgt 180 

tctagaccag aggttgcaga tatttttgtc ctataaagag acacatggtt aatatttttg 240 

gctttgtgag ttgtatagtt ttccgttgta gctgttcagc tctgctacat gaaagcaacc 300 

atagaccata ccttaacaag tggtcacttt tgagtaccaa taaaacttta tttagaaata 360 

acagagggct ggatttggtc ctagtttgct gaaccctttt ctagatgaag gctcctcttg 420 

ccaagactgg ctccctacct tggctgacaa attctcactt tgggacttag tcattgttgc 480 

tgctctctgt tattttgcat gtcttttctc atgtttaggt gctgtgtctt aatacttttt 540 

tcttacattt aatttaacaa tcattactga gcgctggtat gtctagn 587 

<210> 166 
<211> 477 
<212> DNA 

<213> Homo sapiens 
<400> 166 

gaattcggca cgaggtgagt atggcttttg tcttccatct tgctcagggt actttggaac 60 
cgctatacat tgcaggagct tagcttctgg ttaccatggt ttgcttccag agcaacaagc 120 
ctagtacttc aacatggaga caattatctt ttgtttttgt tttgttttgt ttgttttgtc 180 
ttggccatgc ctttttgagt ttaccttttt atattttgtc catcattgcc atgtgtttgg 240 
agcagtgggc gttccataac atgaactcac tgtaccatca cgaatgggaa gtaaggggaa 300 
accttatcca tgtggatttt actcttccct gattccctaa attgggtttg caaaatacta 360 
ctgtgcactt tcttgatgat tcgggcttat ctttatgact gtctgtkttt gtgtcagact 420 
gtaaagaagt ataaaagtct ttagcttgaa aaaaaaaaaa aaaaaaaaaa aactcga 477 



<210> 167 



97 



» 



WO 02/076488 PCT/US02/08276 

<211> 1930 
<212> DNA 

<213> Homo sapiens 
<400> 167 

ggcacgagca gaaatgaaaa attacttgag tgggatgagt aggaaaaaaa gtggtagtgt 60 

cattcaattc cagaggaaga gatgaattta atggtgaggt tactggcatt gggactaata 120 

tcagggatga tgtctaatat tactcaatca cattcaagta aaatatcagc ctttggtatc 180 

ttcattggac cagaacagtt tctttagatc ttcttatttc tctttcaagc ttcaacctta 240 

aataataggc cattgtgtag cagaaaaaac tttaaactta gaagtagaaa tctataatca 300 

aatcctcagc caacttaaaa acagttgtgt gaccttggat aagtcccata gccggactgc 360 

attctctaaa ccagcagcta taacgtttcc tacctcatta gagtgtggtg tgaatgaaaa 420 

tgtgaagaat gcctaaaaca gagtcaggcc ttgaatgcat tagaaagttt caggcagcca 480 

ctcattccat caccctgtct cactctttct agtgacccag ggtcacttac ctgtttttct 540 

taatacaccc caagtctttc tcttgcctct ctttgtagac cagaattatt cttgtgttca 600 

tcaatatgga ttgagtcaaa aattttcaag atctacctga cttattactt caaggatcca 660 

tcatcctctg gcttccattt ttttgtattt ctataggcat ggattcaaag gggatatctg 720 

actggctcag gctagatcca gatgaactcc tcctacattt gttgtccatc ctggtccaat 780 

cagtggcagc taagggagct cagtcacttg tttgaagttt gcccagtcaa ggggctgtgg 840 

aaggaagagg aagttaatct gagacaggat tgtgacaggc agaccaataa acatgtctgt 900 

ttacaatcta aatattcata aaattccaat cccccaaatt ctcccacata tgtatgctct 960 

tgtattcccc tgagatagga agggaggcat gctcataacc ccattttaca gatgggaaga 1020 

ataaagtgcc aggaatactg gtccctccat tagggtcact taatgagcca ctggtgaaac 1080 

aagaaataaa tccgaattga gagcttagac tgcctggtct cctgttaaca attaagactg 1140 

caaaaatttc aaaccatatc gcatgcacaa taaatactgc atctgaatca attgtagaga 1200 

caaagacaga ggcacaggga gaagacagat ctatccaagg tcactcaagt gaggaaataa 1260 

accagcttaa aatagacttc tgtctcagca gcattgtgct ttcactcctg ggcaacttcc 1320 

tgcctataca gcaacattaa tgccagcaag gaaggaacct gagggttaaa tccttggccc 1380 

cagccccaga tagcaataca gaaccccacc cccgtaattc agtcaataaa tagatgtccc 1440 

tttcatacaa gtttcagaaa acacagttaa tatacaacca ctactcacaa attaaataag 1500 

ttatcttact gtaaaggata taactatttt attatctttg caattaaaat gaaatatgct 1560 

aaaggtagaa gcaatacaaa acagctgctg ccagaagttt caataaaaga tcactactgg 1620 

gcacccttat aactgtggga ccattaggag atttaaatgt cttcttcact ttgcccacgg 1680 

tagggagtga ggctgcactg agaaacatca ctggcatgag gtctaattgc ctgccctatg 1740 

attaatgttg ccaagtgaat tcagaagttg tcacagttct catcctatgg tccaggctca 1800 

tttataaaat agagcaaagg gagcccagtg ctttgagaat gccaatgcaa aattataata 1860 

attacttatt acatgataca gttgttaaag tattttctgt gttgttcaaa aaaaaaaaaa 1920 

aaaaactcga 1930 

<210> 168 
<211> 1021 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> (248) . . (248) 

<223> n equals a,t,g, or c 

<220> 

<221> mis cofeature 
<222> (1004) . . (1004) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (1014) . . (1014) 
<223> n equals a,t,g, or c 

<400> 168 

ttcggcagag cccttgcgcg ctcttgaata cctgckttct gtagcgctag ttctcttcaa 60 
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gatttgctta gtgtcatttc atttcggttt cttttctcgc catgtttttc tgtcggaatt 120 

acggttcgtt ttggttctat gtactctcta aaatgttatc gtttttcatt tgtctactaa 180 

ttttcgtgca tttgttacta ctgagtttct taatatctga ctggcctccg cccacgggct 240 

ctgcaganca taaaatactc aggctgatgg tagtgcagag actctccctc cttgatcagc 300 

gcaaacgttg gtctgaggct tgagggatgg agcaacattt tcttggctgt gtgaagcggg 360 

cttgggattc cgcagaggtg gcgccagagc cccagcctcc acctattgtg agttcagaag 420 

atcgtgggcc gtggcctctt cctttgtatc cagtactagg agagtactca ctggacagct 480 

gtgatttggg actgctttcc agcccttgct ggcggctgcc cggagtctac tggcaaaacg 540 

gactctctcc tggagtccag agcaccttgg aaccaagtac agcgaagccc actgagttca 600 

gttggccggg gacacagaag cagcaagarg cacccgtaga akargtgggg caggcagarg 660 

aacccgacag actcaggctc crgcagcttc cctggagcag tcctctccat ccytgggaca 720 

gacagcagga caccgaggtc tgtgacagcg ggtgcctttt ggaacgccgc catcctcctg 780 

ccctccagcc gtggcgccac ctcccgggtt tctcagactg cctggagtgg attcttcgcg 840 
ttggttttgc cgcgttctct gtactctggg cgtgctgttc acggatctgt ggagctaagc * 900 

agccttagat agcagcagaa ggctttttgg attctcctcc ttgaaaagat tctcagttac 960 

caaacgtctc cacctagaaa ataaaaatac attaagatgt tganaaaaaa aaanaaaaaa 1020 

a 1021 

<210> 169 
<211> 727 
<212> DNA 

<213> Homo sapiens 
<400> 169 

gctggtatct ccagtgtttg ggtttagctc caacttacag gttaggacca gcttttctgc 60 

aggtgttgac cagcaatttc ctgcggcatt tacttcttga taacaagagt gagaagatag 120 

agacagggca gatagacact taagagtaaa atgtattaac acaaaggctc tggccgcccc 180 

cctacaaagg aggccatgga accgatggaa ctgatggagg aaatgctggg actgtgggtc 240 

agtgctgaca cacccatggc catacgtttg gtcttcttgg ccttggctgg gctggtggat 300 

gggaagccag tatggatcac cttgtggatg gatgcaaaga gaccaaactt ggcgggcact 360 

ggaagtacct ggggaagcag gagagactca cactgctgtc atggccccac agcctggagc 420 

ctcccctgcc tcctctgcct cttcagagcc cagcagaaag acagagaaag aagcctcctt 480 

ggggttccat tacccacact ccaaggtgga aatctttcag atggttagat gatgaaggta 540 

gtagaaggca aggatgattg ggagtagaag gaagagtgac aggctagcat gagctgtgca 600 

gcagcaagat tccatatgag caaagttcag aaagtgrgmm aaaaggacca agttggatct 660 

cctcctaacc ctgacctgca tgatatgggt gtgagaagct tcaactgaga aagctgctga 720 

gaaagta 727 

<210> 170 
<211> 1341 
<212> DNA 

<213> Homo sapiens 
<400> 170 

caggaattcg gcacgagagt ctgtggtcct ctgtatctca actttttcat cttaaaaaaa 60 

caaatagggt tgtgtgtgtg gctggtggtc ataaggtcct ttctggctct aataacctga 120 

gcttctgtta tgaagctggg acccttagag cctcaggatg atcctctgtt tgtttgtgaa 180 

gccccaatca ggtgctaagc accatagtgg cacttagctg aagctcctct gtaactcctg 240 

tgggccctgc cttgcccacc cccgacagct gctgcagtgc tcctgagcag cacaggcctg 300 

atggagcttc tggagaagat gctggccctc accttggcaa aggcagattc tcccaggact 360 

gcactcctct gctctgcctg gctgctcact gcctccttct ctgcccagca gcacaagggc 420 

agtttgcagg ttcaccagac actctctgtg gaaatggacc aagtattgaa ggctctcagc 480 

tttccaaaga aaaaggctgc actactctca gctgccatct tatgcttcct gcggacagcc 540 

ctgcgacaaa gcttttcctc tgccctggta gccctggtgc cctcaggggc ccagccactg 600 

ccagccacca aggacactgt cctagctcca ctgcgaatgt cgcaagtccg gtccctggtc 660 

attgggctgc agaacctcct ggtgcagaag gaccctctat tgtcccaggc ctgtgttggc 720 

tgcctggagg ccttgcttga ctacctggat gcccggagcc cagacattgc tctccacgtg 780 

gcctcccagc cttggaatcg gtttttgctg tttaccctct tggatgctgg agagaattcc 840 

ttcctcagac ctgagatttt gaggctcatg accctgttta tgcggtaccg gagtagcagt 900 

gtcctctctc atgaagaggt gggtgatgtt ctgcaaggtg tggctttggc tgacctgtct 960 

accctctcga acaccacact ccaggccctg catggcttct tccagcagct ccagagcatg 1020 
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ggacacctgg ctgaccacag catggcccag accctgcagg cctccttgga gggccttccc 1080 

cctagcacct cctcaggcca gccacccctg caggacatgc tctgcctggg aggggtggct 1140 

gtatccctgt cccacatcag aaactgatcc tcaggacttg aaggcccaga agtggagaga 1200 

gaatgagacc tggagacaaa gggcataatt gttggggaaa tggatgacag ctgaagctat 1260 

tcatatggag ccatatactc tattgttgaa atagaataag gaaataaaat gatacactca 1320 

cataaaaaaa aaaaaaaaaa a 1341 

<210> 171 
<211> 839 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> (1) . . (1) 

<223> n equals a,t,g, or c 

<400> 171 

ngaacccaga agatgctgcc tctcctgatc atctgtctcc tgcctgccat tgaagggaag 60 

aactgcctcc gctgctggcc agaactgtct gccttgatag actatgacct gcagatcctc 120 

tgggtgaccc cagggccacc cacagaactt tctcaaagta ttcactcctt gttcctagag 180 

gataataatt ttctcaaacc ctggtacctt gatcgtgacc atttggaaga agaaacagcc 240 

aaattcttca ctcaagtaca ccaagccatt aaaacgttac gagatgataa aacagtactt 300 

ctggaagaga tctacacgca caagaatctc tttactgaga ggctgaataa gatatctgat 360 

gggctgaagg agaagggagc cccacccctc tccatgaatg ccttcccggc tccatctcct 420 

acttgcaccc cagaacccct tggctctgtc tgcctcccca gcacctcagt ttctctacct 480 

tctcaccctc cctggcagcc tgcaatgagt cctgtgccag gaaccggcgg acctccctgt 540 

gggctgtgag tctcagcagt gctctactcc tggccatagc tggagatgtt tcttttactg 600 

gcaaaggaag aaggaggcag taaaggaaca gggcagcccg catgtcttcc agaagtgaac 660 

agaggccgca gctaccaccg tcacaaagtt cactcatctc tgggtcccgg tgaccccatc 720 

cccccatacc ctccatcctg ggtcctgggg ccccaaagct ctgaggccta ggagactgcg 780 

ctgtctcgtg gtttgcctac tcctacacct ttgtaaagag tctcttcatt aaaacccct 839 

<210> 172 

<211> 1022 

<212> DNA 

<213> Homo sapiens 

<400> 172 

cgctcctgcc gccgggaccc tcgacctcct cagagcagcc ggctgccgcc ccgggaagat 60 

ggcgaggagg agccgccacc gcctcctcct gctgctgctg cgctacctgg tggtcgccct 120 

gggctatcat aaggcctatg ggttttctgc cccaaaagac caacaagtag tcacagcagt 180 

agwgtaccaa gaggctattt tagcctgcaa aaccccaaag aagactgttt sctccagatt 240 

agagtggaag aaactgggtc ggagtgtctc ctttgtctac tatcaacaga ctcttcaagg 300 

tgattttaaa aatcgagctg agatgataga tttcaatatc cggatcaaaa atgtgacaag 360 

aagtgatgcg gggaaatatc gttgtgaagt tagtgcccca tctgagcaag gccaaaacct 420 

ggaagaggat acagtcactc tggaagtatt agtggctcca gcagttccat catgtgaagt 480 

accctcttct gctctgagtg gaactgtggt agagctacga tgtcaagaca aagaagggaa 540 

tccagctcct gaatacacat ggtttaagga tggcatccgt ttgctagaaa atcccagact 600 

tggctcccaa agcaccaaca gctcatacac aatgaataca aaaactggaa ctctgcaatt 660 

taatactgtt tccaaactgg acactggaga atattcctgt gaagcccgca attctgttgg 720 

atatcgcagg tgtcctggga aacgaatgca agtagatgat ctcaacataa gtggcatcat 780 

agcagccgta gtagttgtgg ccttagtgat ttccgtttgt ggccttggtg tatgctatgc 840 

tcagaggaaa ggctactttt caaaagaaac ctccttccag aagagtaatt cttcatctaa 900 

agccacgaca atgagtgaaa atgatttcaa gcacacaaaa tcctttataa tttaaagact 960 

ccactttaga gatacaccaa agccaccgtt gttacacaag ttattaaact attataaaac 1020 

tc 1022 

<210> 173 
<211> 1028 
<212> DNA 
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<213> Homo sapiens 
<400> 173 

gcatgatcct gtggaacaca gtttgggatc atagatgtga attaagacac caccgagata 60 

cgggctgtga ggttcatacc gtgctgatag cactcgtggt gtctgtgaaa tgtgggtaag 120 

acattcaaac ctggttttga tactggaaac tcttccttta aaactgtgac catgatttca 180 

ttcagcccct ccacacccct atgtctgcct tgtttcagag tgagttttct atggagcctg 240 

tggccctttt gcagcccacc tggtggcttc ttaatgtaac tcttcccctg gtcgcctgga 300 

gtggaccact catctgcagg cctctcctgc atggggaggg taggcaggga gcagcatgtc 360 

tgcaggggtg aacctttgct cttctgtcag gcgaggccca ggctgcacca gccacctgcc 420 

acatggtgac agtgccacgg gccctgcgta tggcccctgc aaccgtgctc tggcgggcac 480 

acctggctgc tgcaggccaa ggccgctgtt cagtgaagag tcccatgttt agtatggact 540 

aaagtcccat gtttagccay tgccccagtc tcccgtgacc ccagaaacca ggtcactgga 600 

ccacagtgcc agatcctcat cacgccggtg agcacctaga agtgagaaca ctgtattcct 660 

acaatgtaca cttggatatt tctccttatt tagtttctag tgaaacaaat caagtaagga 720 

actatcttta gtttagatgg aattatttgt ttttaattgt tgccgtattc atctatatag 780 

ctaatatttc aagataagta atgaacaaaa cctgtctaaa ccttttgttt ccaatgaatg 840 

aaagtcatgc actttattta taggctctat gttttggctt ctgcagtact tttattatct 900 

atacataatt tggccaaaaa taagaaattg gaaagaatga aatgtttagt ttatagtaga 960 

agaaagatga tgacactaag ttgtgaaaat atgttgtgat ttttatgaaa taaactcacg 1020 

gcacgtag 1028 

<210> 174 
<211> 808 
<212> DNA 

<213> Homo sapiens 
<400> 174 

tgcaggaatt cggcacgaga ttacaacaca tcagaacaaa atgttatgga ctaccatgga 60 

gcagaaatcg tgagccttcg tttgctgtca ctagtaaaag aagaatttct ttttctcagc 120 

cccaacctag attcacatgg actgaaatgt gcatcttctc ctcatgggct ggttatggtt 180 

ggagttgctg ggactgtcca tcgaggaaac acttgtttgg gcatttttga acaaattttt 240 

ggactcatcc gctgcccttt tgtggagaat acttggaaaa tcaaatttat caacctgaaa 300 

attatgggag agagttccct tgctcctgga acattaccga aaccatctgt taaatttgaa 360 

caaagtgatc tagaggcctt ttataatgta atcactgtat gtggtaccaa tgaagtacga 420 

cataatgtaa agcaggcttc ggatagtgga actggggacc aagtttgagg tagtggaaat 480 

gagacattgc tgaacaaaag agaactgggt ttacctgacc ctctaaagcg ctaagtactg 540 

tcagcctgaa aaaaatcttc tatacagaaa ctcttccaaa tactatatca gtaatgtctg 600 

aatgatttca gatgtgaaaa ttgacatatt ttagttgaaa tacctttctg gactacagac 660 

ttacatatca tgtgaatact tacctatttc tacccgagtt gcagcaagta ttctgaaagc 720 

ttaatgcaaa taaatcccac tttagatctt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 780 

aaaaaaaaaa aaaaaaaaaa aaaaaaac 808 

<210> 175 
<211> 1898 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (1398) . . (1398) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_ feature 

<222> (1428) (1428) 

<223> n equals a,t,g, or c 

<400> 175 

ggcacgaggt ctccatggcg ttagaagtct tgatgctcct cgctgtcttg atttggaccg 60 
gtgctgagaa cctccatgtg aaaataagtt gctctctgga ctggttgatg gtctcagtta 120 
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tcccagttgc agaaagcaga aatctgtata tatttgcgga tgaattacat ctgggaatgg 180 

gctgccctgc aaatcggata catacatatg tatatgagtt tatatatctt gttcgtgatt 240 

gtggcatcag gacaagggta gtttctgagg aaactctcct ttttcaaacc gagctgtact 300 

ttaccccaag gaatatagat catgaccctc aggaaatcca tttggagtgt tccacctcta 360 

ggaaatcagt gtggcttaca ccagtttcta ctgagaatga aataaaattg gatcctagtc 420 

cttttattgc tgactttcag acaacagcag aagagttagg attattatct tctagtccaa 480 

acttgctctg agctaaagga gaaatggaaa cttgaagctg gtgttatgta ttttgcagga 540 

aaacagtttc attttttcat agcaaaaata tagttggtgt atatctctcc ttaagtctct 600 

ggtttctaaa aaccctactt cagtaaaggt cctgattagt tgattagtga atgtgtattt 660 

ctaaatattt gtattcagta ggggtatggc tgattaattt aacattaact attaggtaat 720 

tcatattata catttaagtt ctttctgttc tgtgtagaag attcagaaat atgtcttcaa 780 

agacaatgac ttgatctaat tgataagaac ctccaataaa tatgttctaa tatttttcag 840 

gaagaataaa gaatagagag agacatataa atgtgcaaga ggcaaaactt tgagcatagt 900 

gtaaaattta acatattaac tctcacgaaa ggcaaaatcc ttttatgtgc agatacttta 960 

attcatgtag attttcctat taatcagtaa agttgaatcc taacaataat gccatgtgac 1020 

aacctattta gattattcca gaattaaatt caatttattt tctagagctc aagtaaccac 1080 

tactttaact gaaatttgat gttaggtttc ccttgttcct ccgaatggtt cttccacact 1140 

caaaataatt gaatggttga gttggttaag caaagagtta tcctgccacc taagagcatt 1200 

cattaaatga ttatttatta ccacctactt tatactatct tcctttcttt aaacatggag 1260 

tctaaatatg taatatatca aaaaatactt ctgatttggt agatttctta tatcaagggt 1320 

gagaattgaa ctgtgccatt ggctattcaa tagcttattg aatgtatgtt ttggatgcca 1380 

catcctcctg gaagcaantt ttgccaagat actgtttatt attatttnta attaaagtga 1440 

tactattcca ttttcaatta aatgctgtct gtagctgtta acttgtcaga taaagaattt 1500 

gaccctgtca tagtgaacat ctgtctttac cagttaacat gcagctaaga ggtaatactt 1560 

ctatgggact tcctaagggt cagaatatgg tacaagtaca ttgcgataaa ttatttaatc 1620 

ttcttaaaga gtgaaatata tcatgattat cccaatttta cagataagca aacagaggtt 1680 

aaatcatttg cc tgagtcac ataacttgtt ggtgttggtt caagatttaa aatagggcaa 1740 

tctgccttta gatctgtctc tatactctct ctttgtatat tagccactat actctactgc 1800 

ttggaatcat cttaagttgc tgaactttag ttctctagaa aacaattgct attcaagcag 1860 

ttatacaact ctcaataaaa cttaaagttg aaaaaaaa 1898 

<210> 176 
<211> 818 
<212> DNA 

<213> Homo sapiens 
<400> 176 

aaaacttgag tatgttgagg gaaggaatat atatatatct gggagagaat ggatacgttt 60 

tgtttttctg aaatggaatt agaaagatgt tcagttgtct tgtgcattct tgcaaacctt 120 

gcagttttga gagccctgtt tctgccttgt atcattttcc actgtgtatc kgattctagg 180 

agcgtgaaca gggagacaaa ggtgaagttt gtgcacacct ctgtccatgg ggtgggtcat 240 

agctttgtgc agtcmgcttt caaggctttt gmccttgttc cycctgaggc tgttcctgaa 300 

cagaaagatc cggatcctga gtttccaaca gtgaaatacc cgaatcccga agaggggaaa 360 

ggtgtcttgg taacctaatt tttttttaaa ttatgaaatc tgcttttata ttcaaaacta 420 

ttactgtcaa gtaaaataca tttttatgtg ttttcattgt gctgaagaaa aactaatttc 480 

agcatggaaa tatgtatgtt tggctgggtg cagcgtctca tgtctgtaat cccagcactt 540 

tgggagacca aggcaggcag atcacttgag gtcaggtgtt cgagaacagc ctggccaaca 600 

tggcaaaacc ctgtctctac taaaaataca aaaattagct gggtgtggtg gtacatgcct 660 

gtaatcccag ccacttggga ggctgaggca ctagaattgt ttgaacctga gagatggagg 720 

ttgcagtgag ctgagattgc accactgcac tccagcctgg gtgacagggt gacagagcga 780 

gactctgtct caaaaaaaaa aaaaaaaaaa aactcgag 818 

<210> 177 
<211> 3435 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (760) . . (760) 
<223> n equals a,t,g, or c 
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<400> 177 

cgagcagcat cagaaagtac gawtctgctc tgagggcgra tgraaaatga agawtctgac 60 

gtaaagcctc cagactkgcc aaacccaatg aatgctacct cccagtttcc tcagcctcag 120 

cactttgcag ctttggcctc cgtctgcctc gggatatcac agagctgccc gagtgragta 180 

ggggtacccc ttctacatgg ccatgggctt cccagggtat gacctctcgg ctgatgacat 240 

agctgggaag tttcagttca gccggggcat gcgccgcagt tacgacgcag ggttcaagct 300 

gatggtagtg gaatatgctg agagtaccaa caactgccag gctgccaagc agtttggagt 360 

attggaaaaa aacgttcgag actggcgcaa agtgaagcca cagcttcaaa acgcccacgc 420 

catgcggcgg gcattccgag gccccaakaa tgggaggttt gctctggtgg accagcgtgt 480 

ggccgaatat gtcagataca tgcaggccaa aggggacccc atcacccggg aggcgatgca 540 

gctgaaagct ctcgaaatcg cccaggaaat gaacattcca gagaaagggt tcaaggcaag 600 

cttgggttgg tgtcgaagaa tgatgagaag gtatgacctg tctctgaggc ataaagtgcc 660 

cgtgccccag cacctgccgg aagacctgac tgagaaactc gtcacttacc agcgcagtgt 720 

cctggctctg cgcagggcgc atgactatga ggtagctcan atggggaatg cagatgagac 780 

gcccatttgt ttagaggtgc catcacgggt aactgttgat aaccagggcg aaaagcctgt 840 

cttggtcaag acaccaggca gggaaaaact gaaaatcaca gcaatgcttg gtgtcttggc 900 

tgatggragg aagttaccac cgtacatcat tttragggga acatatatcc ccccggggaa 960 

gtttcccagt gggatggaaa ttcgctgcca ccggtatggg tggatgactg aagacttgat 1020 

gcaggactgg ttggaagtgg tgtggagacg gaggacagga gcagtgccca agcagcgagg 1080 

gatgctgatc ttgaatggct tccggggcca tgccacagat tccgtgaaga actccatgga 1140 

aagcatgaac actgacatgg tgatcatscc agggggtctg acctcacagc ttcaggtgct 1200 

ggatgtcgtg gtctacaagc cactgaatga cagtgtgcgg gcccagtact ccaactggct 1260 

tctggctggg aacctggcgc tgagcccaac cgggaatgct aagaagccac ccctgggcct 1320 

ctttctggag tgggtcatgg tcgcgtggaa tagcatctca agtgagtcca tcgtccaagg 1380 

gttcaagaag tgccatatct ccagcaactt ggaggaggaa gacgatgtcc tgtgggaaat 1440 

cgagagtgag ttgccaggag gaggagaacc accaaaagat tgtgacaccg aaagcatggc 1500 

tgagagcaac tgaagggaaa gggaaagcaa atggaactct gatttaaaca gctggggatg 1560 

aaattcctca agatgattat tcctgaaagt gtggatgcgc tggatgcgca gggaacatca 1620 

ggaaaaggcc acggggctct gaacagcccc ggtccagaca gcagcctgta catccatccc 1680 

aggacacagc ccagcccctc cccacaccat acaaggtatc agaaaagtct aggacctatc 1740 

atttcatcag agacatgatc agaaaagaaa ctgcttctgc cccatttctt gttttggaga 1800 

ttactccatc tgtccatcaa aagaaacctg taaatatgaa agaacaaagg ttatttcctg 1860 

gagaaaagac aatttattca acaccaacaa gggactcatc atatgggcac aactctggtg 1920 

tccttctatg gagaaaacct caagtaaagt tttattctgc ctttgaaaat gcttccaaaa 1980 

gtagaccctg tccccacaca ggtcaagact acagagaagg ctttgtagaa atgtgtcacc 2040 

tatgtacacc tgctacttac acatttcctc ttttggaaaa atgagatact tagaataaca 2100 

agaaaattaa gacatactgg cctggtgcca gcagatggct tttctataga caaactaggt 2160 

tagtgtggaa gatataggtt aaaataaact atgctgtttt atttatcttc ccaacctgat 2220 

tggcagctag acttttttag ggtctcattt aatggccctg tttttttcat tatfcatattt 2280 

aatgataggg caggatttcg tatgcaagct cttgtttctc aggctgcctg cagaagaagt 2340 

cgctataaat tatctgttgt ctacatggta caaggcccat tgactcatct gatgcttgtt 2400 

ttgttaattt ctttaatatt tttatcacgg ggcagtggga gggcttgggc ttttagccac 2460 

agctgtttta agacttctga tctcctgccc tgcaggaata ggtgggaagt cattgaattt 2520 

ttacactata gtaatttgca ttcccacata agtttgagtg ttacgaaaac attcctttaa 2580 

agggatctgt gctacacaaa atatgccagg acctcacaga caaagccatt gctagaaatg 2640 

tcattccaat gatcagatct ggaaacaggc tgccataacc acttttcctt cttgtagact 2700 

cagctcacct gtatatttaa actgttcttg gcatcttgaa acacctattt ctactcaggt 2760 

actcattgtc ctgttactga ttcacctttc tgatcctttt caaccagttt tcccccaagg 2820 

ggggaaattt tacttaacct ctagtatttg aacaactcaa tatttgaatt gttgccccat 2880 

ttgcttttac ctgtactgta ttcttggtca tctcaaatgg cgtctaaacc cagctacttt 2940 

gcattccaga agtttccatt ccctccaatt ccacctaatt tttcatctgt cctagttact 3000 

ggctctttct tcatgtctta tttctcttgc tttgggagct taaaagattt tacaagacct 3060 

aattttgggt tccttccttg gagccatagt taccctgcca agaagagtag aaaatgggtt 3120 

caactcctgt ttcgctccac caacacctct gtgagtctca tcatcagctg agcgatgatg 3180 

ccttacaggt tgcatagcac tggaactttc ctagagtaac ggctctgctg ccagggtttc 3240 

tctgggctca ttcttccact gacttaatta tgatctatgc ctaacagagc cccagtacaa 3300 

ctattttgca gaatggctgt taccctagaa ttactatagc acatattgag atatagttgt 3360 

actccctagt agataggaac tgaccccaac aataaacttt gataataaag amaaaaaaaa 3420 

aaaaaaaaaa aaaaa 3435 



103 



t 



WO 02/076488 



PCT/US02/08276 



<210> 178 
<211> 1481 
<212> DNA 

<213> Homo sapiens 
<400> 178 

ggcacgagcc tggcagagag actctgaaat gagggattag aggtgttcaa ggagcaagag 60 

cttcagcctg aagacaaggg agcagtccct gaagacgctt ctactgagag gtctgccatg 120 

gcctctcttg gcctccaact tgtgggctac atcctaggcc ttctggggct tttgggcaca 180 

ctggttgcca tgctgctccc cagctggaaa acaagttctt atgtcggtgc cagcattgtg 240 

acagcagttg gcttctccaa gggcctctgg atggaatgtg ccacacacag cacaggcatc 300 

acccagtgtg acatctatag cacccttctg ggcctgcccg ctgacatcca ggctgcccag 360 

gccatgatgg tgacatccag tgcaatctcc tccctggcct gcattatctc tgtggtgggc 420 

atgagatgca cagtcttctg ccaggaatcc cgagccaaag acagagtggc ggtagcaggt 480 

ggagtctttt tcatccttgg aggcctcctg ggattcattc ctgttgcctg gaatcttcat 540 
gggatcctac gggacttcta ctcaccactg gtgcctgaca gcatgaaatt tgagattgga ■ 600 

gaggctcttt acttgggcat tatttcttcc ctgttctccc tgatagctgg aatcatcctc 660 

tgcttttcct gctcatccca gagaaatcgc tccaactact acgatgccta ccaagcccaa 720 

cctcttgcca caaggagctc tccaaggcct ggtcaacctc ccaaagtcaa gagtgagttc 780 

aattcctaca gcctgacagg gtatgtgtga agaaccaggg gccagagctg gggggtggct 840 

gggtctgtga aaaacagtgg acagcacccc gagggccaca ggtgagggac actaccactg 900 

gatcgtgtca gaaggtgctg ctgaggatag actgactttg gccattggat tgagcaaagg 960 

cagaaatggg ggctagtgta acagcatgca ggttgaattg ccaaggatgc tcgccatgcc 1020 

agcctttctg ttttcctcac cttgctgctc ccctgcccta agtccccaac cctcaacttg 1080 

aaaccccatt cccttaagcc aggactcaga ggatcccttt gccctctggt ttacctggga 1140 

ctccatcccc aaacccacta atcacatccc actgactgac cctctgtgat caaagaccct 1200 

ctctctggct gaggttggct cttagctcat tgctggggat gggaaggaga agcagtggct 1260 

tttgtgggca ttgctctaac ctacttctca agcttccctc caaagaaact gattggccct 1320 

ggaacctcca tcccactctt gttatgactc cacagtgtcc agactaattt gtgcatgaac 1380 

tgaaataaaa ccatcctacg gtatccaggg aacagaaagc aggatgcagg atgggaggac 1440 

aggaaggcag cc tgggacat ttaaaaaaaa aaaaaaaaaa a 1481 

<210> 179 
<211> 652 
<212> DNA 

<213> Homo sapiens 
<400> 179 

gaattcggca .cgagcaacag tggggcactc tgctcccagg caggtcccac tgggctgagc 60 

cgcacagcct ggctttgggc ttccctgact gcaccaccca catcasctgc ctctagccct 120 

taamatacaa aacttccccc agtcacfcggc cgccaggctg agttggggga tgtgttacat 180 

ccctgggtcc actggggggc agtgttggcc atggtgttgg tgctggctct gccgagaggc 240 

gttggagtgg ctgtgtgggg cggtgagcgc cggcccagcc tgatggaacc cactgtacca 300 

ggcccaggcc tcagcctctg agaaggactt ccctgtgtca ctcactcata catgtcctca 360 

ggacgtgaag acatttcagc agaccaaagt ttccttcgaa tttccttcga atcgtccaga 420 

tacttggaga catctcctcc tcacctgtgg ggtgctgggg cagtcctagg cgtgggggca 480 

gatgggtgga cagctgctgc tgccctgctg ggggtgggca gcccttggag cacacagtgg 540 

tgaagacatt cctgaatatg tctcaggctg tagaaatctt attttgtgga aagattttag 600 

agaatcatca aaataaactt ttaccaaata aaaaaaaaaa aaaaaaactc ga 652 

<210> 180 
<211> 1711 
<212> DNA 

<213> Homo sapiens 



<400> 180 

ggcacgagcg 

tttgcccatc 

gagtagtgcg 

tctggtgtct 

agtcctcctg 



ctcctgtcct gccactgagg 
tgtcccggtg ctaacacaca 
aaatgcgtgt acctctagtc 
atttagcaag tgaaagtttc 
gattgcattt atggaagttt 



gacccggtta ccaaccctca tgtagctcag 
gttctcggga gactttcccc attcccagag 
ttaagctggg cgtttgtatt agttgggttt 
tggttccctc cttcactgtg tgacctgact 
atacgagacc tagtttccat ggaggaactc 



60 
120 
180 
240 
300 
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actgattccg cgagggagat ggggtactgg atgatggtct tcagccttaa ggctatgttt 360 

ccagtgtcct ctgggtgttt ccaagagcgg caagaaacga ataaatctct gacccttctc 420 

aggtgcagcc agagagacac tagcccactg atggacggac agacgtgggc aagggtccgt 480 

gtcactaaac cacccaccac tgccacagct gcctacaaca gacacatcag atgacactcc 540 

gggcaaataa atgattttca ctgaggactt actggtttta ataataggtc ctggtgtaga 600 

gaagtccctc aacctattgt gcaacgagtt ttgagaagcg ggtaagctgt atgttttgtg 660 

gttttgtttc ataaattcat ctacaggaag accaatattg actgaatgaa gctttcattt 720 

aaagagctaa aatatgcttt gtgtttttat atgtggatac tactttaaac ctaacgacta 780 

ttcattgtat catagcttgt gatgtattct gctcatggct tttaaggtaa attgtgccat 840 

gatccactgc cattctaatt gctttaacaa gtcattacca cactactgtt acatcttaat 900 

tatgcataca gacaggtaga cttgttttac atatgtgaac taactagttg tcaaagcaaa 960 

tgcagattgt attctgcaag taaagtcttt ttctctctga aatttctagg gatgttcttt 1020 

aagtgaaatt catattaaaa ctgaagattt tagttacaag aactgagtgc agattaagtc 1080 

tttgtgattc aacatagtca agatacaact gtggatattt catggaagta tgcaataaaa 1140 

tgtctctacc tggaaaaatc tatcaagcag cgtcacagta ctgaatttga aaccagaaat 1200 

actgggtttt tatataaatg cttcatagat ttgttttatg ataaagggca cataactctc 1260 

ctaaacctca caccacctct tgaataggta taataagtcc acatcaatgc tgatgcctta 1320 

gctattatta aactcttaca gtatgatgta aagtgaaagt acaatgtaag atcattccta 1380 

ggccaacttt gaccagtttt atacagaaac atgtgccaac ttttctgttt gcaaggataa 1440 

tatcaaagca aacaccagaa agttatatct ttgatgcatt ttttcaaaat catacacata 1500 

atacacaaac caaagacaaa tgatgaatat tacgtcagaa aatataaagt cttccccttt 1560 

cttcttttgc caagaaagtc caatattttc accattttta tgcacacaat caactttatt 1620 

taagctggaa gttaatgtct cattgttttc attgttctaa ataaacacct tttcccttga 1680 

gtattgctct aaaaaaaaaa aaaaaaaaaa a 1711 

<210> 181 

<211> 2058 

<212> DNA 

<213> Homo sapiens 

<400> 181 

ggcacgagct caaagagtaa gaatccaagt gtgtgacatt acatagcttt gcatctatgg 60 

aaacctaaat cataattgtt tccactgccc aattatgttc cttttcataa catttactat 120 

tctggctata tttatcatag aacctaggaa ccttagagtt gacctgaatc taattaaatt 180 

tcagacctcc tggccaaaga ccctagtgga agagcaaaac taaatcaaca tattaccaat 240 

ctcaagtatt tctctgagga cccagaccac tgactttttg ttgtcatttt caggttgatc 300 

ctataactgt atgttctaca atatctgtgc tccaccagct cagtgaggaa tcaacggaat 360 

atcaaaagta aatattggtc accatatacc ttttggtact agtctacgaa ataattggct 420 

gaggaactgt ttcatattaa agaaaagcta aaagcaatgt gtgatcttag attagaccta 480 

tgattggaat gtatgtatat tttatataca aaatattgag gaaattgaca aaatttaaat 540 

acagaatatg gattagataa taggaatgta tcaaggtcaa tatttaaaaa gataatttca 600 

acttttattt tattcagtgg gtacatgtgc agactttgtt ttacatagta cccaacagtt 660 

tttcaacgct tatcccccac cctctagtaa tctgcagtgt ctattattgt catcttcgtg 720 

gctattgtac atgggattcc atacttgatt ttgctctcaa catgaacatt attggtgtag 780 

agaaatgcca ctaagttttg tacgttgctt ttgtatcctg aaactttgct aaagttattt 840 

atcatttcca ggagcctttc gttggagtct ttagggtttt ctagttagag agtcatcagt 900 

gaagagagat aatttgactt cttcttttcc tttttgggtg ccttttactt ctttctcttc 960 

cctgattgct ctggctaggg tttctaacat caaatttttt gatgttgatc attatactgt 1020 

agttatgcag gggaatgtct ttgttctttc taaagaacat tcaatgttaa acattgaaat 1080 

gtttaacaat gaaagatcat tatatgtaat ttactctcaa atggtcaagg gagaggagag 1140 

agaaagagac agagagaaag aataagaaaa gagagtgagt aaatggggca aaatgcaaac 1200 

aatttgtcaa tctatgtaaa atgtttacaa gagttctgtg ttacattttt gcaactcttc 1260 

tgtatgttta aaataatgtt aaaataacaa ttttcccaaa tgtcaaatgt tgccacatac 1320 

aagcatttat gagcatggaa aatgtggctt ctgaatgata ggatacaaaa tctggtgatg 1380 

agcgaataca ctgaatatat caagattgag ttgccatctt agaaggagaa tattagctat 1440 

aaatcacatg atgaatacat caatattaaa tagactagaa aaccaataat tataaggtca 1500 

caagagtgca ataaacatga attatctcca ttgcatgtat tttcatctct acttgccagt 1560 

tttatgagat tcagtcccca atatattttc aattaattca acatagaaga ttcacttcta 1620 

gtatgttttc aaattgtttc aaaccctgac catctttttg attgctctac cttccaaaag 1680 

aaaagaaggg aacactaatt ttctttcctg atttacttca ttgttttctt ctgttagatt 1740 

aactttacct ataaaagatt gtctcttgac tttatatata tatatatgtg tgtgtgtgtg 1800 
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tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tatttgaaga ggacatgtgc ctccataaaa 1860 

ggaaataaaa tgagagaata cattattgat tttgtgaaat caaaatattt gaattatggt 1920 

ttctcaatat tcaaaaactc ttgcagtttc tgtacttatt tcttctgatg catagagttt 1980 

cggggactac atatgtttca caaccaaaga tatccacttg aaataaaaac attataaagt 2040 

taaaaaaaaa aaaaaaaa 2058 

<210> 182 

<211> 2398 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1874) . . <1874) 
<223> n equals a,t,g, or c 

<400> 182 

attgcttagt ttgatgtgtc ttgctttaaa tccatttatt tcaacaagct taaagagatt 60 

tttttttaat ggagatgatt taattttaac aatctgtgat tttctctgaa tcgaacttgt 120 

gttttggcac ctttcaatct gtggtaacaa atgacaagaa gggtgcaatt cttccttccc 180 

ttgtgcaggg attttgcctc cccctttctc ccagatgaaa gatatttggg tctctagaat 240 

aactgtggta cagttagctc cagagtgttt tctttctgga ggcagtttag acaacagcct 300 

caagtagtgc ttttgttaaa aatatacatg tttttaaaag tgcttgtatt tctaatattc 360 

ttttctcctt tctcttctag tctgttctct ggggaggcag taaggggccg tggagctggc 420 

ctcggcctcg gcatcgggag aggctggact tcctgtctct ctgtgctgaa tggctgcgat 480 

ggcgcccgct ctcactgacg cagcagctga agcacaccat atccggttca aactggctcc 540 

cccatcctct accttgtccc ctgggcagtg ccgaaaataa cggcaacgcc aacatcctta 600 

ttgctgccaa cggaaccaaa agaaaagcca ttgctgcaga ggatcccagc ctagatttcc 660 

gaaataatcc taccaaggaa gacttgggaa agctgcaacc actggtggca tcttatctct 720 

gctctgatgt aacatctgtt ccctcaaagg agtctttgaa gttgcaaggg gtcttcagca 780 

agcagacagt ccttaaatct catcctctct tatctcagtc ctatgaactc cgagctgagc 840 

tgttggggag acagccagtt ttggagtttt cyttagaaaa tcttagaacc atgaatacga 900 

gtggtcagac agctctgcca caagcacctg taaatgggtt ggctaagaaa ttgactaaaa 960 

gttcaacaca ttctgatcat gacaattcca cttccctcaa tgggggaaaa cgggctctca 1020 

cttcatctgc tcttcatggg ggtgaaatgg gaggatctga atctggggac ttgaaggggg 1080 

gtatgmccaa ttgcactctt ccacatagaa gccttgatgt agaacacaca attttgtata 1140 

gcaataatag cactgcaaac aaatcytctg tcaattccat ggaacagccg gcacttcaag 1200 

gaagcagtag attatcacct ggtacagact ccagctctaa cttggggggt gtcaaattgg 1260 

agggtaaaaa gtctcccctg tcttccattc ttttcagtgc tttagattct gacacaagga 1320 

taacagcttt actgcggcga caggctgaca ytgagagccg tgcccgcaga ttacaaaagc 1380 

gcttacaggt tgtgcaagcc aagcaggttg agaggcatat acaacatcag ctgggtggat 1440 

ttttggagaa gactttgagc aaactgccaa acttggaatc sttgagacca cggagccagt 1500 

tgatgctgac tcgaaaggct gaagctgcct tgagaaaagc tgccagtgag accaccactt 1560 

cagagggact tagcaacttt ctgaaaagca attcaatttc agaagaattg gagagattta 1620 

cagctagtgg catagccaac ttgaggtgca gtgaacaggc atttgattca gatgtcactg 1680 

acagtagttc aggaggggag tctgatattg aagaggaaga actgaccaga gctgatcccg 1740 

agcagcgtca tgtacccctg tgagtagacc tcatgcatga tagcattctt gagaaatgtt 1800 

ggcacaagga agaatgaatg aatcgccatt atggagagaa tgtgttsttt gtacataggt 1860 

gtytagttcy gttngttttt tccctgatgt tgggtagatg agtgcatata catgctagtg 1920 

aagaagggga agatactttg ctgtagggtt gtattgttgt agtctaaatg gtggtaattt 1980 

ccttttgaag tctaagaaaa ataactagga gacatcttat gtgtaaaatt gtactagtac 2040 

ctctttaaga gtgaatttag atttcttttg aaactatata taggacatga taagttaatg 2100 

gcctgattgt tgagattttg ttgtttccag taagcaggga caaatgctga gttgacctag 2160 

ttacctttgt aggaaattac agttgctttt gattgaactt tcagcagaga gcacacccag 2220 

tcttcaattt taacacttga gattttctta cattttaagg actgacaatt agaaaatgct 2280 

tcagaatatt taatacatcg cctccaagca cagtctagtt tcacaacctg actctcttcc 2340 

tattaaaaaa aaaaaaaaaa aactcgrggg ggggcccgta cccaatcgcc cctcatga 2398 

<210> 183 
<211> 1505 
<212> DNA 
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<213> Homo sapiens 



<400> 183 

gcaccatggc 

ctgtggccct 

ctccaggtgt 

ttccgaggat 

gccaccatgg 

aagtagtcac 

aattggatgt 

attttttctc 

ccagtggcta 

tccccactcc 

gcctttgaaa 

aggaggtttt 

caacaatatg 

cctgcggtga 

ctccaaatga 

aacattttta 

gtttctgatg 

cactcaaata 

tgtcacctty 

tgtttagctc 

attattatat 

aggcaggcag 

ctgtctgtac 

cagctacttg 

ctgcactcca 

tcgag 



cacgcccctg 
gatgggtccc 
ggcggggcgc 
gtacagcgct 
cacggaccgt 
cttcacaaag 
tagctgctgt 
agtgacatca 
taaggccatt 
tgagaaagag 
gcaagtttat 
taaaacagta 
caacttatgg 
atggagctga 
tagctgaaat 
agagcatttt 
tgcagctata 
gtgracattg 
tggaatcccc 
ccactcataa 
aggccaggtg 
atcacctgag 
taaaaataca 
ggaggcttga 
gtctgggcca 



gaggatgttg 
tgcagagcct 
cctgctcctg 
ggagctcggg 
ttattgtaca 
catgcactga 
tgattttgcc 
aaatgaccca 
gagcttcagg 
caaatatctc 
tatttttttc 
cataaaaatt 
gtcacctccc 
ctaggcactg 
caagctgcag 
cttttttaaa 
ttgcagtgat 
tacccaatag 
agtgtctgtg 
gtgagaacat 
cggtggctga 
gtcaggagtt 
agaattaacc 
acctgggagg 
cagagtgaga 



gcaagcaggt 
cacgctgctt 
gcagactaca 
gccggcacgg 
ggtaatgagg 
ctgtataaaa 
atcctggtcc 
gcaatacgca 
aggtgcctag 
caaaaacagg 
ctgggtatag 
actcataatt 
gtgtgccact 
ccctcagggc 
cagcactgta 
tatgtatata 
gacatccgfcc 
ggaagcttta 
tttccactca 
tttaagagca 
catctgtaat 
tgagaacagc 
agatgtggta 
cagaggttgc 
ctctgtctca 



gggtaggtct 

gtcgctcctt 
tcctgttccg 
ggctcactag 
tgacatctca 
aaagaggcag 
cctggccctc 
ctcagcagca 
cgcccctgct 
aggaatatac 
aagccttgcc 
ttacaatccc 
catttctaga 
gcttacgttg 
ttctgctgaa 
tttagggggt 
tgggctttta 
atcccccacc 
gtatgtccat 
ttttctcatg 
cccagccctt 
caggccaaca 
gcggcgggca 
agtgagctga 
aaaaaaaaaa 



tgtctgcttc 60 

gtcctcttcc 120 

acaggacctc 180 

catcatcgca 240 

ggctgcaggg 300 

aggcaatgga 360 

tccactctcc 420 

gcagcgtcac 480 

ggtacctctc 540 

ccttttagaa 600 

cattctttgt 660 

tagattgaat 720 

tgtaggaggc 780 

taagaatctc 840 

aatgttgaaa 900 

acaagtgcgg 960 

gtggaccttc 1020 

cctyccaccg 1080 

gtttacccat 1140 

ccattaaaaa 1200 

tgggaggctg 1260 

tggtgaaacc 1320 

cctgtaattc 1380 

gattgcacca 1440 

aaaaaaaaac 1500 

1505 



<210> 184 
<211> 868 
<212> DMA 
<213> Homo sapiens 



<400> 184 

gactgactat 

cacgcgtccg. 

agatggatgg 

tgttgagtgg 

cctggcagga 

ctctccaaga 

scaggaagct 

taaacggaag 

ctgaggaccc 

tgtccgtggg 

gaggtcagcc 

tgccctttgt 

tgtttttgtt 

aatttgtatt 

aaaaaaaaaa 



agggaaagct 
ctgaatttag 
tggcccaggt 
aggcaacttt 
agcaggactc 
caggagtggg 
ggggctgccc 
cagggytctc 
caccccccta 
tcctcagccc 
agctcctagc 
gatttcagtc 
tgtttggttt 
tatttatttt 



ggtacgcctg 
gagacttttt 
gcattttcca 
gctctgtgtg 
ccatcctcac 
cagccctggg 
tccagaaagc 
cagttccact 
ccccttcccc 
agggtgagct 
agagaagagc 
ttggtagact 
ttgttttgag 
aacattttac 



caggtaccgg 
acccaggggc 
gggcctgggt 
aacctcgccc 
cctgacttag 
cagtcttctg 
cggggtaggr 

tggcgccccc 

agcagccttt 
gcagtcaggc 
cagccagacc 
tgtatttgga 

ggggtggggg 

taaataaagc 



tccggaattc 
aaaaggctct 
tctccagatc 
ctgtccctct 
actgtcctct 
gccccttgct 
actctgaaaa 
tcccacaagg 
ggaccctcac 
gggatgggac 
ccaaccctgt 
gttttgtgct 
gggatacaga 
caaataaagc 



aaaaaaaagg gcggccgc 



cgggtcgacc 60 

tagggtaatg 120 

ccgtggcttc 180 

gccgggcacc 240 

gag t cage tc 300 

aaagtgaggg 360 

atacctcctc 420 

cccttcctcc 480 

ctctctccgg 540 

gggcaggeca 600 

ctcttgtcca 660 

tcaaagtttt 720 

gcagctgatc 780 

ctcaaaaaaa 840 

868 



<210> 185 

<211> 1502 

<212> DNA 

<213> Homo sapiens 



<400> 185 

gctgattacc tttatgttgg tttctcttat tatttgtctc ttgetagate tgctaaacca 
acccagcttg ctcagagatc tcatattgaa gcaacataca ggcaatccac atctttcttt 
ccctttgaag tatagtcatt ggatgggatg agggacaggg cctgttgggt tcacagggcc 



60 
120 
180 
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ttgcactgca tgggcacata cttaaaagct cttgtgcatg gaatccctgt ctgttagcca 240 

caggcctctt tagctctata cattcaaaat aactactgta gtagaaaata gataagcttc 300 

agctgagttg gcttttgata gtggaaaaaa aacaaaattt gactttttat ggccaaaatt 360 

ccttgttgac agctgtgatg ttctaatatg atttgggaat atgtcagtct acagaacctg 420 

catcctgtaa aaacaccttt ggggtagacg ataaaagtca tttttaaggc aaatacttac 480 

catgtgactt tttattacca aatgcatcag tagtggagct ggtatgttgt ttcataggat 540 

ggaaacatta gaagtccaga gaaaaataaa ttttaaaaaa aggtggaaaa gttacggcaa 600 

acctgagatt tcagcataaa atctttagta tgaagtgaga gaaagaagag ggaggctggt 660 

tctgttgctc gtatcaatag gttatctgtg tccctcatct tggtgttaca gtgttatttc 720 

tgtcagtatt atgaatatgt ggttgaccca tcctgtcaaa tgtaccaaca ttttcgaaag 780 

aattcattca aatctcttat gccaacagaa aagttccttc ttgtttaata tctctttacc 840 

tcagtcctac attttgattc tctggaggag attttagctt gtcttaaaaa gccaaatttg 900 

gagtcatcaa gcctgctgaa cctgatgggg cagctttttg aacagctttc tggaagtaag 960 

aacttcagtt gaaaagccct ttgatcgctt cagcccggga catgcccttc agatggctta 1020 

ttctcagtaa agctttatgt agactgtgac actgtatatg tgtgactcgt acaactttga 1080 

cgtgtttctg aagtggttta atcgtatttg ttattagctt ctttgtggaa atgcaatttt 1140 

tatactaaaa acattgctta tttgcaatgc aatatgttat aaatttgttg tttatattac 1200 

tggtattagt cttagcctaa tgaacctaat tatttttctt tctgtattct ttgcttcctc 1260 

aaatagcatc tgcagcaatt ggaatgagaa atccagatat gtgtttcaag tagtacattg 1320 

cctgaatcac aaatcacttg atcacagtat tgtatataat ccctgatcct atttgtttca 1380 

ttttattgta aattcccatt tgcatcaaaa cctaatgata gtgattggta agtaaaaaca 1440 

aatggtgtat tgcttttcat aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaactcg 1500 

ag 1502 

<210> 186 
<211> 3308 
<212> DNA 

<213> Homo sapiens 
<400> 186 

ccacgcgtcc ggcccagggc tgtctgtctc caaagcccaa ccataactca catccccatt 60 

ccagctcctc tgggtgagtc tgttccccct cagcctcact ttccttatcc tgtcaaatga 120 

aggatttgga atgacttaag ttattcaagc aacaaacact tactgaattg tcttgccact 180 

tccagggtga cattatggag ttctgtgatt ctgcaagagg ccagagggga caaggtcaag 240 

tgggtgttca cctggcccct catcttcctc ctgtgcgtca ccattcccaa ctgcagcaag 300 

ccccgctggg agaagttctt catggtcacc ttcatcaccg ccacgctgtg gatcgctgtg 360 

ttctcctaca tcatggtgtg gctggtgact attatcggat acacacttgg gatcccggat 420 

gtcatcatgg gcattacttt cctggcagca ggacaagtgt ccagactgca tggccagcct 480 

aattgtggcg agacaaggcc ttggggacat ggcagtctcc aacaccatag aagcaacgtg 540 

tttgacatcc tggtaggact tggtgtaccg tggggcctgc agaccatggt tgttaattat 600 

ggatcaacag tgaagatcaa cagccggggg ctggtctatt ccgtggtcct gttgctgggc 660 

tctgtcgctc tcaccgtcct cggcatccac ctaaacaagt ggcgactgga ccggaagctg 720 

ggtgtctacg tgctggttct ctacgccatc ttcttgtgct tctccataat gatagagttt 780 

aacgtcttta ccttcgtcaa cttgccgatg tgccgggaag acgattagcg ctgagtcgcg 840 

gcccctggga gctgatctgg acaccctgtg acactggcgt cctcctctcc cctccttccc 900 

ccaccacagg tctctcctgc ataggcagcc actgtccgtt ctttcacaca ctggaaggaa 960 

gagccatcgt ggtctttgtc tggccacagc caagctgctg ggcatcctcc tcctccttgg 1020 

agttccaccc ctgcaaggct ggatttgggg gccattatct gagcagcttc aaagacccct 1080 

gagctgccaa ccacggagat gtgccaagca tctcatctct cctgcacact ttagtcagaa 1140 

ggacttctgc atgcagtttg tctttctgtt ctgcaggcag cttcagaatt gaggtcattt 1200 

gtgagcacaa gatctcatag ggcaggtgca aaataggaat gttgttctca agtgtcacct 1260 

ccagcccaga ggtggttcct taggcagcat gtgctcctgg gagcctctga cttttgctgg 1320 

aagcacccac agtttggaag gggcaagacc tcaacctgtt ggggtttagg gcccatgatg 1380 

gcagacattc tacccctttt cctggaaaaa ctggaagaat gaaaataatt tttttctgtg 1440 

gaagagagaa aatgagtgaa tattcttctc acttttattg atgcattcag agaataagca 1500 

atgaaatatt aaaaaatgaa acatcatata ggtcatcata cttgaaaatt atcattccat 1560 

atgaaaggat catgatacac accaaaaaag taatgatcgt aaagacacaa atcctctgta 1620 

tgccatcttg cattggcact gaggtgtttg gtttggaata gggaaaaaga gacaggatct 1680 

cgctgtgttc cccaggtagg tcttgaactc ctggcctcaa gtgatcctcc tgccttgacc 1740 

tcccaaagtg ctggattaca agcgtgagcc cctgcacccg gcccaagcag ttgcttcttt 1800 

ttttctcttt tttttttttt ttgagatgga gcctcactct gttgcccagg ctggagtgca 1860 
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gtggcgcgat ctccactcac tgcaagctcc gcctcccggg ttcatgccat tctcctgcct 1920 

cagcctcccg agtagctggg actacaggcg cctgccacca cacccagcta attttttgta 1980 

tttttggtac agacagggtt tcaccgtgtt agccaggatg gtcttgatct ctgatctcgg 2040 

atccgccacc ccggcctcca aagtgctgga ttacaagcgt gagccaccgg gccccgccaa 2100 

gcagttgctt cttatgcaac atgttgggtg ggacttgtcc acgggccagg ccaataaaat 2160 

tcttaatcct gcagagaggc agtaccctca tcaccccatc actggaaaac aaatgtttaa 2220 

gctatcaaga gagggaatgt gcagcttggt tctagatgca tggtttggag gatctacctt 2280 

tggcctaaag ggaatgtccc aaacaacaga gccttctttg ctgtcactcc agaattctct 2340 

acacagaatt tcccaagtcc attcaggaca gacgcgcagt cctctttcaa tggaagaaga 2400 

gaggactttt cccctcctga aaaatgactg gagtgtgaac aaggcagctc tgtttttcta 2460 

aataagttgt tcttgtgagt tttttctggc cactgggcat ctctgccctc acttttcatc 2520 

cctgccctct aagctgcaga ccccatgacc acactgtctg cttccttgag cttcccgcac 2580 

gaggcttgca cctgggggac ctggagaccc tgcggacaga actgtggctg agccactgtg 2640 

gccaactctt ggggagctcc acagtggggg ttgctggtct gtgaggctga gtctccattt 2700 

cagagcacac actccctggc agggcgcctc cgcctgtgtc tcctgcccag cagccgccag 2760 

cagggaatag ttgctggtgt ctgagcacaa agagagcttt gattacctag agaggaaaaa 2820 

ggctgtcagc cagatgcagc caggcccagg ggtagataca ggagttgcta aggaaggggc 2880 

cgagccagga gaggccaggc agatccacaa agcccaaggg gatgcaggct gggtgtggtt 2940 

tctgagggaa cctaccaaat agcaggtaga tggaatcaga ggactcttgt gtcctgaaag 3000 

aacctcctta aaaacaacta aaaccaagaa cttctggggc tgttcacaca ttgttcaagt 3060 

caccccaaga tcgttctggc acgctgagct gaacaccacc atctttgttc attctctctc 3120 

taatgggcaa agcaggatca tcgagttgaa aagttgtaaa taatgaggat atttatcccg 3180 

ctatttattt tttcaataac tgtgacctcc tgcactgtga atgctctgtg acatgagatt 3240 

cttagtttaa taaaactgtc attaaatttg aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3300 

aaaaaaaa 3308 

<210> 187 
<211> 1769 
<212> DNA 

<213> Homo sapiens 
<400> 187 

agaaaaattg cagggaccca ccccagactt gtgagtgcga gtgaagcagg agcagccctg 60 

gccatcactg tttctttgac gtgtacatcc catcctgaga tgcagctggg ctgggagccg 120 

ccacctgggt ggatctgatt cctggatttc cccatcctgg ggasaggtga cccatcctgt 180 

tctcctcctt aggtccatgt gaaatctgar gtccttgctg tcaagttgtc acaagaaata 240 

aactacgcaa agagcctcta ctatgaacag cagcttatgt taagactcag cgaaaaccga 300 

gagcagctgg agctggactc ctgaagcccc gctgctgaga tgggcgctcc. cgacacagcg 360 

cagacccacc aggaggaaag aggcccagct ctcagctgac gatggaggca gaaccggagt 420 

cgggtttggg gaagttgtca aggaatgagg gaaagtaaat cctcatgagg aaaagtacaa 480 

atggaaatcg tattaatttg tgaggcaggg agttatttta gattatggga aataattttt 540 

aaaggtattg gttaaataac gtttaaaaac atgtactgag atgaatctaa tttttagatt 600 

gccctgtatt ttgttaacat gtatatatgt acaacagtgt gtttgtaaat atataggaac 660 

gtttctgaac agggtctgtg ctatgtgtaa aggtttgtta actgtaaagt aatataaagt 720 

tatattggat cttctattgc actaattcta gatgtctaat tcaggatact gtctatagaa 780 

aggcattctt aaaagttaaa gaatgttacg tcttagtttt ggagactaaa gtattcccag 840 

taaagtgggt tgaggtgagg gctgtggtcc tgaaagggac gcctttgaca tcgtggctgt 900 

ccagttgggc tgtgagctgt ggcacccagg actggcgctg gcccttcaga aggatctagg 960 

agaggggctt gggagcccac ttttaatttc tcacccccat tttacaaaga gtgcttagat 1020 

tcttacaaat tatgatgtaa gttatccatt tggctttttc ctaactagtc ttaccaaact 1080 

tagggggaaa cctgtgctcc attaccacat gggtgcaagt cagcattgta agttttctca 1140 

ggttattatt attagagagg ttggaaacat tggtaaactc tgttgattga gaaggaaaaa 1200 

aaaagtccca ttgaactgtt gcaacaaatc agaaatccac ataaaagtgc tctcctgcct 1260 

gggcagcaac aaccaagaac aaagccccgg gactgttttc tttttaataa agccacaggc 1320 

aggcatcgta gctccacagc ccgaggggac acaggatgga aaccccagga tgagaaggga 1380 

gcagggagag ttccagaaag ggggatgaaa taggagtatt aaaaagctgc gttggtaagt 1440 

ttttcatgga accaagattt gacaaaggca tctcttatcc ttggttttaa attcctgctg 1500 

ggagcaaggc ctggtatgag cgccctgggt cttgtttttg gtgtttcgct tttctgtaag 1560 

gattaagcag atagggagaa gggaaaaggg gcctcacttt agaatgaatg agtcaccttg 1620 

tgatttttaa atttttattt taataaagct aatcaatttc taaaaaaaaa aaaaaaaaaa 1680 

aaaaaaaggg cggccgctct agaggatccc tcgaggggcc caagcttacc gtgcatgcga 1740 



109 



WO 02/076488 PCT/USO 2/082 76 



cggtatagct ctctcctata gtgagccta 1769 

<210> 188 
<211> 1677 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (537) . . (537) 
<223> n equals a,t,g, or c 

<400> 188 

ccccccgggt cgacccacgc gtccggactt tttttatttt agctattcca gggggtttga 60 

agtagtactg cattgtgatt ttaatttgca tttccctgat gattaatagt gataagcata 120 

ttttcatata atatttacca tttatttatc ttcttgtttg aaatatctgt tcaagtctta 180 

tgcccatttt aatgggacag tttttctgtt tattattgat tcatattatt gacttatagt 240 

aatattttat gaattgatct ctttattatt atgaaatgyt tctttttatt tgtggtaata 300 

ctcatcatca tgaaatctaa tttgtctgat attattatag ccacttatac ttactgtata 360 

cctgattatt ttttccatac ctttatcttc aatttatctg tatatttgaa ttcaaagttc 420 

atctcttgag cctgaaagta ataagcacct tgaacccgga tctttatatc taaatatgat 480 

tctccagtaa aatttatcag ggttctctgg acaagtggct gattgattgt agagcangga 540 

taagaagagt atatgctgaa cctgaatcat ttttgtggtg ccaaaagtaa ggaagtacac 600 

aaaaaaattg agaagatatc awgaaatgca caaaaactaa cctgaagtga ctctyaatga 660 

ccatatctkg gacaatttga gtaaaaaaat aagtaacaat aatggatgat atcttataag 720 

ataatataaa acaaatatca atgagtctat gatgatatag aattagaatg tataaatggg 780 

aggaagggga aaattctttg cttaagcaga ataataatta attaatataa aaagaataat 840 

ggaaatggta gaaatagaaa gtcaacattt ggcaagcacc acagtaataa ttgttgcagg 900 

caagaatcat caatggatgc taaagtttgt gagaaaaagt ttgatgagaa atgggctatt 960 

tacataacct caaaggcttt tcccacaaga tactgttaat tacaaagggg aaaatattga 1020 

gaaacatgga* agacagcacc ttaaccaaat gattgaagtt aataccatta gtaatgcaac 1080 

aaatcagtat cgtgtgcctc ctgatatgat gcactgagaa gggcacaagc atcaccttta 1140 

tggtattctt gcaaaaaatg cataatctaa attcaaccat gaagaaatat tagaaggatg 1200 

ttctatgact agtcagtact gctcaaaaat gtcaagatta tgaaagacaa agaaagacta 1260 

aggtactttc cagatttaaa gaaattaaat agacaggact actaaatgca atatatgagt 1320 

ctgaattgga ccctggacta aaattaggca gacagttggt gaattttgag tcaagtctgt 1380 

agagtagtta atagtatttc tggttttgat catcataata tggttattta agatgttaac 1440 

atttggtgga tctgggtgaa aagtatatga ggattctctc gcagtatttt tgcaattttt 1500 

ttaaatctga aattctttta aaatgagaag ttggctgggc acagtggctc acacctgtaa 1560 

tcccagcact ttgaaacacc aaggcaggag actcgcttga gcccaggagt ttgagaccat 1620 

cctgcgtaag atggcaagac tccatctctt taaaaaaaaa aaaaaaaggg cggccgc 1677 

<210> 189 
<211> 2709 
<212> DNA 

<213> Homo sapiens 
<400> 189 

ggcacgagat ttcctacagg tgaaacgcca tcattaggat tcactgtaac gttagtgcta 60 

ttaaactcac tagcattttt attaatggcc gttatctaca ctaagctata ctgcaacttg 120 

gaaaaagagg acctctcaga aaactcacaa tctagcatga ttaagcatgt cgcttggcta 180 

atcttcacca attgcatctt tttctgccct gtggcgtttt tttcatttgc accattgatc 240 

actgcaatct ctatcagccc cgaaataatg aagtctgtta ctctgatatt ttttccatgc 300 

ctgcttgcct gaatccagtc ctgtatgttt tcttcaaccc aaagtttaaa gaagactgga 360 

agttactgaa gcgacgtgtt accaagaaaa gtggatcagt ttcagtttcc atcagtagcc 420 

aaggtggttg tctggaacag gatttctact acgactgtgg catgtactca catttgcagg 480 

gcaacctgac tgtttgcgac tgctgcgaat cgtttctttt aacaaagcca gtatcatgca 540 

aacacttgat aaaatcacac agctgtcctg cattggcagt ggcttcttgc caaagacctg 600 

agggctactg gtccgactgt ggcacacatt cggcccactc tgattatgca gatgaagaag 660 

attcctttgt ctcagacagt tctgaccagg tgcaggcctg tggacgagcc tgcttctacc 720 

agagtagagg attccctttg gtgcgctatg cttacaatct accaagagtt aaagactgaa 780 
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ctactgtgtg tgtaaccgtt tcccccgtca accaaaatca gtgtttatag agtgaaccct 840 

attctcatct ttcatctggg aagcacttct gtaatcactg cctggtgtca cttagaagaa 900 

ggagaggtgg cagtttattt ctcaaaccag tcattttcaa agaacaggtg cctaaattat 960 

aaattggtga aaaatgcaat gtccaagcaa tgtatgatct gtttgaaaca aatatatgac 1020 

ttgaaaagga tcttaggtgt agtagagcaa tataatgtta gttttttctg atccataaga 1080 

agcaaattta tacctatttg tgtattaagc acaagataaa gaacagctgt taatattttt 1140 

taaaaattct atttttaaaa tgtgattttc tataactgaa gaaaaatatc ttgctaattt 1200 

tacctaatgt ttcatccttt aatctcagga caacttactg cagggccaaa aaagggactg 1260 

tcccagctag acctgtgaga gtatacatag gcattacttt attatgtttt cacttgccat 1320 

ccttgacata agagaactat aaattttgtt taagcaattt ataaatctaa aacctgaaga 1380 

tgtttttaaa acaatattaa cage tgt tag gttaaaaaaa tagctggaca tttgttttca 1440 

gtcattatac attgetttgg tccaatcagt aattttttct taagtgtttt gtgattacac 1500 

tactagaaaa aaagtaaaag getaattget gtgtgggttt agtcgatttg gctaaactac 1560 

taactaatgt gggggtttaa tagtatctga gggatttggt ggcttcatgt aatgttctca 1620 

ttaatgaata cttcctaata tcgttggctc tactaatatt ttccaatttg ctgggatgtc 1680 

acctagcaat agcttggatt atatagaaag taaactgtgg tcaatacttg catttaatta 1740 

gaegaaaegg ggagtaatta tgacacgaag tacttaatgt ttatttctta gtgagctgga 18 00 

ttatcttgaa cctgtgctat taaatggaaa tttccataca tcttccccat actatttttt 1860 

ataaaagagc ctattcaata gctcagaggt tgaactctgg ttaaacaaga taatatgtta 1920 

ttaataaaaa tagaagaaga aagaataaag cttagtcctg tgtcttttaa aaattaaaaa 1980 

ttttacttga ttccccatct atgggcttta gacctattac tgggtggagt cttaaagtta 2040 

taattgttca atatgttttt tgaacagtgt gctaaatcaa tagcaaaccc actgecatat 2100 

tagttattct gaatatacta aaaaaatcca getagattge agtttaataa ttaaactgta 2160 

catactgtgc atataatgaa tttttatctt atgtaaatta tttttagaac acaagttggg 2220 

aaatgtggct tctgttcatt tegtttaatt aaagctacct cctaaactat agtggctgcc 2280 

agtagcagac tgttaaattg tggtttatat actttttgea ttgtaaatag tctttgttgt 2340 

acattgtcag tgtaataaaa acagaatctt tgtatatcaa aatcatgtag tttgtataaa 2400 

atgtgggaag gatttattta cagtgtgttg taattttgta aggecaacta tttacaagtt 2460 

ttaaaaattg ctatcatgta tatttacaca tctgataaat attaaatcat aacttggtaa 2520 

gaaactccta attaaaaggt tttttccaaa attcaggtta ttgaaaactt ttcafctttat 2580 

tcatttaaaa actagaataa cagatatata aaagtgttaa tctttgtgct atatggtatg 2640 

aaatacaata ttgtactcag tgttttgaat tattaaagtt tctagaaagc aaaaaaaaaa 2700 

aaaaaaaaa 2709 

<210> 190 
<211> 813 
<212> DNA 

<213> Homo sapiens 
<400> 190 

ggcacgagat cattttctgt cccctcctat cttaggctga ccggttccct gatgtgttac 60 

ctgcttctgc tactgatcca aactgeagaa cttctcattc atccccaagg cctccaggca 120 

gtatccaatg gggaatcagc tctaaaagga accagaccaa cgttttccag ccccttcatt 180 

ctggtgactg aggggaggaa agaatgggag ggggtattct tgtctagtgg atggaaagga 240 

aacacactgt caaattacta tatctccttg gttttctatt acagtagaat tctccagcca 300 

tatttttatt gtctatgggg gaagttggag atggtgacct tgattagaag tgtctggagg 360 

gggataaatg gaggggataa gatttcagtt ggttttggaa aatgttaaag tcttaaaata 420 

atgcgtccca tctgaagaat tttttctaaa accagagttt ataaaaatat cactgataca 480 

gcctgccccc tcatttccct gecacaggag atgtcttgga ctagagacac ttgtttaata 540 

atagcttgtc tctgatattc ccagtagctt ccctctgtgt gaggaaagga tagaaatgtt 600 

caggacatca tcatacaggc tcctcatcta caaagttcca gtagcagtga cgcctacacg 660 

gaagacttgg aactgeaaac aggctggggt cacctcagtg acatctgacg ctgtccaacc 720 

agaagttcga tttttgttct gggggtgaag gaggaaacag actgtactaa aggactaaaa 780 

taatttgtct atactaaaaa aaaaaaaaaa aaa 813 

<210> 191 
<211> 2288 
<212> DNA 

<213> Homo sapiens 
<400> 191 
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ccacgcgtcc gggggctgca aggacctgag ctcagcttcc gccccagcca gggaagcggc 60 

aggggaaagc accggctcca ggccagcgtg ggccgctctc tcgctcggtg cccgccgcca 120 

tgtgggccgt cctgaggtta gccctgcggc cgtgtgcccg cgcctctccc gccgggccgc 180 

gcgcctatca cggggactcg gtggcctcgc tgggcaccca gccggacttg ggctctgccc 240 

tctaccagga gaactacaag cagatgaaag cactagtaaa tcagctccat gaacgagtgg 300 

agcatataaa actaggaggt ggtgagaaag cccgagcact tcacatatca agaggaaaac 360 

tattgcccag agaaagaatt gacaatctca tagacccagg gtctccattt ctggaattat 420 

cccagtttgc aggttaccag ttatatgaca atgaggaggt gccaggaggt ggcattatta 480 

caggcattgg aagagtatca ggagtagaat gcatgattat tgccaatgat gccaccgtca 540 

• aaggaggtgc ctactaccca gtgactgtga aaaaacaatt acgggcccaa gaaattgcca 600 

tgcaaacagg ctcccctgca tctacttagt tgattcggga ggagcatact tacctcgaca 660 

agcagatgtg tttccagatc gagaccactt tggccgtaca ttctataatc aggcaattat 720 

gtcttctaaa aatattgcac agatcgcagt ggtcatgggc tcctgcaccg caggaggagc 780 

ctatgtgcct gccatggctg atgaaaacat cattgtacgc aagcagggta ccattttctt 840 

ggcaggaccc cccttggtta aagcggcaac tggggaagaa gtatctgctg aggatcttgg 900 

aggtgctgat cttcattgca gaaagtctgg agtaagtgac cactgggctt tggatgatca 960 

tcatgccctt cacttaacta ggaaggttgt gaggaatcta aattatcaga agaaattgga 1020 

tgtcaccatt gaaccttctg aagagccttt atttcctgct gatgaattgt atggaatagt 1080 

tggtgctaac cttaagagga gctttgatgt ccgagaggtc attgctagaa tcgtggatgg 1140 

aagcagattc actgagttca aagcctttta tggagacaca ttagttacag gatttgctcg 1200 

aatatttggg tacccagtag gtatcgttgg aaacaacgga gttctctttt ctgaatctgc 1260 

aaaaaagggt actcactttg tccagttatg ctgccaaaga aatattcctc tgctgttcct 1320 

tcaaaacatt actggattta tggttggtag agagtatgaa gctgaaggaa ttgccaagga 1380 

tggtgccaag atggtggccg ctgtggcctg tgcccaagtg cctaagataa ccctcatcat 1440 

tgggggctcc tatggagccg gaaactatgg gatgtgtggc agagcgtata gcccaagatt 1500 

tctctacatt tggccaaatg ctcgtatctc agtgatggga ggagagcagg cagccaatgt 1560 

gttggccacg ataacaaagg accaaagagc ccgggaagga aagcagttct ccagtgctga 1620 

tgaagcggct ttaaaagagc ccatcattaa gaagtttgaa gaggaaggaa acccttacta 1680 

ttccagcgca agggtatggg atgatgggat cattgatcca gcagacacca gactggtctt 1740 

gggtctcagt tttagtgcag ccctcaacgc accaatagag aagactgact tcggtatctt 1800 

caggatgtaa ctggaataaa ggatgttttc tgttggacat gtactgaaaa ttaacacatg 1860 

tagtagcctt aaaattttag acttctcgaa catgaggctg ttacagtaat ttttttaaca 1920 

ctgtgcattg tacttttcta ccttaaaaaa atcagtgagg atatttattt aatgaacatc 1980 

aattcctttt aaattttctt agagaaattt ctctgtggct cagttttacc acccataaag 2040 

cggagacagt aatttatggt atcctttctg acccacaaag tatgaaaagt tctgtaatct 2100 

gtaaactcag ttctgtaatc tgtattattg agatgattaa tataaagttg tattttcact 2160 

gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2280 

aaaaaaaa 2288 



<210> 192 

<211> 3758 

<212> DNA 

<213> Homo sapiens 



<400> 192 

ccacgcgtcc gctttttctc aggatgaata ttttcctggc cgactcattg atccttggta 60 

caaataaact tctggaagac ccagagagag gaaaacacag gagaaattga gcgatgtacg 120 

tacatcaaat accactactc ctcagcaacc atccccagga acctcacttt caatatcacg 180 

aagaccatcc gtcaggatga gtggcatgcc ctacacctgc gcagaatgac ggctggcttc 240 

atgggcatgg cggtggccat catcctcttt ggctggatca tcggcgtgct gggctgctgc 300 

tgggaccgag gccttatgca gtacgtggca ggctgctctt cctcatggga gggaaaacag 360 

tggaattaaa gagtgtctgc cccagcccgg cagggtgaag taggatgggg aaaacgttct 420 

caccagaccc tgggacttct atgctgcagc atcgcgacct gaggggtgga tgcagttgcc 480 

acagctcttt gaggcaaagg ccccgatgct ctgtggacag cctcaggctt gggatggatt 540 

tggcagtgag gaacttattg taacagaaga aagtcatcca agatgcctga ggaaagaaac 600 

cttcaattga gccagccggc tggaaaatgt ggccaagaaa accgcagaga ccaatgttcg 660 

gaggagaaaa ccagaaagag gggcctgcct ggcccctttg atcctttatg gccgattccg 720 

tggacattgc tgctcctcac gccggcagcc ctctcttgag tacctcaatt gcagtctcca 780 

gaccctcacc ccgcaggcat tcctgggtcg gtgtcccagt cggtcacagt catggatcct 840 

ctgcagagca gtagaaagtc gggaggggcc cgtgcccatg gtcaggaaag gagcggcagg 900 
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aggaaagagg agcatgagaa ctcagaagaa attgtaccta ctcagaaggt ggagtgagga 960 

tagacgttcc cagattcaaa ggcatcatga agtgtcatga caagatagaa aagactttgg 1020 

gctggccaag aaggaactgg ataaaattat gagtgaggta cagcaggtgg gaacagtgtc 1080 

actgaaccct atcaacagca gagcatgaga acgtgaattc ctgctgctgg ggaggcaatg 1140 

aaatgatatg ggccttcaga tgtctatgaa tcctgaccca ccgtgggtgc cagttttcaa 1200 

gagggcttcc catcaaatat tgtgcgcaaa ggatggatgg atgaaaggaa gagtgagcca 1260 

ataaacgagg gaacgccggg aaaggcagcc tcaagccggt gggccctggc acccccaccg 1320 

tccctgagca tcgagccggt tcccgccccg gcccgaactg gcccgcgcgc gctcgcagcc 1380 

ccgcggcgga acccgagggc ggcggcagcg gttccttgaa cgagccgggg aatctggagg 1440 

gagcacacag gaaaggcaga gccgcgagct ggaccagcgg caaatctcta gaagatgacg 1500 

ggttctttaa aacgcttcga aatcactgga agaaaactac agctgggctc tgcctgctga 1560 

cctggggagg ccattggctc tatggaaaac actgtgataa cctcctaagg agagcagcct 1620 

gtcaagaagc tcaggtgttt ggcaatcaac tcattcctcc caatgcacaa gtgaagaagg 1680 

ccactgtttt ctcaatcctg cagcttgcaa aggaaaagcc aggactctat ttgaaaaaaa 1740 

tgctgcccga ttttacattt atctggcatg gatgtgacta ttgtaagaca gattatgagg 1800 

gacaagccaa gaaactcctg gaactgatgg aaaacacgga tgtgatcatt gttgcaggag 1860 

gagatgggac actgcaggag gttgttactg gtgttcttcg acgaacagat gaggctacct 1920 

tcagtaagat tcccattgga tttatcccac tgggagagac cagtagtttg agtcataccc 1980 

tctttgccga aagtggaaac aaagtccaac atattactga tgccacactt gccattgtga 2040 

aaggagagac agttccactt gatgtcttgc agatcaaggg tgaaaaggaa cagcctgtat 2100 

ttgcaatgac cggccttcga tggggatctt tcagagatgc tggcgtcaaa gttagcaagt 2160 

actggtatct tgggcctcta aaaatcaaag cagcccactt tttcagcact cttaaggagt 2220 

ggcctcagac tcatcaagcc tctatctcat acacgggacc tacagagaga cctcccaatg 2280 

aaceagagga gacccctgta caaaggcctt ctttgtacag gagaatatta cgaaggcttg 2340 

cgtcctactg ggcacaacca caggatgccc tttcccaaga ggtgagcccg gaggtctgga 2400 

aagatgtgca gctgtccacc attgaactgt ccatcacaac acggaataat cagcttgacc 2460 

cgacaagcaa agaagattfct ctgaatatct gcattgaacc tgacaccatc agcaaaggag 2520 

actttataac tataggaagt cgaaaggtga gaaaccccaa gctgcacgtg gagggcacgg 2580 

agtgtctcca agccagccag tgcactttgc ttatcccgga gggagcaggg ggctctttta 2640 

gcattgacag tgaggagtat gaagcgatgc ctgtggaggt gaaactgctc cccaggaagc 2700 

tgcagttctt ctgtgatcct aggaagagag aacagatgct cacaagcccc acccagtgag 2760 

cagcagaaga caagcactct gagaccacac tttaggccac cggtgggacc aaaagggaac 2820 

aggtgcctca gccatcccaa cagtgtcgtc agagggtccc cagggcattt tcatggcaag 2880 

tacccctctg cccccactcc agcagtgctt cccaaagtgt gctctgtcac ctgctttgca 2940 

atcggcttcc attagcgcat gttttatttt ggtgtgacgg ttggccctcc taaacacgga 3000 

ctttcctcag gctggttcaa gacggaaaag gactttcttc tgttttcttc caaagtgcaa 3060 

ccacagtgga gagcccacgg tgggcttagc ctgcctaggc ccttccattt ctcttctttg 3120 

accgtgctag gaattccagg aaagtgcatt cctgccctgg tgaccttttc ctatgtctag 3180 

gctcctccac aggtgctgct attttgtgag ctccggctcc tgtttagctt ttatttcagt 3240 

tctaacctca gtccagaaac atatgtgagg ttgtttccct cttcagccac ggctacaata 3300 

ccggaaaatg ctagttttta tttatttttt taagtagtgc ttcctaaatg gtttgcatga 3360 

gagccacctg gggtacatgt tgaaaactta tttggggtct accccaaacc taataaccca 3420 

aatttgggga tggggcccag gaatatgcat ttttaaaaag tcatctgccc ttcccaggtg 3480 

attctgtaag ttgtccctca actgtacttg gagaaatcgt gttttaaagc agtagtccac 3540 

aaagtattct gctcatgtgc ccccaaaagt attttgaaaa atcatgtata ccctcaccca 3600 

tctaagttga tatctaaaat tttatctaag ttggtatcta aaatttttca tgggaagtta 3660 

aatagttgac aaagtatgta tttgctggtg tcgtgtaaat attggtattt taaaataaaa 3720 

actgttacat cactaaaaaa aaaaaaaaaa aaaaaaaa 3758 

<210> 193 
<211> 1534 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<222> (1212) (1212) 

<223> n equals a,t,g, or c 

<400> 193 

tcgaagatag gttcgagacg gtggatgttg cage tga tea tgcagttggg ttcggtgctg 60 
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ctcacacgct gccccttttg gggctgcttc agccagctca tgctgtacgc tgagagggct 120 

gaggcacgcc ggaagcccga catcccagtg ccttacctgt atttcgacat gggggcagcc 180 

gtgctgtgcg ctagtttcat gtcctttggc gtgaagcggc gctggttcgc gctgggggcc 240 

gcactccaat tggccattag cacctacgcc gcctacatcg ggggctacgt ccactacggg 300 

gactggctga aggtccgtat gtactcgcgc acagttgcca tcatcggcgg ctttcttgtg 360 

ttggccagcg gtgctgggga gctgtaccgc cggaaacctc gcagccgctc cctgcagtcc 420 

accggccagg tgttcctggg tatctacctc atctgtgtgg cctactcact gcagcacagc 480 

aaggaggacc ggctggcgta tctgaaccat ctcccaggag gggagctgat gatccagctg 540 

ttcttcgtgc tgtatggcat cctggcccct ggcctttctg tcaggctact acgtgaccct 600 

cgctgcccag atcctggctg tactgctgcc ccctgtcatg ctgctcattg atggcaatgt 660 

tgcttactgg cacaacacgc ggcgtgttga gttctggaac cagatgaagc tccttggaga 720 

gagtgtgggc atcttcggaa ctgctgtcat cctggccact gatggctgag ttttatggca 780 

agaggctgag atgggcacag ggagccactg agggtcaccc tgccttcctc cttgctggcc 840 

cagctgctgt ttatttatgc tttttggtct gtttgtttga tcttttgctt ttttaaaatt 900 

gttttttgca gttaagaggc agctcatttg tccaaatttc tgggcttcag cgcttgggag 960 

ggcaggaacc ctggcactaa tgctgtacaa ggtttttttc ctgttaggaa gaacttgagg 1020 

ccagctgccc actgagtctt ctgtccctga agaaagggag tattgggcag ggcttgggat 1080 

ccggctactg agagtgggag agtgggagac agaggaagga agatggagat tggaagtgag 1140 

caaatgtgaa aaattcctct ttgaacctgg cagatgcagc taaactctgc agtagtgttt 1200 

ggagactgtg anagggagtg tgtgtgttga cacatgtgga tcaggcccag gaagggcaca 1260 

ggggctgagc actacagaag tcacatgggt tctcagggta tgccaggggc agaaacagta 1320 

ccggctctct gtcactcacc ttgagagtag agcagaccct gttctgctct gggctgtgaa 1380 

ggggtggagc aggcagtggc cagctttgcc cttcctgctg tctctgtttc tagctccatg 1440 

gttggcctgg tgggggtgga gttccctccc aaacaccaga ccacacagtc ctccaaaaat 1500 

aaacatttta tatagacaaa aaaaaaaaaa aaaa 1534 

<210> 194 

<211> 2664 

<212> DNA 

<213> Homo sapiens 

<400> 194 

ggttgctggc ccaggtgagc gggcgcgctg gtccaggtga gcgggcgcgt ccccgcgacg 60 

gcgctgcctg cccgaggcgg ttcacgtaaa gacagcgaga tcctgagggc cagccgggaa 120 

ggaggcgtgg atatggagct ggctgctgcc aagtccgggg cccgcgccgc tgcctagcgc 180 

gtcctgggga ctctgtgggg acgcgccccg cgccgcggct cggggacccg tagagcccgg 240 

cgctgcgcgc atggccctgc tctcgcgccc cgcgctcacc ctcctgctcc tcctcatggc 300 

cgctgttgtc aggtgccagg agcaggccca gaccaccgac tggagagcca ccctgaagac 360 

catccggaac ggcgttcata agatagacac gtacctgaac gccgccttgg acctcctggg 420 

aggcgaggac ggtctctgcc agtataaatg catgacggat ctaagccttt cccacgttat 480 

ggttataaac cctccccacc gaatggatgt ggctctccac tgtttggtgt tcatcttaac 540 

attggtatcc cttccctgac aaagtgttgc aaccaacacg acaggtgcta tgaracctgt 600 

ggcaaaagca agaatgactg tgatgaagaa ttccagtatt gcctctccaa gatctgccga 660 

gatgtacaga aaacactagg actaactcag catgttcagg catgtgaaac aacagtggag 720 

ctcttgtttg acagtgttat acatttaggt tgtaaaccat atctggacag ccaacgagcc 780 

gcatgcaggt gtcattatga agaaaaaact gatctttaaa ggagatgccg acagctagtg 840 

acagatgaag atggaagaac ataacctttg acaaataact aatgttttta caacataaaa 900 

ctgtcttatt tttgtgaaag gattattttg agaccttaaa ataatttata tcttgatgtt 960 

aaaacctcaa agcaaaaaaa gtgagggaga tagtgagggg agggcacgct tgtcttctca 1020 

ggtatcttcc ccagcattgc tcccttactt agtatgccaa atgtcttgac caatatcaaa 1080 

aacaagtgct tgtttagcgg agaattttga aaagaggaat atataactca attttcacaa 1140 

ccacatttac caaaaaaaga gatcaaatat aaaattcatc ataatgtctg ttcaacatta 1200 

tcttatttgg aaaatgggga aattatcact tacaagtatt tgtttactat gaaattttaa 1260 

atacacattt atgcctagaa ggaacggact ttttttttct attttaatta cacataatat 1320 

gtaattaaag tacaacataa tatgttgttt ctctgtagcc cgttgagcat atgagtaagt 1380 

cacatttcta ttaggactac ttmcaaggac aaggtttcca tttttccagt tgtaaaattg 1440 

gaaccatcag ctgataacct cgtagggagc aaccccagga tagctaagtg ttatgtaata 1500 

tgcctagaag gtgatgtgaa tgcgattcag aagcatagcc actcccattt tatgagctac 1560 

tcacatgaca aatgtcatct tttgctataa cctttgccaa gttagagaaa agatggattt 1620 

aatgagataa atgaaaagat atttamccta atatatcaag gcactatttg ctgttatgct 1680 

ttgttattta tttcccagca cttgttcctt attgtagatt ttttaaagac tgtaaccttt 1740 
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tactaactgt ggtcttacta aaatttgtgc ttgatactgc ttttcaaaaa gcctttaatt 1800 

agagccaaaa ggatggaaaa ggcaagatat aaatgccttt tatagatctc ttatttacat 1860 

tgaaaattat taccatatgt ttagagcaaa tccaagaaaa cttcaacagc ttctgaagat 1920 

gtctatgaat gttgaaaact tttcaatctc ttggaatgct cagttatgtt cctagaccgg 1980 

tctttgctga ctactggttg ttaacctttc cctagcctgg gacctcaagc catatatatc 2040 

ctttgggtga cccatggcca aagttattaa gatgaactga ctttcaaagt cagagaagga 2100 

cagcataggg agaggcggtt atttgtaagt cattacaggt agaacagggc agaaggaaaa 2160 

gtatgttctg gagaaagggc catgttccta actttggaga tatgtcattg ccgggaacct 2220 

agtatcttcc aacttgaatt ggtggcagct gttccagtga gacaaggcac atgtatgcct 2280 

tgtggctaag tgagcaaact gggtttccac ttaaatgttt gggaccctca attgattctt 2340 

tatttcaaac ctttataaaa ggtacagttt tgtaagccat tattaataat taatgcttat 2400 

cggctgggca cagtggctca cacctataat cccagcactt gggaggctga ggcggttgga 2460 

tcacttgagg tcaggagttt gagaccagct ggccaacatg gtgaaacagc gtctctacta 2520 

aaaatacaaa aatttgccgg gcgtggtggc gcatgcttat agtctcagct actcaggaag 2580 

ctgaggtacg agaatcactt gaacccagga ggtggaggtt gcagtgagct gagattgtgc 2640 

cactgcactg cagcctggct cgag 2664 

<210> 195 
<211> 1076 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<222> (1029) . . (1029) 
<223> n equals a,t,g, or c 

<220> i 
<221> mi sc__f eature 
<222> (1037) . . (1037) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (1040) . . (1040) 
<223> n equals a,t,g, or c 

<400> 195 

ggcacgaggg gaaagccatg ctcccaggac tccttccttg cagccttaaa tcggtctgta 60 

cggaaaattc cgcgccttag aaacccacgc ttgggtgtaa cttattattg ttcttcctga 120 

cctacttcct gtttatcact tccgggttca tcattttggc atttcggtga tcgggttgga 180 

actattgaag cccgctttca ggttcttttc cccattttcc ctttgaaagg aagacttctg 240 

gcttctccta aatctccgtt ctctgggtaa ggggagtcca agcctctgtc atgaggaacg 300 

gaaatgcgag ggcctcgggt gttactctaa aatccgccct cagcttgcac gccggaagct 360 

gcgattcctg cagcggaaga ggcgtgatct ggccttcgac tcgctatgtc cactaacaat 420 

atgtcggacc cacggaggcc gaacaaagtg ctgaggtaca agcccccgcc gagcgaatgt 480 

aacccggcct tggacgaccc gacgccggac tacatgaacc tgctgggcat gatcttcagc 540 

atgtgcggcc tcatgcttaa gctgaagtgg tgtgcttggg tcgctgtcta ctgctccttc 600 

atcagctttg ccaactctcg gagctcggag gacacgaagc aaatgatgag tagcttcatg 660 

ctgtccatct ctgccgtggt gatgtcctat ctgcagaatc ctcagcccat gacgccccca 720 

tggtgatacc agcctagaag ggtcacattt tggaccctgt ctatccacta ggcctgggct 780 

ttggctgcta aacctgctgc cttcagctgc catcctggac ttccctgaat gaggccgtct 840 

cggtgccccc agctggatag agggaacctg gccctttcct agggaacacc ctaggcttac 900 

ccctcctgcc tcccttcccc tgcctgctgc tgggggagat gctgtccatg tttctagggg 960 

tattcatttg ctttctcgtt gaaacctgtt gttaataaag tttttcactc tgaaaaaaaa 1020 

aaaaaaaana raaaacncgn gggggggccc ggaacccaat tcsccggata gtgagt 1076 

<210> 196 
<211> 943 
<212> DNA 

<213> Homo sapiens 
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<400> 196 

ggcacgagct 

catgggcaac 

ggtctgtgcc 

cctcctcacc 

ttgagatctt 

cgagggtctc 

aacaagaccc 

tctgcaggac 

tgcctatatt 

tcagaggagg 

gtctggagcc 

tgtggctcca 

tgtgtcactg 

tgagggcact 

ggctctgagt 

aagctttttt 



ccgcccggcc 
tgccaggcag 
actttgatcc 
cacaggactg 
ttggccagct 
acgtcgtggg 
ggacattcca 
aactggtcct 
ttagtgctgt 
gggctggaaa 
atgaaggcct 
ggctgctggt 
ctatgtgctt 
ggcctagttc 
tggaaaaggt 
gcccaaaaaa 



ccgaggggct 
ggcacaacct 
tgctgctcct 
gcctgcgcac 
gaccctgtgt 
cttgctgacc 
ggccacagtc 
tatcacagcc 
tccaggaatc 
tgccaccctg 
tgtgctgacc 
cttgggctcc 
ccacccgcgc 
ccgacttgtt 
tttagaaaaa 
aaaaaaaaaa 



ctccccggag 
gcacctgtgt 
tggcctctct 
cctgacatcc 
cccaggaatg 
accttgaact 
ctgggaagtc 
agggtgacca 
ctaccctcca 
agccctagaa 
aagctgctca 
ttcctgcttc 
cgggagtccc 
tctcaggtgt 
gtgaagagct 
aaaaaaaaaa 



gctcagcccc 
ctggcccacc 
ggcctgggcc 
cccaggactg 
ggacagtcac 
tcggagacgg 
agatgggatt 
cagaaaggac 
gccagccacc 
tgggtgagga 
cctcggagga 
tcttctgtgg 
actggtctag 
gaatcaactt 
ggaatgtggg 
aaa 



ctctgctccc 60 

acccacctct 120 

ttggcagctt 180 

ggtctctttt 240 

agggaagtgg 300 

tccagacagg 360 

gaaaggatct 420 

tgcaggaacc 480 

catatcctgc 540 

atgtgttagt 600 

gctggctctg 660 

ccttctctgc 720 

aacccggctc 780 

cttgggcctt 840 

ggaaaataaa 900 

943 



<210> 197 
<211> 1566 
<212> DNA 

<213> Homo sapiens 



<400> 197 

cgcgtccggc 

ccgcgagccc 

ttgctgcctc 

gaggggagcc 

ggaggtctgt 

ggagcagtgc 

aggttgctcc 

tgaaccgaag 

tgacggggcc 

ctttgtgctg 

ccagcagcaa 

agtacccagg 

tgtgagccac 

gggaccagga 

ccttcatgct 

cgcgtttctg 

aaggggagct 

gcctgtgtta 

ctggctgagg 

tttgggcctg 

cttgtgggta 

cgggcaccaa 

cctgcagctg 

cagcctgaca 

aaataaagga 

aaaaaaaaaa 

aaaaaa 



gcccggcagc 
atggaggctc 
ctcctatgtg 
ctgatccgcc 
gagcactgcg 
cggccagagg 
atctacaacc 
acagtcacaa 
agctttatcg 
cacttcctca 
gtacctgagt 
gtgctgtctt 
accccttttg 
ctcagcctgg 
gccagcaggg 
tggctgtggg 
gtgcctgagg 
aagtgtttgc 
cagtgtcacc 
gactccatcc 
agcggggggc 
gcaggctcag 
agaaaatccc 
gtgggttgta 
actctgcact 
aaaaaaaaaa 



tgtccaccga 
cgggaccccg 
cccagctggc 
tgaatatctg 
tggagggaga 
agccaggaca 
gctcagaggc 
cagggagccc 
gaggtgtcgt 
aggccaagga 
cccagctcac 
ctccatgggc 
tctcctccag 
gccagtctag 
aagagaacca 
caggagctgg 
ccctctgtgt 
tcttcctctg 
ttcctgagtg 
tgagggcaaa 
gggggcggga 
tgtctgatgc 
tcaaccacct 
tgcctgcctc 
gcaaaaaaaa 
aaaaaaaaaa 



tcccggccac 
cgccttgcgg 
tgtggctggt 
gccggcggtc 
cagagcgcgc 
ctgtgtggcc 
atgtccagct 
cccagtccct 
gctggtgttg 
cagcacctac 
ctcctggttc 
aagccctcag 
ttggggtgtt 
gagcccagct 
gtaggtgcca 
gccttgtgtc 
gccgtgtgtg 
ctgcctcctc 
tcctctttgg 
ggaggatgca 
aaaactctgg 
ctgacatctc 
cgtctcctcc 
tttccaccaa 
aaaaaaaaaa 
aaaaaaaaaa 



cgcccccggc 
actgcgctct 
aaaggagctc 
caaggggcct 
aatctctcca 
caatctgagg 
gctcaccacc 
gaggcccaca 
agcctacagg 
cagacgctgt 
ctgccccacc 
gacggtgaca 
tcctttgtca 
gagccctcct 
gcccaggcaa 
tagttgggtt 
ctgtggggcg 
tcgaggcagg 
cctctgcaga 
gagggtggcc 
ccgccagttt 
ctcctgtcct 
atcgcccctg 
ctggcctggg 
aaaaaaaaaa 
aaaaaaaaaa 



cacccccacc 60 

gtggcggctg 120 

gaggctttgg 180 

gcaaacagct 240 

gctgcatgtg 300 

tggtcaagga 360 

accccaccta 420 

gccctggatt 480 

cggtggcttt 540 

gagtacctgg 600 

gttccccttc 660 

gcgtgctcca 720 

gatgttggct 780 

gtgtcttttc 840 

gcctgtggcc 900 

ttgctctgag 960 

ggtcgccaca 1020 

gggtccttgg 1080 

atctgacccc 1140 

tctgggcacc 1200 

ttggctcctg 1260 

gggcctggaa 1320 

ctgggccccc 1380 

cactgccccc 1440 

aaaaaaaaaa 1500 

aaaaaaaaaa 1560 

1566 



<210> 198 
<211> 1067 
<212> DNA 

<213> Homo sapiens 
<400> 198 

taccggtccg gaattcccgg gtcgacccac gcgtccggcg cccggcagct gtccaccgat 
cccggccacc gyccccggcc acccccaccc cgcgagccca tggaggctcc gggaccccgc 



60 
120 
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gccttgcgga ctgcgctctg tggcggctgt tgctgcctcc tcctatgtgc ccagctggct 180 

gtggctggta aaggagctcg aggctttggg aggggagccc tgatccgcct gaatatctgg 240 

ccggcggtcc aaggggcctg caaacagctg gaggtctgtg agcactgcgt ggagggagac 300 

agagcgcgca atctctccag ctgcatgtgg gagcagtgcc ggccagagga gccaggacac 360 

tgtgtggccc aatctgaggt ggtcaaggaa ggttgctcca tctacaaccg ctcagaggca 420 

tgtccagctg ctcaccacca ccccacctat gaaccgaaga cagtcacaac agggagcccc 480 

ccagtccctg aggcccacag ccctggattt gackgggcca gctttatcgg aggtgtcgtg 540 

ctggtgttga gcctacaggc ggtggctttc tttgtgctga cttcctcaag gccaaggaca 600 

gcacctacca gacgctgtga gtacctggcc agcagcaagt acctgagtcc cagctcacyt 660 

ctggttcctg cccacgttcc cttcagtacc cagggtgctg tcttctccac tggcaagccc 720 

tcaggacggt gacagcgtgc tycatgtgag ccacacccct tttgtctyct ccagttgggg 780 

tgtttccttt gtcagatgtt ggctgggacc aggactcagc ctgggccagt ctaggagccc 840 

agctgagccc tcctgtgtct tttcccttca tgctgccagc agggaagaga accagtaggt 900 

gccagcccag caacctgtgg cccgcgtttc tgtggctgtg ggcaggagct gggccttgtg 960 

tctagttggg ttttgctctg agaaggggag ctgtgctgag gccctctgtg tgccgtgtgt 1020 

gctgtggggc gggtcgccac agcctgtgtt aaagtgtttg ctcttcc 1067 

<210> 199 
<211> 2078 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<222> (1177) . . (1177) 
<223> n equals a,t,g, or c 

<220> 

<221> misc__f eature 
<222> (1187) . . (1187) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (2057) . . (2057) 
<223> n equals a,t,g, or c 

<400> 199 

ggcacgagga gttgtgcaga tacctggctg agagctggct caccttccag attcacctgc 60 

aggagctgct gcagtacaag aggcagaatc cagctcagtt ctgcgttcga gtctgctctg 120 

gctgtgctgt gttggctgtg ttgggacact atgttccagg gattatgatt tcctacattg 180 

tcttgttgag tatcctgctg tggcccctgg tggtttatca tgagctgatc cagaggatgt 240 

acactcgcct ggagcccctg ctcatgcagc tggactacag catgaaggca gaagccaatg 300 

cyctgcatca caaacacgac aagaggaagc gtcaggggaa gaatgcaccc ccaggaggtg 360 

atgagccact ggmagagaca gagagtgaaa gcgaggcaga gctggctggc ttctccccag 420 

tggtggatgt gaagaaaaca gcattggcct tggccattta cagactcaga gctgtcagat 480 

gaggaggctt ctatcttgga gagtggtggc ttctccgtat cccgggccac aactccgcag 540 

ctgactgatg tctccgagga tttggaccag cagagcctgc caagtgaacc agaggagacc 600 

ctaagccggg acctagggga gggagaggag ggagagctgg cccctcccga agacctacta 660 

ggccgtcctc aagctctgtc aaggcaagcc ctggactcgg aggaagagga agaggatgtg 720 

gcagctaagg aaaccttgtt gcggctctca tcccccctcc actttgtgaa cacgcacttc 780 

aatggggcag ggtccccccm agatggagtg aaatgctccc ctggaggacc agtggagaca 840 

ctgagccccg agacagtgag tggtggcctc actgctctgc ccggcaccct gtcacctcca 900 

ctttgccttg ttggaagtga cccagccccc tccccttcca ttctcccacc tgttccccag 960 

gactcacccc agcccctgcc tgcccctgag gaagaagagg cactcaccac tgaggacttt 1020 

gagttgctgg atcaggggga gctggagcag ctgaatgcag agctgggctt ggagccagag 1080 

acaccgccaa aaccccctga tgctccaccc ctggggcccg acatccattc tytggtacat 1140 

cagaccaaga agctcaggcc gtggcagagc catgagncca gccgttnagg aaggagctgc 1200 

aggcacagta gggcttcttg gctaggagtg ttgctgtttc ctcctttgcc taccactctg 1260 

gggtggggca gtgtgtgggg aagctggctg tcggatggta gctattccac cctctgcctg 1320 

cctgcctgcc tgctgtcctg ggcatggtgc agtacctgtg cctaggattg gttttaaatt 1380 
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tgtaaataat tttccatttg ggttagtgga tgtgaacagg gctagggaag tccttcccac 1440 

agcctgcgct tgcctccctg cctcatctct attctcattc cactatgccc caagccctgg 1500 

tggtctggcc ctttcttttt cctcctatcc tcagggacct gtgctgctct gccctcatgt 1560 

cccacttggt tgtttagttg aggcacttta taatttttct cttgtcttgt gttcctttct 1620 

gctttatttc cctgctgtgt cctgtcctta gcagctcaac cccatccttt gccagctcct 1680 

cctatcccgt gggcactggc caagctttag ggaggctcct ggtctgggaa gtaaagagta 1740 

aacctggggc agtgggtcag gccagtagtt acactcttag gtcactgtag tctgtgtaac 1800 

cttcactgca tccttgcccc attcagcccg gcctttcatg atgcaggaga gcagggatcc 1860 

cgcagtacat ggcgccagca ctggagttgg tgagcatgtg ctctctcttg agattaggag 1920 

cttccttact gctcctctgg gtgatccaag tgtagtggga ccccctacta gggtcaggaa 1980 

gtggacacta acatctgtgc aggtgttgac ttgaaaaata aagtgttgat tggctagaaa 2040 

aaaaaaaaaa aaaattnctg cggtccgcaa gggaattc 2078 

<210> 200 

<211> 2494 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> (920) . . (920) 
<223> n equals a,t,g f or c 

<400> 200 

ggagatgttt aaggattacc cgccagccat aaaaccatcc tacgatgtgc tgctgctgct 60 

gctgctgcta gtsytcctsc tgcaggccgg cctcaacacg ggcaccgcca tccagtgygt 120 

gcgcttcaag gtcagtgcaa ggctgcaggg tgcatcctgg gacacccaga acggcccgca 180 

ggagcgcctg gctggggagg tggccaggag ccccctgaag gagttcraca aggagaaagc 240 

ctggagagcc gtcgtggtgc aaatggccca gtgaccccca gacgcggaaa ccgggtggca 300 

gckcccagcc tggccccaag catggaaacg cacaacccct aatcgccctg agctactgct 360 

tctaacacct cttttccctt gtgtgagggc aaaccaggct gcaggtgggg ttttcacttc 420 

ctagggtagt ttaattttaa aataggccaa tgttggctag tctgtgcctc agtgagatca 480 

gtcagctccg agtggctccc gtgtcgtaac agcaggagca tggccgcaac ttcccaggcc 540 

gaggaagggc ccccggctcg gcctcttgag agccccaccc ctgaactggc cccagctcct 600 

cttcctgcct ctctcatggc ttgggctgga gtgggctctc tggacctgac cagactgtgg 660 

gtccctgcgt ctcctgccca ctctgaccgg gcttcctccc tccacgctta gggtctgtcc 720 

cgggtactca gtcagcccag tgggatctta cccacttccc tgcaaggtgc acctgcccca 780 

ggctcaggct gcccagcggc tcttcctgga cagtgagagc agggctgggc gcctctgtcc 840 

tggcccggga gccgcagggg cccctcctcc agagcctggg cgcaagcgac acaggctgcc 900 

gctgctctcc aggtgaaatn cacaccagtc cacgccgggt cgcctgccct gtctccctac 960 

ttagacccag tcattctaga gggatccamc gccamactgg ccggcccacg tcctgggtgc 1020 

tgtcatgccc agcttggagt gccacgtggc cgctgcccac gtcccgggca ctgtcatgcc 1080 

cagcttggag tgccacatgg ccgctgccca cgtcccgggc actgtcatgc ccagcttgga 1140 

gtgccacgtg gccgctgctg tgacaggcag tgttcttggg ggtggggctg catccaaggc 1200 

tttgtaaacc ggctggacca cgtctccctg gccccagtga ccgggggaag ctgagcccct 1260 

ccctcctgtg tttgctccca ttactcaaaa tgcaggacag atcaggtcag agcccaggaa 1320 

ttctcacagg ttcacccagc gccctctacc tcctagcaag tactttgtct tgatcctcac 1380 

tgagaaggcc ccagggcagt ggtcttctcc atctccgctg ttttggggtc ttagggtaca 1440 

gcccaggcgg tcactgccca cctgccaggc tgcagggaca gttgggtgtg agaataacac 1500 

tggctttggg tagtgccatg gccaggagtg ggtttccctg cgtctcctcg tcccgagggc 1560 

gcctgggtcc tcccagctga cggcagtaaa tccacagtga gttggggcga ctgtgaaact 1620 

ggaatgctgt tactttgata attactttcc agcaggtgtt ttccttcaca atggttttgt 1680 

ttctttcctt ctgatctgag aagacatgaa cgttttctct tcaccgccgt ggggtgtatt 1740 

gactggtccc ccatgggctg ctggaaaggc ccggagatgc atctgtggcc tggggccatc 1800 

aagatcaaag aaccaggagg cctgggagat gcagctggat ggggcggcct gcagaccctg 1860 

ccagggggtt tgaggaccct cccaggtttc ccactgcgga acaggagtga ctctggctgc 1920 

caagatacct tcatggtgtt catgacaagt ggaatcatta ttttcaacca ttgaaggggg 1980 

atgcaggcaa gacaccttcc cagctgctcc tagaggggac aagccaggcc ctctctgcag 2040 

tcctcggcag ctccggaagg acacagtcag gggccgggca aacactttgg ccacagcccc 2100 

aaacaagcgc caccgtggga gaggagaggc tgctgtcact ggtaccggat gcagacccca 2160 

ccctgtctgc aggccacccc cacctccctg cagctttgag gctggcgggg tctgctcctg 2220 
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ggaatggggt gggagccaca gggacgaccc ggggcgggct gatgtcttct tgggggcaga 2280 

ccagagagct caagtttcag agtcagaatt aggcacttgg agcgtttttg ctggcttgca 2340 

ctttcttatt ttcttatttt agagcgctta aaaaatccgg aaaaatgggg tttaaaagaa 2400 

ctgtctcttt cagtctacat ttttgtttaa tacgcttgag caataaacgc tgacttgcag 2460 

acgtgaaaaa aaaaaaaaaa aaaaaaaaac tcga 2494 

<210> 201 
<211> 845 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> 

<223> n equals a,t,g, or c 
<220> 

<221> misc_f eature 

<222> (4) . . (5) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (823) . . (823) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (845) . . (845) 
<223> n equals a,t,g, or c 

<400> 201 

ntanntaatg gcacatctcc ccttctattg ggaacacaag ctggagctcc accgcggtgg 60 

cggccgctct agaactagtg gatcccccgg gctgcaggaa ttcggcacga ggtcaaggca 120 

aaaatgggtc aggtttggag agttccccca ctccttttga gtgttcaggt tttccttacc 180 

atggctcatg ctttccatca agcaccagag ttgcagtggc ttggcctctg gttctgggtg 240 

aggttatttg caggtggaga cggggggctg cacctgaaca tttctagtgt caccctccct 300 

ctccttcatg ggaaacagct ctccagggaa gtaccttcct gccaggggaa gccaaggctg 360 

ggccggccgc cctacaagga gccacaggat tgcagccatg ggtgccacct ttcatggaag 420 

gggagattta tgggctttcc tggaaccccc aggctgtcct ggccaagagg aaagaggtgg 480 

ttacttcagg agtttgacct tagttagata actaaaagaa tacatttccc ctcccttttc 540 

tttatttcct caataaaaat gtacaaagta tcacccttct ccatgcccca atctgtgtta 600 

aagtcacaat ctatgggtgt agttctggga ttctgtcaaa ttctccttcc tgctctccaa 660 

aatggacaat tgtcgtaggg accacatgcc cccagaatac aatggcctct gtgktctact 720 

ggggtcaagc ctgctagaac tcagcattca tgacaggggs taagtgtgca tgaaktgaca 780 

ctgactacag stargaaagc caggcgcaca aatgsccctt tcnccccaag ggccggtctt 840 

tccan 845 

<210> 202 
<211> 738 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (3).. (3) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (8) . . (8) 
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<223> n equals a,t,g, or c 
<220> 

<221> misc_ feature 
<222> (566).. (566) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (680) . . (680) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (684) . . (684) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (703) . . (703) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (715) . . (715) 
<223> n equals a,t,g, or c 

<220> 

<221> mis cofeature 

<222> (717) . . (717) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (731).. (731) 

<223> n equals a,t,g, or c 

<400> 202 

ccnctgtnct ggccctctat catttgcctg cctttagacc tgcctcaagg atatggcagc 60 

gctagccttt agctyccaca gcacggatgg gggtgatgcc agttagaagt gggtagtgaa 120 

cgtttgctga gctgttcact gtttmtctct tctctttgga agcacctctc cgagccatgt 180 

gagccccctg atgccaccga gcaggggcag cttcatgacc gatgtctggc tgaggctgtg 240 

gcggacactc tcggggttgt ctgcaggaga gcaagccagg aggacatggg cctggacgac 300 

acggcctcgc agcaaagtgt gtcagacgag cagtgacggg cgtgcggccg ggcggggagg 360 

ctggctcccc cacacctccc acctgcattg ctctccctcg tgctccccaa atcaccacaa 420 

ccaaccaata ccgcratcca tgagggactc ctcctgtgga aaaggagagc tgttccagaa 480 

cacagaactg atctcaggtt tttgaaaaaa aaaaaaaaaa aactcgaggg ggggcccggt 540 

acccaattcg ccctatagtg agtcgnatta caattcactg gcgtcgtttt acaacgtcgt 600 

gactgggaaa accctggcgt tacccaactt aatcgccttg cagcacatcc ccctttcgcc 660 

agctggcgta ataagcgaan aggnccggac cgatcggcct ttnccaacag ttggngnagc 720 

ctgaaatggg ngaatggg 738 

<210> 203 

<211> 1145 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> (9) . . (9) 

<223> n equals a,t,g, or c 
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<220> 

<221> miscL_f eature 
<222> (410) . . (410) 
<223> n equals a,t,g, or c 

<400> 203 

caggcagang ggctgagtca caggcacagg tgaggaactc aactcaaact cctctctctg 60 

ggaaaacgcg gtgcttgctc ctcccggagt ggccttggca gggtgttgga gccctcggtc 120 

tgccccgtcc ggtctctggg gccaaggctg ggtttccctc atgtatggca agagctctac 180 

tcgtgcggtg cttcttctcc ttggcataca gctcacagct ctttggccta tagcagctgt 240 

ggaaatttat acctcccggg tgctggaggc tgttaatggg acagatgctc ggttaaaatg 300 

cactttctcc agctttgccc ctgtgggtga tgctctaaca gtgacctgga attttcgtcc 360 

tctagacggg ggacctgagc agtttgtatt ctactaccac atagatcccn ttccaaccca 420 

tgagtgggcg gtttaaggac cgggtgtctt gggatgggaa tcctgagcgg tacgatgcct 480 

ccatccttct ctggaaactg cagttcgacg acaatgggac atacacctgc caggtgaaga 540 

acccacctga tgttgatggg gtgatagggg asatccggct cagcgtcgtg cacactgtac 600 

gcttctctga gatccacttc ctggctctgg ccattggctc tgcctgtgca ctgatgatca 660 

taatagtaat tgtagtggtc ctcttccagc attaccggaa aaagcgatgg gccgaaagag 720 

ctcataaagt ggtggagata aaatcaaaag aagaggaaag gctcaaccaa gagaaaaagg 780 

tctctgttta tttagaagac acagactaac aattttagat ggtaaggttc acaaataggt 840 

tgatttcttt cttcagcttt ctgacatgtc cagcccatct ctaatgagga ctcccagatc 900 

atcactttat ggctgttarg tgtttcccat atgaaattag aggagctggg tcagggagac 960 

aaaagtcttc tattagtctt atggatagct cctccttgag tgtattttgt gcaaaagatt 1020 

aagaagctgg actctactgc cattaaagct gagagaatcc taaggttatt tgtggcttcg 1080 

gggttatatt tattactact actactaata aatattcaac aagtaaataa atctttttta 1140 

aatca 1145 

<210> 204 
<211> 4909 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (2488) . . (2488) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (2493) (2493) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (2512) . . (2512) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (2523) . . (2523) 
<223> n equals a,t,g, or c 



<400> 204 



gcgtccggtg gtggcggcgg 
gcaagaagat ggtgtttctg 
cctcactgtt ggctctctta 
ctccaaaacg tagtgatggg 
tcatcccagt tcattatgat 
gaaccacgaa agtagaaatc 
accacctgca gatatctagg 



cgcaagggtg agggcggccc 
cccctcaaat ggtcccttgc 
actgtgtcca ctccttcatg 
acaccatttc cttggaataa 
ctcttgatcc atgcaaacct 
acagccagtc agcccaccag 
gccaccctca ggaagggagc 



cagaacccca ggtaggtaga 
aaccatgtca tttctacttt 
gtgtcagagc actgaagcat 
aatacgactt cctgagtacg 
taccacgctg accttctggg 
caccatcatc ctgcatagtc 
tggagagagg ctatcggaag 



60 
120 
180 
240 
300 
360 
420 
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aacccctgca ggtcctggaa cacccccctc aggagcaaat tgcactgctg gctcccgagc 480 

ccctccttgt cgggctcccg tacacagttg tcattcacta tgctggcaat ctttcggaga 540 

ctttccacgg attttacaaa agcacctaca gaaccaagga aggggaactg aggatactag 600 

catcaacaca atttgaaccc actgcagcta gaatggcctt tccctgcttt gatgaacctg 660 

ccttcaaagc aagtttctca atcaaaatta gaagagagcc aaggcaccta gccatctcca 720 

atatgccatt ggtgaaatct gtgactgttg ctgaaggact catagaagac cattttgatg 780 

tcactgtgaa gatgagcacc tatctggtgg ccttcatcat ttcagatttt gagtctgtca 840 

gcaagataac caagagtgga gtcaaggttt ctgtttatgc tgtgccagac aagatgaatc 900 

aagcagatta tgcactggat gctgcggtga ctcttctaga attttatgag gattatttca 960 

gcataccgta tcccctaccc aaacaagatc ttgctgctat tcccgacttt cagtctggtg 1020 

ctatggaaaa ctggggactg acaacatata gagaatctgc tctgttgttt gatgcagaaa 1080 

agtcttctgc atcaagtaag cttggcatca caatgactgt ggcccatgaa ctggcccacc 1140 

agtggtttgg gaacctggtc actatggaat ggtggaatga tctttggcta aatgaaggat 1200 

ttgccaaatt tatggagttt gtgtctgtca gtgtgaccca tcctgaactg aaagttggag 1260 

attatttctt tggcaaatgt tttgacgcaa tggaggtaga tgctttaaat tcctcacacc 1320 

ctgtgtctac acctgtggaa aatcctgctc agatccggga gatgtttgat gatgtttctt 1380 

atgataaggg agcttgtatt ctgaatatgc taagggagta tcttagcgct gacgcattta 1440 

aaagtggtat tgtacagtat ctccagaagc atagctataa aaatacaaaa aacgaggacc 1500 

tgtgggatag tatggcaagt atttgcccta cagatggtgt aaaagggatg gatggctttt 1560 

gctctagaag tcaacattca tcttcatcct cacattggca tcaggaaggg gtggatgtga 1620 

aaaccatgat gaacacttgg acactgcaga ggggttttcc cctaataacc atcacagtga 1680 

gggggaggaa tgtacacatg aagcaagagc actacatgaa gggctctgac ggcgccccgg 1740 

acactgggta cctgtggcat gttccattga cattcatcac cagcaaatcc gacatggtcc 1800 

atcgattttt gctaaaaaca aaaacagatg tgctcatcct cccagaagag gtggaatgga 1860 

tcaaatttaa tgtgggcatg aatggctatt acattgtgca ttacgaggat gatggatggg 1920 

actctttgac tggcctttta aaaggaacac acacagcagt cagcagtaat gatcgggcaa 1980 

gtctcattaa caatgcattt cagctcgtca gcattgggaa gctgtccatt gaaaaggcct 2040 

tggatttatc cctgtacttg aaacatgaaa ctgaaattat gcccgtgttt caaggtttga 2100 

atgagctgat tcctatgtat aagttaatgg agaaaagaga tatgaatgaa gtggaaactc 2160 

aattcaaggc cttcctcatc aggctgctaa gggacctcat tgataagcag acatggacag 2220 

acgagggctc agtctcagag cgaatgctgc ggagtgaact actactcctc gcctgtgtgc 2280 

acaactatca gccgtgcgta cagagggcag aaggctattt cagaaagtgg aaggaatcca 2340 

atggaaactt gagcctgcct gtcgacgtga ccttggcagt gtttgctgtg ggggcccaga 2400 

gcacagaagg ctgggatttt ctttatagta aatatcagtt ttctttgtcc agtactgaga 2460 

aaagccaaat tgaatttgcc ctctgcanac ccnaaaataa ggaaaagctt cnatggctac 2520 

tanatgaaag ctttaaggga gataaaataa aaactcagga gtttccacaa attcttacac 2580 

tcattggcag gaacccagta ggatacccac tggcctggca atttctgagg aaaaactgga 2640 

acaaacttgt acaaaagttt gaacttggct catcttccat agcccacatg gtaatgggta 2700 

caacaaatca attctccaca agaacacggc ttgaagaggt aaaaggattc ttcagctctt 2760 

tgaaagaaaa tggttctcag ctccgttgtg tccaacagac aattgaaacc attgaagaaa 2820 

acatcggttg gatggataag aattttgata aaatcagagt gtggctgcaa agtgaaaagc 2880 

ttgaacgtat gtaaaaattc ctcccttgcc aggttcctgt tatctctaat caccaacatt 2940 

ttgttgagtg tattttcaaa ctagagatgg ctgttttggc tccaactgga gatacttttt 3000 

tcccttcaac tcattttttg actatccctg tgaaaagaat agctgttagt ttttcatgaa 3060 

tgggctatcg ctaccatgtg ttttgttcat cacaggtgtt gccctgcaac gtaaacccaa 3120 

gtgttgggtt ccctgccaca gaagaataaa gtaccttatt cttctcattt tatagtttat 3180 

gcttaagcac ccgtgtccaa aaccctgtac cccatgttta tcattcataa actgtttcat 3240 

cagtctcctc gaaagactct gaatagtcga ctactgaaca atgaacacct ggatctgaga 3300 

ctaagccgga cgatgactgg gttaaagctc tcccggctca cccctccaga cccgctgccc 3360 

atccctcttc cttgctccat gcccaggggc tgacttgtaa aggccaagtc atcaagcttt 3420 

cttgcccttt ggatgttggt cagtggggag ccggagagct ggagctgggg tcggaggagg 3480 

tagtaggtgg aggtgttctt ccctgattcc cttgcgggat gcctcgggct ggcctcccct 3540 

gagggtctta gctccgagag gggaccctct tttccacaca gccttctcca cctctggatt 3600 

ttggtaactg ctccctcctc atcccttcag gattagtggc ctcagtggga gtctggcttt 3660 

tactagtcct ggcggacttg tggtttctac ataatgtgct cgcacttttg caaaaaatct 3720 

ttttatagaa ccctcctcag ataattctga gtgtctgtca tctatttccc tgactggtac 3780 

agtatctctt ctgaaaaagc agagtgcatt caagtctgta ggaaaaccct tttcttaggg 3840 

aggtgatttt ttttctctct ctgcttctta tttggcctac tttacaattt ctaactaact 3900 

agttattggc atttactgac agtaaattat tgcagtcacc aataaatgat agtacattgt 3960 

gaaacaaaat atttgctcat attagcaaat aggacattct ttggctttga agtctttctt 4020 

ttgtgaagac ttcacacacg gttgcttcag cacacagttg ctgctcaggt tttatgtata 4080 
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gatgataata atagaaagca cagtttacta acatggtaaa ccaacggagt tcaagtcaag 4140 

tcagttaata ccctaagaat tagattttat ttcttattct gaaaacttgc tacacaggga 4200 

cttatctaac ccatagtgtg ctctgttgct gacttgattc aagttgcagc gtgttttgcg 4260 

ctgactctaa ggtgcggaaa tcctcacacc tggcaaagga gaattcaaac tgaacttttt 4320 

gaatataagg caaaaacttc aagataaggg aatatgattg atgattggta cgaaaaatgt 4380 

caaaaatgtg ttcccctaat acacgacaaa atagagtgac ttctggacat aaatctgcca 4440 

tttattaaac cattcactac aacaaataaa taggtataaa agtggaattg gaatttttat 4500 

acttatttgt tgtagtgaat ggtttaataa aaatagaaat cactggtaat ttccacccca 4560 

aactaaacta tttcccttct tttaaaaaaa tacacaacca agattttaat gtaaaatatt 4620 

ttgctttaat tgtattttat gccttgatta atgaaacatg gaaatattga ttttcagttt 4680 

tggtcacctg aggaacctat ctttgtttgc ttttggaaaa gcccattttc taaacagata 4740 

caatattgcc acaacaatgt gcagaaacct ttttgataat aaaaaattgt tctttgcctc 4800 

taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4860 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 4909 

<210> 205 
<211> 2921 
<212> DMA 

<213> Homo sapiens 
<400> 205 

ccacgcgtcc ggatttctga ggttctgcaa acaggccacc ctgcagttct gtgtggtgaa 60 

gccactcatg gcggtcagca ctgtggtcct ccaggccttc ggcaagtacc gggatgggga 120 

ctttgacgtc accagtggct acctctacgt gaccatcatc tacaacatct ccgtcagcct 180 

ggccctctac gccctcttcc tcttctactt cgccacccgg gagctgctca gcccctacag 240 

ccccgtcctc aagttcttca tggtcaagtc cgtcatcttt ctttccttct ggcaaggcat 300 

gctcctggcc atcctggaga agtgtggggc catccccaaa atccactcgg cccgcgtgtc 360 

ggtgggcgag ggcaccgtgg ctgccggcta ccatgacttc atcatctgtg tggagatgtt 420 

ctttgcagcc ctggccctgc ggcacccctt cacctacaac gtctatgctg acaagaggct 480 

ggacgcacaa ggccgctgtg cccccatgaa gagcatctcc agcagcctca aggagaccat 540 

gaacccgcac gacatcgtgc aggacgccat ccacaacttc tcacctgcct accagcagta 600 

cacgcagcag tccaccctgg agcctgggcc cacctggcgt ggtggcgccc acggcctctc 660 

ccgctcccac agcctcagtg gcgcccgcga caacgagaag actctcctgc tcagctctga 720 

tgatgaattc taggtgcggg ctgcagtggc ggaagtgctg gcgccatagc cacggtcagg 780 

ctgtgcccca cctccagcct caccaccagg ccaggaggca gctggcacag tgctcacgcc 840 

gcctttattt attggaccag aaacactcac atgtcgcttc cagaggaacg ggggacagcc 900 

aggctcgccc atgggccttc aggaatattt atacatggcc cagcctgcac tgcccgggcg 960 

agggcagagg acactgggag caaggcttat gcccctgctg cccgtcctgt gctgggggca 1020 

tgctgggacc agccgcaccc aggccccaat gcttgtgtgt ggaccagcgg ctgcagcctt 1080 

ctagcccctc ctccccgcga gactctcagg ctgaggtcgg caagccgtgg ctcccccaca 1140 

caccgtgcaa taccctgtct gacctgggct cttcccgcct gcatcccttc cctgtccacc 1200 

tttgtccagt gctagattca cctcaccccg ggcaggagtg gggatgtggg cgctctgtgg 1260 

tcctcccctc ctgacccagg cctctgtggc atgctgcaag gatcagagcc agacaccagg 1320 

agtcacaggc cccacccagg aagggcattc agggcccctg ggcaccgctt ctgttgaagc 1380 

aggggcttct gggcccctgg gtatccccac ctgtcgtggc cacacctctg cctgcctcat 1440 

gccccttccc ctggcctacc aaggacagcc cacagcccgc actgccggct cacttgggtc 1500 

cttcctcgat agctttgggc agagcccttg cttcctggct gcttcagggc tcaggggctc 1560 

ccagccctcc ttcccaggct gatgctgggt cctctctctc tttggggctt ctccctcccg 1620 

tttcagggga aaggtctgag tctccacgtt tcagaccagc ttctggggga aggcagtccg 1680 

gcagggagac cgggaggggt ggccacacag tggggagctg ggaggtgggg ggaatggtcc 1740 

cagactcctc tcggggcccc tatccacaca gggcctggtg ttctacccca tctggcccct 1800 

ggcccatctc ttctgtgcct tagtcacata tgaaagcgcc cctccctggc tccccatc tg 1860 

tcccacacgc tccctggggc tcttagttca gctgctggca ctcgcaggat cctgcagtgc 1920 

tgggcccaga gcccttggac aggcctcagg agtggtcagg accaccaagc ccctcctctc 1980 

cccctccaca cctctagacc tggggcctcc ggaaccccca gcaggctggg cttatactag 2040 

ctcctgactt aggaagagcc tcgtgtcaca acacgtgtcc ctacaggcaa agtgtcctgg 2100 

catttaaaac ccagattatc cctgggtttg ggctgcagtc acctggagaa gctggtaggg 2160 

taagggagag ggaccctgcc ggtgttcatg gggattcttt cttttggtcc ttcctggaat 2220 

gaacaggttc cctccctgcc acctgtgagg agagttgggg cccagccgtc ttcctggcct 2280 

ccttcctttc ctcgtggcag aggcctgcat gtgggtgcca gaggccagct ctccccctcc 2340 

atcttggggg ggcggagcag ttgggcccaa gctgcccggg agggtgggtg cagacacagg 2400 
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ctgaggacca gccctggccc tgccccgcca tctgctttca ccaagctgtc tctccaccgt 2460 

ggcttccctt ctccctccag gccaaagtgc tgctgattcc cactcccttg gttttcgcct 2520 

gcccagcgtt gctgtttgcg tggagggtgg ggggagctca gtggcaggga atcagcggtc 2580 

cgtggggtcg tggggacggg aacatgtgcc cgaccgctcc atcccctcct cctccttagg 2640 

atgcataacc taccttgtct tttttttttt aattttcttt ccaggtagag tagctctttg 2700 

tacataaaga atacttgaaa aattaattgt atgatgtatg agaagacaga gtctcctagt 2760 

tttgtatctt gttgtatgac tgccatgagt tccaccagaa agccactcta ttttggtctc 2820 

tgtgacattt taaatgcgtg acagaagtga gcaaataaag tgaggaagaa atctaaaaaa 2880 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaag g 2921 

<210> 206 

<211> 1259 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (4).. (4) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 

<222> (18).. (18) 

<223> n equals dL,t,g f or c 

<400> 206 

gggntacaaa agctgganct ccaccgcggt ggcggccgct ctagaactag tggatccccc 60 

gggctgcagg aattcggcac gagtccacac agggcctggt gttctacccc atctggcccc 120 

tggcccatct cttctgtgcc ttagtcacat atgaaagcgc ccctccctgg ctccccatct 180 

gtcccacacg ctccctgggg ctcttagttc agctgctggc actcgcagga tcctgcagtg 240 

ctgggcccag agcccttgga caggcctcag gagtggtcag gaccaccaag cccctcctct 300 

ccccctccac acctctagac ctggggcctc cggaaccccc agcaggctgg gcttatacta 360 

gctcctgact taggaagagc ctcgtgtcac aacacgtgtc cctacaggca aagtgtcctg 420 

gcatttaaaa cccagattat ccctgggttt gggctgcagt cacctggaga agctggtagg 480 

gtaagggaga gggaccctgc cggtgttcac tggggattct ttcttttggt ccttcctgga 540 

atgaacaggt tccctccctg ccacctgtga ggagagttgg ggcccagccg tcttcctggc 600 

ctccttcctt tcctcgtggc agaggcctgc atgtgggtgc cagaggccag ctctccccct 660 

ccatcttggg ggggcggagc agttgggccc aagctgcccg ggagggtggg tgcagacaca 720 

ggctgaggac cagccctggc cctgccccgc catctgcttt caccaagctg tctctccacc 780 

gtggcttccc ttctccctcc aggccaaagt gctgctgatt cccactccct tggttttcgc 840 

ctgcccagcg ttgctgtttg cgtggagggt ggggggagct cagtggcagg gaatcagcgg 900 

tccgtggggt cgtggggacg ggaacatgtg cccgaccgct ccatcccctc ctcctcctta 960 

ggatgcataa cctaccttgt cttttttttt ttaaattttc tttccaggta gagtagctct 1020 

ttgtacataa agaatacttg aaaaattaat tgtatgatgt atgagaagac agagtctcct 1080 

agttttgtat cttgttgtat gactgccatg agttccacca gaaagccact ctattttggt 1140 

ctctgtgaca ttttaaatgc gtgacagaag tgagcaaata aagtgaggaa gaaatctata 1200 

tatgagataa tatagattgt attgaaaaaa aaaaaaaaaa aaaaaaaaaa aaactccga 1259 

<210> 207 
<211> 1314 
<212> DNA 

<213> Homo sapiens 
<400> 207 

atgcggcagc tcttctatga ccctgacgag tgcgggctga tgaagaaggg gggcttgtac 60 

ttcagtgact tctggaataa gctggacgtc ggcgcaatct tgctcttcgt ggcagggctg 120 

acctgcaggc tcatcccggc gacgctgtac cccgggcgcg tcatcctctc tctggacttc 180 

atcctgttct gcctccggct catgcacatt tttaccatca gtaagacgct ggggcccaag 240 

atcatcattg tgaagcggat gatgaaggac gtcttcttct tcctcttcct gctggctgtg 300 

tgggtggtgt ccttcggggt ggccaagcag gccatcctca tccacaacga gcgccgggtg 360 

gactggctgt tccgagggcc gtctaccact cctacctcac catcttcggg cagatcccgg 420 
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gctacatcga 
acaagcctaa 
gacggtcctc 
catcgccatg 
gttccagcgc 
gatcctcttc 
gaggcacaag 
ggagatctac 
cgagcagaag 
ggacccactg 
ggcccagaca 
ctcggaggcg 
gccgggaggc 
cctcctctac 
ggagacggag 



cggtgtgaac 
gtgccccgag 
ctactctgcc 
ttcaactaca 
catgacctga 
agccacctgc 
cagctcaaga 
ctgaaggaga 
atcgaggaca 
aagaggtcgg 
gcccgagccc 
gacgtcccca 
aggaagaaga 
cccaactgcc 
ttcctgatct 



ttcaacccgg 
agcgacgcga 
tctacctgct 
ccttccagca 
tcgaggagta 
agctcttcat 
acaagctgga 
actacctcca 
tcagcaataa 
gctccatgga 
tgcactggat 
ctctggcctc 
cggaggagcc 
ctgtcacgcg 
atgacccacc 



agcactgcag 
cgcagcagag 
cttcaccaac 
ggtgcaggag 
ccacggccgc 
caagagggtg 
gaagaacgag 
gaaccgacag 
ggttgacgcc 
gcagaggttg 
cgtgaggacg 
ccagaaggcc 
gggcgacagc 
cttccccgtg 
cttttacacg 



ccccaatggc 
ccggccttcc 
atcctgctgc 
cacacggacc 
cccgccgtgc 
gtcctgaaga 
gaggcggccc 
ttccagcaaa 
atggtggacc 
gcctccctgg 
ctgcgggcca 
gcggaggagc 
taccacgtga 
cccaacgaga 
gcagagagga 



accgacccct 480 

ctgagtggct 540 

tcaacctcct 600 

agatttggaa 660 

cgcccccgtt 720 

ctccggccaa 780 

tgctatcctg 840 

agcagcggcc 900 

tgctggacct 960 

aggagcaggt 1020 

gcggcttcag 1080 

cggatgctga 1140 

atgcccggca 1200 

aggtgccctg 1260 

agga 1314 



<210> 208 
<211> 468 
<212> DNA 
<213> Homo sapiens 



<400> 208 

gtgagaagat 

ttatttttgc 

ggacaagcat 

gcttgggaat 

tcagaacttt 

cattctggtc 

tggaagtaat 

ctcgaaaagc 



aatcctgaga 
aggtcctgac 
tgatagattg 
tattcttgca 
cctcttttgg 
atatattatt 
tcgtctagga 
aatacctgca 



ggctgcatcc 
actaaactaa 
atgaatactc 
ataggaaatt 
aatgaaggag 
attctcaata 
cacagttatt 
gtggctcgaa 



tgagaaatac 
tgcagaatag 
tagtactatg 
caatctggga 
agaagagctc 
cagttgtacc 
ttataaactg 
cgaccacgct 



cagctggtgt 
tggtaagaca 
gatttttggg 
gagtcaaact 
tgtgttctcc 
catttcctta 
ggaccggaag 
caatgagg 



tttggaatgg 60 

aagtttaaaa 120 

tttctgatat 180 

ggggaccaat 240 

ggattcttaa 300 

tatgtgagtg 360 

atgtattaty 420 

468 



<210> 209 
<211> 181 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_f eature 

<222> (178) . . (178) 

<223> n equals a,t,g, or c 

<400> 209 

ggtcagtgtg cagatagcct tggataccag ktactggact ttcattaatc acgtcttcat 
ctgggggagc attgccattt atttctccat tttatttaca atgcacagta atggcatctt 
tggcatcttc ccaaaccagt ttccatttgt tggtaatgca cgacattccc tgacccanaa 

g 



60 
120 
180 
181 



<210> 210 
<211> 612 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (47).. (47) 

<223> n equals a,t,g, or c 



<220> 

<221> mi sc_f eature 
<222> (534) . . (534) 
<223> n equals a,t,g, or c 
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<220> 

<221> mi sc_f eature 

<222> (537) . . (537) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (563) . . (563) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (565) . . (565) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 

<222> (591) . . (591) 

<223> n equals a,t,g, or c 

<400> 210 

cagtctgggc ttaagaaacc accagaagaa cccaaaccag aaatgcncaa gtgtaaatgc 60 

aaaaattctt atagaagaaa tagcataaga atttgcacat tcggaaataa gaccaccttc 120 

catgaacaag gagaagcctt tggagatatc taaactgtgc aaatgaatag tcgctggcta 180 

agactgcttg caatccttcc tggccgctga tgccaacacc aatgtgagca cttttaatca 240 

tgctgacatc attggctcca tcwccaatgg ccaaagtaac agcatttctg tacttcttca 300 

ccagctctac cacttgggct ttctggagtg gagtgaccct gcagcaaatt acagtcttac 360 

acatgcaagc aagttctagg agatcattct tgacatcact ttctagggca tgagccaaac 420 

tgtggccatt tatgattaag gcataatctc ctgttatggt ttcttctaca atagaatcca 480 

actccagctg ctgctttttt tcacaaacta catggccatt ggaaaaattt ctgnttngtc 540 

caaacaaatt ttgttttgaa atnangagtt cttctctcac ttccacagca nttattccct 600 

gctataggga gg 612 

<210> 211 

<211> 1024 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mi sc_f eature 

<222> (29) . . (29) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (986) . . (986) 
<223> n equals a,t,g, or c 

<400> 211 

tgctttcctg agttcttctc tcacttccnc agcattattc cctgctatcc caaacmcatc 60 

attcatgtcg tcagtcagca tgttgcaggc ataaccgatg ttgatgscag tttcttgttt 120 

gtctcctgtt aggacccaga tcttaatatt ggctagtgat aaacttgtaa ctgtttcaat 180 

aacaccctcc tgtaacttat cttctacagc agtggcacct agtagcatca aatctctttc 240 

aatttcttca tatagcccag ctattcgttc atccctctct tctgtggcaa cattcgcatc 300 

ttcaagcatc ttatgccact ctttaaagta cttgtcatcc aggtctctgt atgcgatggc 360 

caaggtccga aggccttccc ctgcaaattc actgaggtgg tctgacgtca aagacaaaag 420 

gacttcattg gaaggatsaa gtttctcaaa cagaatagta tctgctcctt tggaataaag 480 

ctttatctgt ccttctgggt ttcgarctat gacagacatc ctttttctgg tgttgttgaa 540 

atccaaaaag gcaagtaatt gataagtaac tagtgttccc aattcttcta ttgttatggt 600 

ctctggggtc cgggatttaa aratgaaccc aaaatttcta gcggcagtca ctagagcccc 660 
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ttcatcaggt gactgaactt ggtaaatcag ctctcctgcg ctattctctt ctgacattac 720 

agtgtggcag agagcaagta acctaaggaa ttcatgaact ttgggatcac ccattttaat 780 

ggattccatc agattgtggt caaagaactg aaattctcta tccgcttgag atttgactga 840 

gaaatccaca ggctcttttt cctgagttat ttctgtcttc tgatccaggt catcatgtac 900 

ttcaccatag attctcccat taatggaaca tcttttaaag gtcatgatgt tttgagtgag 960 

ggtacccgtt ttgtcggaga aaatgnactc aatctgcccc agttcttcat tgagcgtggt 1020 

cgtt 1024 

<210> 212 
<211> 366 
<212> DNA 

<213> Homo sapiens 
<400> 212 

gacgcgtggg agctcattat ccatcaaact cactcargtg wcacytgagt gagtttgatg 60 
gataatgagc taatgtgata tctataggtc acaatttttt aaaaccaaaa ttttcaagtc 120 
tgggataatc tttcctaaat gggatcaaat gaaataatat gtgtaaaaga gtcaaatgca 180 
gtcctttacc atagtaactg cctatggacg ttgtctttcc cttacatgcc tgcctacact 240 
taaccagatg ttggttttca agtctaatkt gtcattagtt tcaccacatt kgctcacttt 300 
tkgtaacatt tttgcaagat ttgaaaactt tcagtaaatg ttttggcact attggtaaaa 360 
aaaaaa 366 

<210> 213 
<211> 519 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__f eature 
<222> (371) . . (371) 
<223> n equals a,t,g, or c 

<400> 213 

cctggttagg gtcctacagg gaaataaaat tataaccgtg gaggtacatt tctctaccag 60 

aaagcaaaaa taaagcatca tgtcttaatg gttttctaca aatcaacttc taattctaca 120 

gagtccttaa tctggtccct attaaattct tggtcagaca aagbtacatt tcccaagaga 180 

gtcaggtgac acttgagtga gtttgatgga taatgagcta atgtgatatc tataggtcac 240 

aattttttaa aaccaaaatt ttcaagtctg ggataatctt tcctaaatgg gatcaaatga 300 

aataatatgt gtaaaagagt caaatgcagt cctttaccat agtaactgcc tatggacgtt 360 

gtctttccct nacatgcctg cctacactta accagatgtt ggttttcaat gtctaatttg 420 

tcattagttt caccacattt gctcactttt tgtaacattt ttgcaagatt tgaaaacttt 480 

cagtaaatgt tttggcacta ttggtaaaaa aaaaaaaaa 519 

<210> 214 
<211> 2042 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__feature 
<222> (2001) . . (2001) 
<223> n equals a,t,g, or c 

<400> 214 

ggtcagcttt catctcgtcc tatctttgtt caggcaaact tctctagttc tgttttaata 60 

ggcatatttg ttaggtctgt tttttgaaat cctctttttt acattgttta aagataatgc 120 

cttggctaaa aagcctgctt cacttttccc tgtttttagt tgttttctcc acattggcag 180 

taaagagcct tggcgtccca gtagcagcag gttctccttt ttgtattgtg gatgttttgc 240 

atttcatact gttgtgaaga gtggctttga tcatacatgt tgttggtata tttgccyttt 300 

tgctgggggt gtgagaagaa ccagagatga gcagaggtac acccagtaga cttcccagcc 360 

tgcagagcct cccgggaaga gcttccgtgt tcaggtgctt ggggccccwc cctaggagcc 420 
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tgwctcwcag tcagagcwgg gtcccggctt gygttcagga ttttgaaaca tttgtawggt 480 

gattttgttg tttctacacc tttctcctca tctttttttt tttgtagtta atcgttacta 540 

ataacagaaa agacattttt ggcatggtaa ttggcacaaa gtgaataatt gttgaataga 600 

tgacttttga ggctttcaaa attcgagtgt ccataaaatc catccagagc cacctggttc 660 

ctttttttga accacttaac gtaattctgg aaaaccttga ctgtgggtct taagtttggt 720 

ggattgctgc ttctcactgg ctgacctttg gaggtcgcat atttcaggat gtgattccac 780 

ttaggctcca tttcacctga cactgcaatt ctgtgccttc agagggattt gttattgcga 840 

atgatgtgga caacaagcgc tgctacctgc tcgtccatca agccaagagg ctgagcagcc 900 

cctgcatcat ggtggtcaac catgatgcct ccagcatacc caggctccag atagatgtgg 960 

acggcaggaa agagatcctc ttctatgatc gaattttatg tgatgtccct tgcagtggag 1020 

acggcactat gagaaaaaac attgatgttt ggaaaaagtg gaccacctta aatagcttgc 1080 

agctacatgg cttacagctg cggattgcaa cacgcggggc tgaacagctg gctgaaggtg 1140 

gaaggatggt gtattccacg tgttcactaa accctattga ggatgaagca gtcatagcat 1200 

ctttacfcgga aaaaagtgaa ggtgctttgg agcttgctga tgtgtctaat gaactgccag 1260 

ggctgaagtg gatgcctgga atcacacagt ggaaggtaat gacgaaagat gggcagtggt 1320 

ttacagactg ggacgctgtt cctcacagca gacacaccca gatccgacct accatgttcc 1380 

ctccgaagga cccagaaaag ctgcaggcca tgcacctgga gcgatgcctt aggatattac 1440 

cccatcatca gaatactgga gggttttttg tggcagtatt ggtgaaaaaa tcttcaatgc 1500 

cgtggaataa acgtcagcca aagcttcagg gtaaatctgc agagaccaga gaaagcacac 1560 

agctgagccc tgcagatctc acagaaggga aacccacaga tccctctaag ctggaaagtc 1620 

cgtcattcac aggaactggt gacacagaaa tagctcatgc aactgaggat ttagagaata 1680 

atggcagtaa gaaagatggc gtgtgtggtc ctcctccatc aaagaaaatg aagttatttg 1740 

gatttaaaga agatccattt gtatttattc ctgaagatga cccattattt ccacctattg 1800 

agtaaggatt cagccttttt aattattcat ttaaagaaat ttactataga gtatcaaatg 1860 

tacaactgat cacatgtaac cattgttttg tatgtagttc tgtctagctt tttttttttt 1920 

ttaacctttt taactgcata ttagagcagg atgaaacttt agaggttact caatctttta 1980 

atttaaggag aaagtaaaca nttactttgt gaacatgata gataaaaaaa aactggaccg 2040 

gg 2042 

<210> 215 
<211> 308 
<212> DNA 
<213> Homo sapiens 

<400> 215 

ggaggcagga ccttgtccta ttcattaatc ttgcccctca acagttattt tcagaggggc 60 

aagaagtgtt tcagggttct tggcccttgt ttgaccagtc gtcctaaccc tcrtgtcttg 120 

ggtcattgtt gttrtaatct ggggttacct tttggaaggt catggggtac ccttttgcaa 180 

aagttatggg ccctmtcctt ggaaactgca cacacaccat gcagcttaca attcagggag 240 

ttcacaggtc tacagaatcc tgggaaactc tccatgtccg gttctaatcc attgtagctt 300 

cagtggga • 308 

<210> 216 
<211> 1568 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (1550) . . (1550) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_£eature 
<222> (1564) . . (1564) 
<223> n equals a,t,g, or c 

<400> 216 

ctcatcagcc ctatgaggta gggaggtagg tattattatc acccttgtta gtttttttgt 60 

tttgttttgt tttccagatg agagaatcat tgctcacaga agtgaagtaa cttttccaag 120 

gtcatacaat cagtaagtgg caggcaagga ctgaaatcca agttgttacc ctccaaagtc 180 
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cctgctctga gaactggagg aattctttat caaatctaaa atcctctttt agscctgtct 240 

gctttaatat tccggtcttt attgatcctt ctcttctcta aamcttcagc tgtcagtata 300 

aaaatcaagg aatttagcmc ttgttattgt gtgamcagct tcttgtctct cctgtactgt 360 

aagtgggtct agggattttt attctttaaa tatccccctg tactcagtag atctttggga 420 

gamcaagctc ataggcttct aataattctt tctttgactg ccagctgaat tagacagaag 480 

gtaagtcctg ctgccgtgtc gtgcctaacc ccatctttat ttcctgtgct gttagagaac 540 

agtcttttct tggctggaac aaatactaca gcctgctcaa ctagctaata tgtattgagt 600 

tcttartatg ttccaaggac tgctctaagt attttatata tattaactca ctgaatctta 660 

aataccctat gagctaagtc ctatttttat ccccatttta caaaagagga aactgaatgt 720 

accagtgcat cagtatttga ctgagtaaat gaatgactgc tttgctgatg gatagtatta 780 

ttagcaacaa ccctacaaat atgatgttat gtttgcatca tgcagtacag ctttatgtac 840 

cttatgtcat tgtcactcat gattagcaaa taggcacgaa catccctatt ttatagaaga 900 

ggaamccatg gctctaagag ggtgagtgat tctcaacagt cacatgccat ctgtatcctt 960 

cagtaaacaa ggtatttggt ccattccagg atcgggggca agagagatgg gagggcctcg 1020 

gtgagaaaca ctcatattca caaaaggtac tagatagata gacagataaa taaataaata 1080 

gagataaaag ctagtaatag cagagatttg atgggaattq agatctttga ttcctagtcc 1140 

agtgctcttt cttttatgta aggtgatggg aagcaagtct taggtccaga tctggcagct 1200 

gctttgattt aggatcttaa gccagaagca gcagcgccct aaacaaaaag catcatttta 1260 

acttctctgc atttctcaca ttctcaacaa tgaccatgcc ctactttcat aattaaaaac 1320 

aaacaaataa acagaggggg caaaatgcat ccttctgagt gggctggggc ttgtagaggg 1380 

attcttgggt ttgctctggg atgtctttgg gcccctgcac ttgtgggcac tctgatttat 1440 

ccccacaggc cacctggccc accttatggg ctgaggaggg cttgatgggc ggaggsaagc 1500 

agagctgagg agctggggag gagctggaaa ggactgctag aaggttccan tgaagctaca 1560 

atgngtag 1568 

<210> 217 
<211> 865 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_£ eature 

<222> (13) . . (13) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (20) . . (20) 

<223> n equals a,t,g, or c 

<400> 217 
accgcagcct 
cagytccact 
tggtctcaaa 
tacaggcgtg 
tttgttaaat 
ctgggtgggc 
ccctggctgg 
ccagactcca 
cattcccaaa 
kgctcacgcc 
aaaacagaga 
gcctgtaatc 
gaggttgcag 
tctatctcaa 
aggggcccaa 

<210> 218 
<211> 1687 
<212> DNA 

<213> Homo sapiens 



gcngggctcn agcattccty tsgcytcagc 
aattttgaag ttttttkggt agacatgaag 
ctcctgacct caagcagtcc tcctgtcttg 
agccactgtg ctggcctctt ttttcttttt 
gggaagtctg tgccatcaac tgagcattgt 
cactgggaga gaactataca ttaaatgtaa 
tttcctttcc tttctctcct tttctctact 
gtttcttacc accctcacgg attttgctat 
attgacttta tggagactca ttaaaagaaa 
acgaaggcgg gcgaatcacc tgaggtgcgg 
aaccccatct ctactaaaca atacaaaatt 
ccagctactg gggaggctgg gacgggagaa 
tgaccaaaga tcgcactatt gcactccagc 
aaaaaaaaaa aaaaaaaaaa aaaagggcgg 
gcttaggcgt gcatg 



ctcctgakta gctgggaccm 60 

tctccctgtg ttgcccgggc 120 

gcctctggaa atgctgggat 180 

cttttttttt aaggttttta 240 

attttctcct tagtaagagc 300 

gtagcctctg ggtagagagc 360 

ttggtgtctg gaggcatttc 420 

tgtattatca cctcctttat 480 

gaatcatcgg ccgggagcgt 540 

agttcgtgac cagcctgacc 600 

agctgggcgt ggtggtgcac 660 

tcacttgaac ccgggaggca 720 

ctgggcaaca agagcaaaac 780 

ccgctctaga ggatccctcg 840 

865 
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<220> 

<221> misc__f eature 
<222> (1568) (1568) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (1652) (1652) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (1654) . . (1654) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (1660) . . (1661) 
<223> n equals a,t,g, or c 

<400> 218 

ccacgcgtcc gggcagtggg gtgagggcac acaagcagtt cagggtccca gcaggaagtg 60 

gggctgcagg gccggggtgg gtcctgggcc tggccatcag gcagcctagc aggttgttct 120 

gggcatggag ggggcctggt gtggctgagg gcatgcccag ggctccctgg aggatcccgc 180 

tctgtgccct gcccaccctg tgcctgggga gccctctgcc ctcacagccc acccacccca 240 

ttttctatga ccacagagct ccgacctgga agatggctca cccaggaggt cccaggagct 300 

ctcactcccc caggggacct ggaggacacc cagctctcag acaaaggctej ccttgccggc 360 

ggggggagcc cgaaacagcc ctttgcagct ctgcaccagg agcaggtttt gcggaacccc 420 

catgtaaggc ttccccgggg tggggtcctc ccagccgtgg gcctcagggt gaccgatcac 480 

agggagagtg gctccctgcc ctgggcaccc cctgcggtgg ccccgacgac agctgaggag 540 

tgaccacaag gtctctgccc acagtgctcg gggtgcggtg tctgggctgc gaagtggatc 600 

cccctccttt cttgggcact gcagcagctt ggggggcttt ttggacgtgg atgtgcctgg 660 

tcctggtttc ccgagggcct ttacagtgga tgaggaggtg aacacaggga gtcctgagag 720 

caagcaccac ctcgggcttt ggtgtagaaa caatggcccg gaccccaggc cggagccgtg 780 

gcttggcctc ctgggtgtgt cttggcatct gaaatgcagg ctacccacac cggctcacct 840 

ccaggggtac aggcaggtcc cacagggaga gcttggcgct gagctgaggc tgtctgggct 900 

cctcgcctcc caaccagtct gcagttacag gggccagtgg atgggcgggt gagaaggacg 960 

ggttccctca ggggagccgg ccggagcccg agccttcccc cttctccagg acgcaggcct 1020 

gagcagcggg gagccgcccg agaaggagcg gcggcgcctc aaagagagtt ttgagaacta 1080 

ccgcaggaag cgcgccctca ggaagatgca gaaaggatgg cggcagggtg gaggaggacc 1140 

gggagaacac cacgggcagc gacaacaccg acactgaggg ctcctagccg cagcagccgc 1200 

aggccccgac cagggcacac ccaccggccc ggcctcctgc cacccggggg tgccgacgcc 1260 

ctggggcgca gacttccccg agccgtcgct gacttggcct ggaacgagga atctggtgcc 1320 

ctgaaaggcc cagccggact gccgggcatt ggggccgttt gttaagcggc actcattttg 1380 

cggaggccat gcgggtgctc accaccccca tgcacacgcc atctgtgtaa cttcaggatc 1440 

tgttctgttt caccatgtaa cacacaatac atgcatgcat tgtattagtg ttagaaaaca 1500 

cagctgcgta aataaacagc acgggtgacc cgcaaaaaaa aaaaaaaaaa aaaaaaaaaa 1560 

aaaaaaanaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1620 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa ananaaaaan naaaaaaaaa aaaaaaaaaa 1680 

aaaaaaa 1687 

<210> 219 
<211> 570 
<212> DMA. 

<213> Homo sapiens . 
<220> 

<221> misc_f eature 

<222> (5) . . (5) 

<223> n equals a,t,g, or c 
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<220> 

<221> misc_feature 

<222> (16) . . (16) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (496) . . (496) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (523) . . (523) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (531) . . (531) 
<223> n equals a,t,g, or c 

<400> 219 

ttccngtgcc actatngaat gtacgcctgc aggtaccggt ccggaattcc cgggtcgacc 60 

cacgcgtccg ggcagtgggg tgagggcaca caagcagttc agggtcccag caggaagtgg 120 

ggctgcaggg ccggggtggg tcctgggcct ggccatcagg cagcctagca ggttgttctg 180 

ggcatggagg gggcctggtg tggctgaggg catgcccagg gctccctgga ggatcccgct 240 

ctgtgccctg cccaccctgt gcctggggag ccctctgccc tcacagccca cccaccccat 300 

ttwctatgac cacagagctc cgacctggaa gatggctcac ccaggaggtc ccaggagctc 360 

tcactccccc aggacctgga rgacacccag ctctcagaca aargctgcct tgccggcggg 420 

gggagcccga aacagccctt tgcagctctg caccaggagc aggttttgcg gaacccccat 480 

gtaaggcttc cccggngtgg ggtcctccag ccgtgggcct canggtgacc natcacaggg 540 

agagtggctc cctgtcctgg gcaccccctg 570 

<210> 220 
<211> 1752 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (1099) . . (1099) 
<223> n equals a,t,g, or c 

<400> 220 

aaggtacgcc tgcaggtacc ggtccggaat tcccgggtcg acccacgcgt ccgtccagga 60 

cagagagtgc acaaactacc cagcacagcc ccctccgccc cctctggagg ctgaagaggg 12 0 

attccagccc ctgccaccca cagacacggg ctgactgggg tgtctgcccc ccttgggggg 180 

gggcagcaca gggcctcagg cctgggtgcc acctggcacc tagaagatgc ctgtgccctg 240 

gttcttgctg tccttggcac tgggccgaag cccagtggtc ctttctctgg agaggcttgt 300 

ggggcctcag gacgctaccc actgctctcc gggcctctcc tgccgcctct gggacagtga 360 

catactctgc ctgcctgggg acatcgtgcc tgctccgggc cccgtgctgg cgcctacgca 420 

cctgcagaca gagctggtgc tgaggtgcca gaaggagacc gactgtgacc tctgtctgcg 480 

tgtggmtgtc cacttggccg tgcatgggca ctgggaagag cctgaagatg aggaaaagtt 540 

tggaggagca gctgacttag gggtggagga gcctaggaat gcctctctcc aggcccaagt 600 

cgtgctctcc ttccaggcct accctactgc ccgctgcgtc ctgctggagg tgcaagtgcc 660 

tgctgccctt gtgcagtttg gtcagtctgt gggctctgtg gtatatgact gcttcgaggc 720 

tgccctaggg agtgaggtac gaatctggtc ctatactcag cccaggtacg agaaggaayt 780 

caaccacaca cagcagctgc ctgactgcag ggggctcgaa gtctggaaca gcatcccgag 840 

ctgctgggcc ctgccctggc tcaacgtgtc agcagatggt gacaacgtgc atctggttct 900 

gaatgtctct gaggagcagc acttcggcct ctccctgtac tggaatcagg tccagggccc 960 

cccaaaaccc cggtggcaca aaaacctgac tggaccgcag atcattacct tgaaccacac 1020 
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agacctggtt ccctgcctct gtattcaggt gtggcctctg gaacctgact ccgttagacg 1080 

aacatctgcc ccttcaggna ggacccccgc gcacaccaga acctctggca agccgcccga 1140 

ctgcgactgc tgaccctgca gagctggctg ctggacgcac cgtgctcgct gcccgcagaa 1200 

gcggcactgt gctggcgggc tccgggtggg gacccctgcc agccactggt cccaccgctt 1260 

tcctgggaga aygtcactgt ggacaaggtt ctcgagttcc cattgctgaa aggccaccct 1320 

aacctctgtg ttcaggtgaa cagctcggag aagctgcagc tgcaggagtg cttgtgggct 1380 

gactccctgg ggcctctcaa agacgatgtg ctactgttgg agacacgagg cccccaggac 1440 

aacagatccc tctgtgcctt ggaacccagt ggctgtactt cactacccag caaagcctcc 1500 

acgagggcag ctcgccttgg agagtactta ctacaagacc tgcagtcagg ccagtgtctg 1560 

cagctatggg acgatgactt gggagcgcta tgggcctgcc ccatggacaa atacatccac 1620 

aagcgctggg ccctcgtgtg gctggcctgc ctactctttg cctgcgcttt ccctcatcct 1680 

ccttctcaaa aaggatcacg cgaaagggtg gctgaggctc ttgaaacagg acgtccgctc 1740 

gggggcggcc gc 1752 

<210> 221 
<211> 536 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (508) . . (508) 
<223> n equals a,t,g, or c 

<400> 221 

ggtcgaccca cgcgtccgcc cacgcgtccg gcttccttaa tgtaatttaa accctggcaa 60 

acattcttta gaaaccaaga ggaaagaaag aacaaatatc aaaaaagaca tagaatttaa 120 

tattgataca atttcacctc taaaatggat ttgaagaaat gcaactttat atcaaaaaat 180 

gtcatctgat ttcctttgtt tcttttttaa attatgtaat cagatgattt tatgtttttt 240 

tttcagggga gcggaatatt ggtttctttt acttgttgtt ttcagttttc tctgccattc 300 

atgtttcttt tttgtgttca gtgtttcaaa tacaatttgt atttaaggat tttaaaatac 360 

caaactgtaa ctgagtacag tggatcgttt tctgttagga tgttaatatt atacaatgaa 420 

atctataaag tgttgtcaat ttgattattg acacatataa catgtttaca aataaactgt 480 

ggtattgatc aaaaaaaaaa aaaaaaancc cggggggggc cccggaaccc aatccc 536 

<210> 222 

<211> 2409 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<222> (694) . . (694) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc__f eature 
<222> (716) . . (716) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (755) . . (755) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (761).. (761) 
<223> n equals a,t,g, or c 

<220> 
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<221> misc_feature 

<222> (791) . . (791) . 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (808) . . (808) 
<223> n equals a,t,g, or c 

<220> 

<221> misc__feature 
<222> (880) . . (880) 
<223> n equals a,t,g, or c 

<400> 222 

ccacgcgtcc ggcgagacgc gttcaccacc atatgcactg ttgtcaactc ccctggagat 60 

gcgcccaagc cccacaggaa gccttcctcc tctgcctcct cttcctcatc ctcgtcctcg 120 

ttctccttgg atgcagccgg ggcctccctg gccacactcc ctggaggctc catcctgcag 180 

ccgcggccct ccttgcccct ctcctccacg atgcacttgg ggcctgtggt ttccaaggcc 240 

ctgagtacct cttgccttgt ttgctgcctc tgccaaaacc cggccaactt caaggacctt 300 

ggggacctct gtgggcccta ctaccctgaa cactgcctcc ccaaaaagaa gccaaaactc 360 

aaggagaagg tgcggccaga aggcacctgt gaggaggcct cgctgccgct tgagagaaca 420 

ctcaaaggtc ccgagtgtgc agctgccgcc actgccggga agccccccag gcctgacggc 480 

ccagctgacc cggccaagca gggcccactg cgcaccagtg cccggggcct gtcccggagg 540 

ctgcagagct gctactgctg tgatggccgg gaggatgggg gcgaggaggc agccccagcc 600 

gacaagggtc gcaaacatga gtgcagcaag gaggctccgg cagagcccgg cggggaggcc 660 

caggagcact gggtgcatga ggcctgtgcc gtgnggaccg gcggcgtcta cctggnggcc 720 

gggaagctct ttgggctgca ggaggccatg aaggnggccg nggacatgat gtgttccagc 780 

tgccaagaag ncggggccac catcgggngc tgccacaaag gatgcctcca cacctaccac 840 

tacccgtgtg ccagcgatgc aggttgcata ttcatcgaan agaacttttc tttgaaatgt 900 

cccaaacata agaggctgcc gtagtaatcc accccaacgg ccggaggagc cgccggagcc 960 

cgcctgcccg cccgccgccg aaggagagga gccgcctgcg cagcccccgg gcctttgagc 1020 

tgctcccagc gctggtccag agccgatcct tgatccgggt cccggatcgt ggatccggcc 1080 

gcctagggct cagacttgcg gccccgggtt gggaggaaaa cccgttccgg agccgcctgc 1140 

tcccggaacc ggacggcaca gggcgttctt gcccacccca ggggccaggc ttgcggaggg 1200 

ggagcccgcg gagcggccag actccccggg gcgctcagcc tccggcgagg gtgggagacg 1260 

gctttgtcct ggggacactt tccctctgga atctcaagac gacgtggcac acattccacg 1320 

tgggtgctgc cgccacccca gtcggtcgtg gcgtgcagct gggagccctg ggcttggggg 1380 

tgggggtcga aacagtactg gaagaggcgg agggcggctc ctagctccgt ggactaggcg 1440 

ggggagaaag gaagcctttc tgagagcggg ctaggccggc actggagagg ccggagcctt 1500 

tggaacaaac cgtgcggaac gcgtccaggg gccttcccgc ccagcctttg ccagatctct 1560 

cgtgcggttc gggcaaagcc ggggtagacc tgggctatgc tcagttaggg gttgcgggat 1620 

ccccgagtgt gggcgggact gggacaccct ttggcctctg tttgtcccct ttccagtcct 1680 

ccaccccacc cctggagccc agcctgggag cgcaaaaccc aagaagcggc cagaacgcac 1740 

ctccggctcc ggcggacgcg cgaccgttgt gcaccaccag ggaccgccgc gcctactctg 1800 

cacgggagca gggacagcgc tagatttcgt gtacaaaacc tgtgtacccc tctatatata 1860 

tgttacatag aatgtatata tgttgggaac atgctcgctt ctcccgtgtg tcgccgccgt 1920 

gcgtcgtgcg cccgcaacag agccccaacc gggcctttgc. cgggtaaggg gctaccgcga 1980 

cgccacttgt ccacgcagcc accaccggcc cgggccagtc cctgccagtc cgtccgcctg 2040 

tccgtccgtg tcctcagctc tgtccacgct tcgataggcc tgacgcagcc cccagcccag 2100 

ggccgcccta gcaacttcct gtacatatga ctgtaaaatg gtaaacgtgt gtattatatc 2160 

tggcctcgtt atatagtgta tatatatgta tacatataca tatatataat atatatgaag 2220 

actgtaaatg ttaagacgac tagtgttctt attagtatat tgcttcacac tgaagattgt 2280 

gtgtatcgag ctgtttctaa aagatgttta ttttccttaa gagtaaaaaa cagtcattgc 2340 

attcagaaaa aaaaaaaaaa aaaaagtcaa taaagataca acgattgttt tggaaaaaaa 2400 

aaaaaaaaa 2409 

<210> 223 
<211> 737 
<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<222> (1)..(1) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (21).. (21) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (369).. (369) 
<223> n equals a,t,g, or c 

<400> 223 

nagagggaag agagaaaaag ntacaatgct tcgccgtycg sctktmcgkc csygtcctca 60 

gctctgtcca cgcttcgata ggcctgacgc agcccccagc ccagggccgc cctagcaact 120 

tcctgtacat atgactgtaa aatggtaaac gtgtgtatta tatctggcct cgttatatag 180 

tgtatatata tgtatacata tacatatata taatatatat gaagactgta aatgttaaga 240 

cgactagtgt tcttattagt atattgcttc acactgaaga ttgtgtgtat cgagctgttt 300 

ctaaaagatg tttattttcc ttaagagtaa aaaacagtca ttgcattcag aaaaaaaaaa 360 

aaaaaaaang tcaataaaga tacaacgatt gttttggaaa atctgcagcc cgtggattcc 420 

gaccagattc agctgggagc cgggccaggc tttaggttgg ggaatgggaa tgaagggagg 480 

ggctgggggg gggggcatga atggagtcag ggagtcggcc tttcacagaa caggaaacct 540 

cccccgcccc tgtgccccct ctccagtgtg gcggcaggtc gggagggagg aggcttcttt 600 

gctgtgagat gaccaggggc cgggatgggg gaggtgagac gtgccagact tcttgcaggg 660 

agacccaagc tgtagctcct gtcccacaac aggtcctgga agtcagtcca tcctcccgtg 720 

ccacccaggg acctaat 737 

<210> 224 

<211> 1471 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (798) . . (798) 
<223> n equals a,t,g, or c 

<400> 224 

tttctgaatg caatgactgt tttttactct taaggaaaat aaacatcttt tagaaacagc 60 

tcgatacaca caatcttcag tgtgaagcaa tatactaata agaacactag tcgtcttaac 120 

atttacagtc ttcatatata ttatatatat gtatatgtat acatatatat acactatata 180 

acgaggccag atataataca cacgtttacc attttacagt catatgtaca ggaagttgct 240 

agggcggccc tgggctgggg gctgcgtcag gcctatcgaa gcgtggacag agctgaggac 300 

acggacggac aggcggacgg actggcaggg actggcccgg gccggtggtg gctgcgtgga 360 

caagtggcgt cgcggtagcc ccttacccgg caaaggcccg gttggggctc tgttgcgggc 420 

gcacgacgca cggcggcgac acacgggaga agcgagcatg ttcccaacat atatacattc 480 

tatgtaacat atatatagag gggtacacag gttttgtaca cgaaatctag cgctgtccct 540 

gctcccgtgc agagtaggcg cggcggtccc tggtggtgca caacggtcgc gcgtccgccg 600 

gagccggagg tgcgttctgg ccgcttcttg ggttttgcgc tcccaggctg ggctccaggg 660 

gtggggtgga ggactggaaa ggggacaaac agaggccaaa gggtgtccca gtcccgccca 720 

cactcgggga tcccgcaacc cctaactgag catagcccar gtctaccccg gctttgcccg 780 

aaccgcacga gagatctngc aaargctggg cggraaggcc cctgracgcg ttccgcacgg 840 

tttgttccaa aggctccggc ctytccagtg ccggcctagc ccgctctcag aaaggcttcc 900 

tttctccycc gcctagtcca cggagctagg agccgccctc cgcctcttcc agtactgttt 960 

cgacccccac ccccaagccc agggctccca gctgcacgcc acgaccgact ggggtggcgg 1020 

cagcacccac gtggaatgtg tgccacgtcg tcttgagatt ccagagggaa agtgtcccca 1080 

ggacaaagcc gtctcccacc ctcgccggag gctgagcgcc ccggggagtc tggccgctcc 1140 
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gcgggctccc cctccgcaag cctggcccct ggggtgggca agaacgccct gtgccgtccg 1200 

gttccgggag cagkcggctc cggaacgggt tttcctccca acccggggcc gcaagtctga 1260 

gccctaggcg gccggatcca cgatccggga cccggatcaa ggatcggctc tggaccagcg 1320 

ctgggagcag ctcaaaggcc cgggggctgc gcaggcggct cctctccttc ggcggcgsgc 1380 

gggcaggcgg gctccggcgg ctcctmcgrc cgttggggtg gattactacg gcagcctctt 1440 

atgtttggga catttcaaag gggaattctc t 1471 

<210> 225 
<211> 3302 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (3274) . . (3274) 
<223> n equals a,t,g, or c 

<400> 225 

tcgacccacg cgtccgcgac ccacgcgtcc ggggggaggt aactgcagta agtcccgctt 60 

ggccctggag tccacgcgga ttttcgaagc tggggctggc aagaggccgc tggacaccac 120 

gctccagtcg tcagcccact tcctagctga acagcgcgag gcggcggcag cgagccgggt 180 

cccaccatgg ccgcgaatta ttccagtacc agtacccgga gagaacatgt caaagttaaa 240 

accagctccc agccaggctt cctggaacgg ctgagcgaga cctcgggtgg gatgtttgtg 300 

gggctcatgg ccttcctgct ctccttctac ctaattttca ccaatgaggg ccgcgcattg 360 

aagacggcaa cctcattggc tgaggggctc tcgcttgtgg tgtctcccga cagcatccac 420 

agtgtggctc cggagaatga aggaaggctg gtgcacatca ttggcgcctt acggacatcc 480 

aagcttttgt ctgatccaaa ctatggggtc catcttccgg ctgtgaaact gcggaggcac 540 

gtggagatgt accaatgggt agaaactgag gagtccaggg agtacaccga ggatgggcag 600 

gtgaagaagg agacgaggta ttcctacaac actgaatgga ggtcagaaat catcaacagc 660 

aaaaacttcg accgagagat tggccacaaa aaccccagtg ccatggcagt ggagtcattc 720 

ayggcaacag ccccctttgt ccaaattggc aggtttttcc tctcgtcagg cctcatcgac 780 

aaagtcgaca acttcaagtc cctgagccta tccaagctgg aggaccctca tgtggacatc 840 

attcgccgtg gagacttttt ctaccacagc gaaaatccca agtatccaga gktgggagac 900 

ttgcgtgtct ccttttccta tgctggactg agcggcgatg accctgacct gggcccagct 960 

cacgtggtca ctgtgattgc ccggcagcgg ggtgaccagc tagtcccatt ctccaccaag 1020 

tctggggata ccttactgct cctgcaccac ggggacttct cagcagagga ggtgtttcat 1080 

agagaactaa ggagcaactc catgaagacc tggggcctgc gggcagctgg ctggatggcc 1140 

atgttcatgg gcctcaacct tatgacacgg atcctctaca ccttggtgga ctggtttcct 1200 

gttttccgag acctggtcaa cattggcctg aaagcctttg ccttctgtgt ggccacctcg 1260 

ctgaccctgc tgaccgtggc ggctggctgg ctcttctacc gacccctgtg ggccctcctc 1320 

attgccggcc tggcccttgt gcccatcctt gttgctcgga cacgggtgcc agccaaaaag 1380 

ttggagtgaa aagaccctgg cacccgcccg acacctgcgt gagccctagg atccaggtcc 1440 

tctctcacct ctgacccagc tccatgccag agcaggagcc ccggtcaatt ttggactctg 1500 

cacyccctct cctcttcagg ggccagactt ggcagcatgt gcaccaggtt ggtgttcacc 1560 

agctcatgtc ttccccacat ctcttcttgc cagtaagcag ctttggtggg cagcagcagc 1620 

tcatgaatgg caagctgaca gcttctcctg ctgtttcctt cctctcttgg actgagtggg 1680 

tacggccagc cactcagccc attggcagct gacaacgcag acacgctcta cggaggcctg 1740 

ctgataaagg gctcagcctt gccgtgtgct gcttctcatc actgcacaca agtgccatgc 1800 

tttgccacca ccaccaagca catctgtgat cctgaagggc ggccgttagt cattactgct 1860 

gagtcctggg tcaccagcag acacactggg catggacccc tcaaagcagg cacacccaaa 1920 

acacaagtct gtggctagaa cctgatgtgg tgtttaaaag agaagaaaca ctgaagatgt 1980 

cctgaggaga aaagctggac atatactggg cttcacactt atcttatggc ttggcagaat 2040 

ctttgtagtg tgtgggatct ctgaaggccc tatttaagtt tttcttcgtt actttgctgc 2100 

ttcatgtgta ctttcctacc ccaagaggaa gttttctgaa ataagattta aaaacaaaac 2160 

aaaaaaaaca cttaatattt cagactgtta caggaaacac cctttagtct gtcagttgaa 2220 

ttcagagcac tgaaaggtgt taaattgggg tatgtggttt gattgataaa aagttacctc 2280 

tcagtatttt gtgtcactga gaagctttac aatggatgct tttgaaacaa gtatcagcaa 2340 

aaggatttgt tttcactctg ggaggagagg gtggagaaag cacttgcttt catcctctgg 2400 

catcggaaac tcccctatgc acttgaagat ggtttaaaag attaaagaaa cgattaagag 2460 

aaaaggttgg aagctttata ctaaatgggc tccttcatgg tgacgccccg tcaaccacaa 2520 

tcaagaactg aggcctgagg ctggttgtac aatgcccacg cctgcctggc tgctttcacc 2580 
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tgggagtgct 
acgtgttgtg 
aatttctaag 
gatgtcttaa 
gctgggagta 
tgaaaccaga 
tcttcacatg 
attcactaac 
tttcggaaaa 
ttttttctag 
taatgttaag 
tattaaaagt 
aa 



ttcgatgtgg 
tccatagctt 
ttccccaact 
atgattcttt 
attcctttag 
gatctgtaat 
aaaaacacag 
tgcttattac 
cacgctgtat 
taaggatttt 
aaaaatttga 
tttgtaataa 



gcacctgggc 
tagtcttcct 
actctcacac 
gtgtaccttt 
caattaagta 
catctctatt 
ccagcccaag 
atgagcaatt 
acctagatga 
aaataactgc 
aagctttgga 
atttctaaat 



ttcctagggc 
aaataagatc 
ccttttaaag 
tctgtcatat 
tttggtagct 
ggcctggggt 
atgacttatc 
tcatcaaatc 
tgactaaatg 
cgacttcaaa 
aaaccaaatt 
tatnaaaaaa 



tgcttctgag 
cacccacacc 
ataaagtatg 
tcagaaaccg 
gaataagggg 
gcctgtgcta 
tgggtttagg 
tccaaactct 
caaaatcctt 
agtgttctta 
tgtaatatca 
aaaaaaaaaa 



tggttctttc 2640 

taagtcacag 2700 

ttgtaaccag 2760 

ttttgtgcct 2820 

tcagaacttc 2880 

taaatgagtt 2940 

attcaatagt 3000 

taaaggatgc 3060 

gggctttggt 3120 

aaacgaaaga 3180 

ttgtattttt 3240 

aaaaaaaaaa 3300 

3302 



<210> 226 

<211> 2227 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (289) . . (289) 
<223> n equals a,t,g, or c 



<400> 226 

cggacgcgtg 

gttctgtgtc 

cgtaggctca 

ctttcggtag 

cacatctgca 

acgtggggar 

aacgctagac 

cctcaaaaac 

ttttctaaca 

tagaaagaaa 

gttgggtgag 

gcagggttgt 

ggtggtgtaa 

gcctcctgag 

gtcttgatat 

agacccttgc 

tcgaccctca 

cgagagtcac 

tarccatscc 

gaccctcggc 

gtcacaagcc 

ggcccccaca 

ggragaagaa 

tyagcagggg 

gttcttcgca 

tgggctccat 

tttgtcccag 

cccagaggtt 

tgtcgtacga 

agagcagcac 

cacaggtgca 

catatgtgga 

gcagacattg 

attctgcttg 

aatgctttta 

catgtgcctt 



ggtcgaccca 

tgaattgccg 

gatcttcatt 

ctggagtaac 

gccagcgtaa 

ggcctgacar 

agatggggaa 

gtttttattt 

aataacatga 

aattacccat 

gccgtagcct 

ttttttttct 

acagctcact 

tagctgggat 

gcgaggacgc 

tccatcaccc 

ggctcaagcg 

catgctggcc 

tgtttcaaga 

tgtgagaggg 

catctggctg 

gcagaggcga 

gggcccaggc 

ctyarccagg 

ctcccaccgt 

cagggagccc 

tgcaccttgc 

gacggatata 

acagtacctg 

aacatggaaa 

gttagatgct 

taagctgatt 

gatttggtga 

caaaaaaatc 

ttagccaata 

acttttttaa 



cgcgtccggg 
taggctcaga 
tgaggttatg 
acaggcggcc 
tcctcctggg 
ccaattccca 
tgggaaagaa 
atctgctttt 
acacttctag 
tactctttaa 
ctgccaagtc 
ttgagatgga 
gcagccttga 
cccaggcgag 
tgtgtcttcc 
aggccagagt 
atcctcacgc 
tgaatcttca 
gtgtaggtgg 
aggggtgggc 
ggcatcccct 
gccactgrag 
ccccaggaga 
gctggatgtc 
ccgtgtgaac 
agaagacgtg 
tccttccatt 
ttcagacgtt 
tccatggtct 
gagccaaaat 
gttcttcctt 
aatggttttg 
ctgtctcatt 
tatcatgtgc 
ggaattttaa 
aaaggagttt 



aaaaarggaa 

tcttcatttg 

ttctataagt 

tcacagcgac 

agatgcctcc 

ggctgtcccc 

aagaaagctg 

tctgaaagga 

aaaccctaga 

gccagcatta 

aaggagcccg 

gtctcactct 

ccctgaggct 

agtcaccaca 

ctgccacatt 

gtggtsgtgc 

ctcggacccc 

gggtatttta 

tcaccctgtc 

tgggctggag 

gctgtgtcct 

ggtgragggc 

ctcaggagac 

cggagccagc 

agctccagcc 

tgtgcttctg 

tactatcgag 

acgacacgga 

tcagtatcgt 

gtcacagttc 

tagattttgt 

caactgtaat 

gtgccatgag 

ttttctagat 

aataacatgg 

attgtattca 



aaratgccgt 

aggttctgtg 

taacgttgat 

ctctccagcg 

tcaaggccnt 

acccttgrag 

cagacctcaa 

taaaggcttt 

aaaacacaaa 

tccattgcgg 

gtggtggctg 

tgtcacccca 

caagcgatcc 

ccctgtccat 

ttcttcttct 

gaacacggct 

caaagtgctg 

cggttgaagt 

tctgycgctg 

gaacctraag 

gagctgcaca 

ttccacggac 

cagagcctgg 

cccgmagccc 

ccacctgcgc 

aaattgggtc 

atttaaatgc 

tcaggacggc 

atgaccctgg 

ctatctgtga 

cacgtgggga 

agtagctgta 

gtaaatgtaa 

gtctctggyt 

aacttacaca 

ttggaatatg 



gtaaaatctc 60 

tctgaattgc 120 

cttgtgtgag 180 

ccttccaagg 240 

gctccagacc 300 

agtgacccta 360 

gttaaaattc 420 

ttgaaaatta 480 

gtattcaaaa 540 

tgcttttgga 600 

tggcattcct 660 

gctggaatgt 720 

ttctgccttg 780 

gttcctgcag 840 

ttcttgagac 900 

cactgcagcc 960 

ggatcacagg 1020 

gtcacttact 1080 

acctggcctg 1140 

ccc tcgtgat 1200 

tgccccaggt 1260 

ggtcttcagg 1320 

gg t caggggc 1380 

tgkgktcttt 1440 

ctccctgtgc 1500 

cctacatgcc 1560 

ctgttttctc 1620 

tggattcagg 1680 

cctctcgtga 1740 

gggaa tggag 1800 

cccagctgta 1860 

tcgttctaat 1920 

tgtttcaggc 1980 

ctatagtgca 2040 

aaaggctttt 2100 

tgacgtaagc 2160 
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aataaaggga atgttagacg tgtaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2220 



<210> 227 
<211> 2214 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (289) . . (289) 
<223> n equals a,t,g, or c 

<400> 227 

cggacgcgtg ggtcgaccca cgcgtccggg aaaaarggaa aaratgccgt gtaaaatctc 60 

gttctgtgtc tgaattgccg taggctcaga tcttcatttg aggttctgtg tctgaattgc 120 

cgtaggctca gatcttcatt tgaggttatg ttctataagt taacgttgat cttgtgtgag 180 

ctttcggtag ctggagtaac acaggcggcc tcacagcgac ctctccagcg ccttccaagg 240 

cacatctgca gccagcgtam tcctcctggg agatgcctcc tcaaggccnt gctccagacc 300 

acgtgggrar ggcctgacaa gccaattccc aggctgtccc cacccttgra gagtgaccct 360 

aaacgctaga cagatgggga atgggaaaga aaagaaagct gcagacctca agttaaaatt 420 

ccctcaaaaa cgtttttatt tatctgcttt ttctgaaagg ataaaggctt tttgaaaatt 480 

attttctaac aaataacatg aacacttcta gaaaccctag aaaaacacaa agtattcaaa 540 

atagaaagaa aaattaccca ttactcttta agccagcatt atccattgcg gtgcttttgg 600 

agttgggtga ggccgtagcc tctgccaagt caaggagccc ggtggtggct gtggcattcc 660 

tgcagggttg tttttttttc tttgagatgg agtctcactc ttgtcacccc agctggaatg 720 

tggtggtgta aacagctcac tgcagccttg accctgaggc tcaagcgatc cttctgcctt 780 

ggcctcctga gtagctggga tcccaggcga gagtcaccac accctgtcca tgttcctgca 840 

ggtcttgata tgcgaggacg ctgtgtcttc cctgccacat tttcttcttc tttcttgaga 900 

cagacccttg ctccatcacc caggccagag tgtggtsgtg cgaacacggc tcactgcagc 960 

ctcgaccctc aggctcaagc gatcctcacg cctcggaccc ccaaagtgct gggatcacag 1020 

gcgagagtca ccatgctggc ctgaatcttc agggtatttr cggttgargt gycacttact 1080 

tarccatscc tgtttcaaga gtgtaggtgg tcaccctgtc tctgccgctg acctggcctg 1140 

gaccctcggc tgtgagaggg aggggtgggc tgggctggag gaacctraag ccctcgtgat 1200 

gtcacaagcc catctggctg ggcatcccct gctgtgtcct gagctgcaca tgccccaggt 1260 

ggcccccaca gcagaggcga gccactgrag ggtgragggc ttccacggac ggtcttcagg 1320 

ggragaagaa gggcccaggc ccccaggaga ctcaggagac cagagcctgg ggtcaggggc 1380 

tmagcagggg ctyarccagg gctggatgtc cggagccagc cccgmagccc tgkgktcttt 1440 

gttcttcgca ctcccaccgt ccgtgtgaac agctccagcc ccacctgcgc ctccctgtgc 1500 

tgggctccat cagggagccc agaagacgtg tgtgcttctg aaattgggtc cctacatgcc 1560 

tttgtcccag tgcaccttgc tccttccatt tactatcgag atttaaatgc ctgttttctc 1620 

cccagaggtt gacggatata ttcagacgtt acgacacgga tcaggacggc tggattcagg 1680 

tgtcgtacga acagtacctg tccatggtct tcagtatcgt atgaccctgg cctctcgtga 1740 

agagcagcac aacatggaaa gagccaaaat gtcacagttc ctatctgtga gggaatggag 1800 

cacaggtgca gttagatgct gttcttcctt tagattttgt cacgtgggga cccagctgta 1860 

catatgtgga taagctgatt aatggttttg caactgtaat agtagctgta tcgttctaat 1920 

gcagacattg -gatttggtga ctgtctcatt gtgccatgag gtaaatgtaa tgtttcaggc 1980 

attctgcttg caaaaaaatc tatcatgtgc ttttctagat gtctctggtt ctatagtgca 2040 

aatgctttta ttagccaata ggaattttaa aataacatgg aacttacaca aaaggctttt 2100 

catgtgcctt acttttttaa aaaggagttt attgtattca ttggaatatg tgacgtaagc 2160 

aataaaggga atgttagacg tgtaaaaaaa aaaaaaaaaa aaaaaaaaaa aaag 2214 

<210> 228 
<211> 1145 
<212> DNA 

<213> Homo sapiens 



aaaaaaa 



2227 



<400> 228 



tttttttttt tttttttttt ttgactgaac taagtggctt ttttattaga gaaagccaga 
attacaaaag acttcccttt tcttggggta tggctgtctc agcacaatac tcaacataac 
tgcagaactg atgtggctca ggcaccctgg ttttaattcc ttgaggatct ggcaattggc 



60 
120 
180 
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ttacgcaaaa ggtcaccatt tgaggtcctg ccttactaat tatgtgctgc ccaacaacta 240 

aatttgtaat ttgtttttct ctagtttgag cagggtctga attttttcat ttatttcctt 300 

ttttgccagc agacagactt gagtctgtaa agacaagcaa atacactgac agaagtttac 360 

catagtttct aaaatgtaaa aaagaaaacc cccaaaagac tcaagaaaat tagaccacaa 420 

attttgcatt gttcattgta gcactattgg taataaaata acaaatgttt gtgcattttt 480 

atgtgaagat ccttctcgta tttcatttgg aaagatgagc aagaggtctg cttccttcat 540 

tttacttccc cttctgtttt tgaaaggcag tttcgccaag cttaatgcaa gaatatctga 600 

ctgtttagaa gaaagatatt gccacaatct ctggatggtt ttccagggtt gtgttattac 660 

tgagcttcat ctttccagaa tgagcaaaac actgtccagt ctttgttacg attttgtaat 720 

aaatgtgtac atttttttta aatttttgga catcacatga ataaaggtat gtatgtacga 780 

atgtgtatat attatatata tgacatctat tttggaaaat gtttgccctg ctgtacctca 840 

tttttaggag gtgtgcatgg atgcaatata tgaaaatggg acattctgga actgctggtc 900 

aggggacttt gtcgccctgt gcactaaaag ggccagattt tcagcagcca aggacatcca 960 

tacccaagtg aatgtgatgg gacttaaaag aagtgaactg agacaattca ctctggctgt 1020 

ttgaacagca gcgtttcata ggaagagaaa aaaagatcaa tcttgtattt tctgaccaca 1080 

taaaggcttc ttctctttgt aataaagtag aaaagctctc ctcaaaaaaa aaaaaaaaaa 1140 

aaaaa 1145 

<210> 229 
<211> 802 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (337) . . (337) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (359) . . (359) 
<223> n equals a,t,g, or c 

<400> 229 

ggtgggtgac cagagagtcc tgtctatcct aggaggagaa cattcagccc aaatcccagc 60 

cccatcatgc acagatcaga gccatttctg aaaatgtcgc tgctgattct gcttttcctg 120 

ggattggcag aagcctgtac tcctcgtgaa gtcaacttgc tgaaagggat cataggtctc 180 

atgagcagac tgtcaccgga tgagatccta ggcttgctga gcctccaagt actgcatgaa 240 

gaaacaagtg gctgcaagga ggaagttaaa cccttctcag gcaccacccc atccaggaaa 300 

ccactcccca agagggaaga acacgtggaa yttcctngaa atgcgsctac atggtgrtng 360 

acctacctct tcgtatccta caacaaaggg gactggttca ctttttcctc ccaagtgtta 420 

ctgccaytac tgtaacttgg aactggacat cagggatgat ccctgctgtt ctttctagtg 480 

agcctgctcc atctcagctt agccttcaca aggcctccat ctcccaggca ttctaacctc 540 

tgaagaaagc tctctgtccc ctggactgcc tgtgtggagg gtaatgaact gggtccttta 600 

aggaatggca cctgggtgcc cagaggcatg gccagaaggt gtctgtgggg gccatgcctt 660 

agggggatgc acccagggcg gctgagagag caactgcagg agtttcccct aaaatctctc 720 

ctccagatcg ttctcgaact ttccccacta cttccataat aaaatgtata cttgttgaaa 780 

aaaaaaaaaa aaaaaactcg ag 802 

<210> 230 
<211> 711 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (345).. (345) 
<223> n equals a,t,g, or c 

<400> 230 

ggcacgagcg aagaccctgt tcggaccctg ccccgattcc agactcaggt agatcgtcgg 60 
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cataccctct accgtggaca ccaggcagcc ctggggctga tggagagaga tcaggtatcc 120 

cccagggagt aggggctacc ttgaggggat gatagacctc ccccactccc agtgkkactc 180 

tggaaatatg aaggaactag ggagtggaag agatttcaga gctggggaga ggagttcctc 240 

ccttcaaagc cagcaactgc ctttggggaa tgtcgggggg tctctccttt ctcctgcttg 300 

tttraggtgg tacacagtcc ccccttcamc tggsgggaag ctgtnccgga car ac teat c 360 

tcagctttcc cttggggcag gateggggge agcagctcca gcagaaacag caggatctgg 420 

agcaggaagg cctcgaggcc acacaggggc tgctggccgg cgagtgggcc ccacccctct 480 

ggragctggg cagcctcttc caggccttcg tgaagaggga gagecagget tatgegtaag 540 

cttcatagct tctgctggcc tggggtggac ccaggacccc tggggcctgg gtgecctgag 600 

tggtggtaaa gtggagcaat cccttcacgc tccttggcca tgttctgagc ggccagcttg 660 

gcctttgcct taataaatgt gctttatttt caaaaaaaaa aaaaaaaaac t 711 

<210> 231 
<211> 1614 
<212> DMA 

<213> Homo sapiens 
<400> 231 

ggtgattggt agttactatg tggggacaca attacttggg ctgaaataat ccacctgttg 60 

tggttggggt cctctggggc attccagggt gagaggttgt cactgccacc tgggccatgt 120 

gggccggcac cagcattttg tggttacgaa ttctacagtc acaaatatct ttgggcaaat 180 

ccccttctat acctcaaggc agcttttggt ttgcaacccc actggecaga gggaagggee 240 

agtcacttgg ctctctcact gccctgcgcc ccagatggtt etagggctge tgttttccct 300 

tggccctgcc aacaccactg tttttacttc tgctcattgg ctgagtgcag tggttcctgg 360 

aagccagtgg cacgtttccc cgcgtagctc gcttatccca cagcacacac ccaagggttc 420 

tgttgctaac aegctgaatt aattctttgc tcatcttaca gagtgtgttt tgactgcccc 480 

catttctgag gccttgtaag gecagagett tgttgcttca teggcaggtt gggacttaga 540 

tggccgtgaa tgtttcctct ctgctgctgc agtaagtaag tgcccgcacc atagtgtgtt 600 

tggaggctga agttgaagcg aggctgtgag gggagatgga cgtgtgagga gggatgatgg 660 

ggcttgagca aagtggggga ggggcaaagc agttggccca acacattccc cacccctttg 720 

agaggtctga ggectgeaga cctggctcgg agcccacctg gtagtcctca gactgtgtgt 780 

gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtaaaagaga gaagttgtgg 840 

agaaatgggg ggctgattct gctcagattc atcaggatga §tagaaggca cccagctctc 900 

accctggcct gacatgtgtg tccctgagca ggttacagtc ctctctgagc ctctgcttcc 960 

catctggacc ctgctgggca gggcttctga gctccttagc actagcagga ggggctccag 1020 

gggccctccc tccatggcag ccaggacagg actctaaaat gaggacagca gagctcgtgg 1080 

ggggctccca cggacccgcc gtgggcccag gggaggcaga gcctgagcca acagcagtgg 1140 

tgctgtggac cgtggatcct gagggtggcc tggggcaagt aceggctgag ggtccaggtg 1200 

ggctttgtgt acctttgggt cctggggccc tggtgacttg gactccaggt tagagtcaag 1260 

tgacaggaga aaggctggtg gggccctgtg cttccgactt catttcgagt gatggcagtt 1320 

cccaggaagg aatccacagc tgacggtggc tgacagatca gagaatggaa ggcgaggcag 1380 

gegggegtet gcgtgacctc aggtgcttgg ggcccagcag acccagagaa ccatttccac 1440 

taggecaggg tgccggaagt gtccacaggt cttagattcc ctgt.tcagat gaaaagattt 1500 

gtgcctttaa tgataaaagt gatctgeata gagtcaaaaa ttcaagecat gggtataaaa 1560 

tgcaagtaaa atccctgccc tcacctatcc caccctacta cacagagatg tcct 1614 

<210> 232 
<211> 1087 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> (14) . . (14) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (55) . . (55) 

<223> n equals a,t,g, or c 
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<220> 

<221> misc_f eature 

<222> (63).. (64) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (174).. (174) 
<223> n equals a,t,g, or c 

<400> 232 

caagttaaag taangtggcc ccggcaacca ataagtgttg tttttggaag ggctngaaag 60 

ttnnaaagcg agggcttgta aaggggaaga tgggaccgtt gtgaaggaag gatgattggg 120 

gctttgaagc aaaagtgggg gaagggggca aaggcagttg gcccaacaca ttcnccaccc 180 

ctttgagagg tctgaggcct gcagacctgg ctcggagccc acctggtagt cctcagactg 240 

tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtaaa agagagaagt 300 

tgtggagaaa tggggggctg attctgctca gattcatcag gatgagtaga aggcacccag 360 

ctctcaccct ggcctgacat gtgtgtccct gagcaggtta cagkcctctc tgagcctctg 420 

cttcccatct ggaccctgct gggcagggct tctragctcc ttagcactag caggaggggc 480 

tccaggggcc ctccctccat ggcagccagg acaggactct aaaatgagga cagcagagct 540 

ccftggggggc tcccacggac ccgccktggg cccaggggag gcagagcctg agccaacagc 600 

agtggtgctg tggaccgtgg atcctgaggg tggcctgggg caagtaccgg ctgagggtcc 660 

aggtgggctt tgtgtacctt tgggtcctgg ggccctggtg acttggactc caggttagag 720 

tcaagtgaca ggagaaaggc tggtggggcc ctgtgcttcc gacttcattt cgagtgatgg 780 

cagttcccag gaaggaatcc acagctgacg gtggctgaca gatcagagaa tggaaggcga 840 

ggcaggcggg cgtctgcgtg acctcaggtg cttggggccc agcagaccca gagaaccatt 900 

tccactaggc cagggtgccg gaagtgtcca caggtcttag attccctgtt cagatgaaaa 960 

gatttgtgcc tttaatgata aaagtgatct gcatagagtc aaaaattcaa gccatgggta 1020 

taaaatgtca agtaaaatcc ctgccctcac ctatcccacc ctactacaca gagatgtcct 1080 

ctcgagg 1087 

<210> 233 

<211> 1191 

<212> DNA 

<213> Homo sapiens 

<400> 233 

gctgggctgg aacacaagar cccacagggc tgccgtccac actctcccgg tcagagtcct 60 

gggaccacat ggggacgctg ccatggcttc ttgccttctt cattctgggt ctccaggctt 120 

gggatactcc caccatcgtc tcccgcaagg agtggggggc aagaccgctc gcctgcaggg 180 

ccctgctgac cctgcctgtg gcctacatca tcacagacca gctcccaggg atgcagtgcc 240 

agcagcagag cgtttgcagc cagatgctgc gggggttgca gtcccattcc gtctacacca 300 

taggctggtg cgacgtggcg tacaacttcc tggttgggga tgatggcagg gtgtatgaag 360 

gtgttggctg gaacatccaa ggcttgcaca cccagggcta caacaacatt tccctgggca 420 

tcgccttctt tggcaataag ataagcagca gtcccagccc tgctgcctta tcagctgcag 480 

agggtctgat ctcctatgcc atccagaagg gtcacctgtc gcccaggtat attcagccac 540 

ttcttctgaa agaagagacc tgcctggacc ctcaacatcc agtgatgccc agraaggttt 600 

gccccaacat catcaaacga tctgcttggg aagccagaga gacacactgc cctaaaatga 660 

acctcccagc caaatatgtc atcatcatcc acaccgctgg cacaagctgc actgtatcca 720 

cagactgcca gactgtcgtc cgaaacatac agtcctttca catggacaca cggaactttt 780 

gtgacattgg atatcaataa ggccaggcgt ggcggcgatt acgtctgtaa tcccaggact 840 

ttgggaggcc aaggcgggca gatcacttca ggccaggaat tcaagagcag cctggccaat 900 

atggcgaaac tctgtctcta ctgaaaacaa acaaacaaac aaacaaacaa acaaagaaac 960 

aacaaaaatt agccgggtgt ggtggcacac gcctgtagtc ccagctactc aggaggctga 1020 

ggcataagaa ttgcttgaac cctggaggcg gaggttgcag tgagctgaga ttgggccacc 1080 

gcactccagt ctgggagaca gagtgagact gtctcaaaac aacaacaaaa aaatccctaa 1140 

cataatctca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa agggcggccg c 1191 

<210> 234 
<211> 1626 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (525) . . (525) 
<223> n equals a,t,g, or c 

<220> 

<221> misc — feature 
<222> (542) . . (542) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (562) . . (562) 
<223> n equals a,t,g r or c 

<220> 

<221> misc_feature 
<222> (607) . . (607) 
<223> n equals a,t,g, or c 

<400> 234 

ccacgcgtcc gacgcggcgc acgcggcagt cctgatggcc cggcatgggt taccgctgct 60 

gcccctgctg tcgctcctgg tcggcgcgtg gctcaagcta ggaaatggac aggctactag 120 

catggtccaa ctgcagggtg ggagattcct gatgggaaca aattctccag acagcagaga 180 

tggtgaaggg cctgtgcggg aggcgacagt gaaacccttt gccatcgaca tatttcctgt 240 

caccaacaaa gatttzcaggg attttgtcag ggagaaaaag tatcggacag aagctgagat 300 

gtttggatgg agctttgtct ttgaggactt tgtctctgat gagctgagaa acaaagccac 360 

ccagccaatg aagtctgtac tctggtggct tccagtggaa aaggcatttt ggaggcagcc 420 

tgcaggtcct ggctctggca tccgagagag actggagcac ccagtgttac acgtgagctg 480 

gratgacgcc cgtgcctaat gtgcytkgsg ggggraaacg actgncccac sggagggaag 540 

antggggagt ttttccgccc gnaggggggc ttgaarggtc caagtttacc ccatgggggg 600 

aactggnttc cagccaaacc gcaccaacct gtggcaggga aagttcccca agggagacaa 660 

agctgaggat ggcttccatg gagtctcccc agtgaatgct ttccccgccc agaacaacta 720 

cgggctctat gacctcctgg ggaacgtgtg ggagtggaca gcatcaccgt accaggctgc 780 

tgagcaggac atgcgcgtcc tccggggggc atcctggatc gacacagctg atggctctgc 840 

caatcaccgg gcccgggtca ccaccaggat gggcaacact ccagattcag cctcagacaa 900 

cctcggtttc cgctgtgctg cagacgcagg ccggccgcca ggggagctgt aagcagccgg 960 

gtggtgacaa ggagaaaagc cttctagggt cactgtcatt ccctggccat gttgcaaaca 1020 

gcgcaattcc aagctcgaga gcttcagcct caggaaagaa cttccccttc cctgtctccc 1080 

atccctctgt ggcaggcgcc tctcaccagg gcaggagagg actcagcctc ctgtgttttg 1140 

gagaaggggc ccaatgtgtg ttgacgatgg ctgggggcca ggtgtttctg ttagaggcca 1200 

agtattattg acacaggatt gcaaacacac aaacaattgg aacagagcac tctgaaaggc 1260 

cattttttaa gcattttaaa atctattctc tccccctttc tccctggatg attcaggaag 1320 

ctgmacattg tttcctcaag gcagaatttt cctggbtctg ttttctcagc cagttgctgt 1380 

ggaaggagaa tgctttcttt gtggcctcat ctgtggtttc gtgtccctct gaaggaaact 1440 

agtttccact gtgtaacagg cagacatgta actatttaaa gcacagttca gtcctaaaag 1500 

ggtctgggag aaccagatga tgtactaggt gaagcattgc attgtgggaa tcacaaagca 1560 

aatagtactc cagaaagacc ctgtctcaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1620 

aaaaaa 1626 

<210> 235 
<211> 2351 
<212> DNA 

<213> Homo sapiens 
<400> 235 

ccacgcgtcc ggcagaagca gcagcagcag aagacacagc gccggtccag gaggcggctc 60 
gagctgttcg taaagtcgcc cgacagcttt ttctccgtag tatgcgagtt gacaaaacag 120 
ccagagaaca gggctcccca ttacaatctt ttcgagatct tttcccttgc taaccggatc 180 
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tgatttgtgc gaaaacatgc cttgcacttg tacctggagg aactggagac agtggattcg 240 

acctttagta gcggtcatct acctggtgtc aatagtggtt gcggttcccc tatgcgtgtg 300 

ggaattacag aaactggagg ttggaataca caccaaggct tggtttattg ctggaatctt 360 

tttgctgtga ctattcctat atcactgtgg gtgatattgc aacacttagt gcattataca 420 

caacctgaac tacaaaaacc aataataagg attctttggg atggtaccta tttacagttt 480 

tagatagttg gatagctttg aaatatcccg gaattgcaat atatgtggat acctgcagag 540 

aatgctatga agcttatgta atttacaact ttatgggatt ccttaccaat tatctaacta 600 

accggtatcc aaatctggta ttaatccttg aagccaaaga tcaacagaaa catttccctc 660 

ctttatgttg ctgtccacca tgggctatgg gagaagtatt gctgtttagg tgcaaactaa 720 

gtgtattaca gtacacagtt gtcagacctt tcaccaccat cgttgcttta atctgtgagc 780 

tgcttggtat atatgacgaa gggaacttta gcttttcaaa tgcttggact tatttggtta 840 

taataaacaa catgtcacag ttgtttgcca tgtattgtct cctgctcttt tataaagtac 900 

taaaagaaga actgagccca atccaacctg ttggcaaatt tctttgtgta aagctggtgg 960 

tttttgtttc tttttgattt ggcgtttacc ttttcctaac atataggcaa gcagtagtta 1020 

ttgctttgtt ggtaaaagtt ggcgttattt ctgaaaagca tacgtgggaa tggcaaactg 1080 

tagaagctgt ggccaccgga ctccaggatt ttattatctg tattgagatg ttcctcgctg 1140 

ccattgctca tcattacaca ttctcatata aaccatatgt ccaagaagca gaagagggct 1200 

catgctttga ttcctttctt gccatgtggg atgtctcaga tattagagat gatatttctg 1260 

aacaagtaag gcatgttgga cggacagtca ggggacatcc caggaaaaaa ttgtttcccg 1320 

aggatcaaga tcaaaatgaa catacaagtt tattatcatc atcatcacaa gatgcaattt 1380 

ccattgcttc ttctatgcca ccttcaccca tgggtcacta ccaagggttt ggacacactg 1440 

tgactcccca gactacacct accacagcta agatatctga tgaaatcctt agtgatacta 1500 

taggagagaa aaaagaacct tcagataaat ccgtggattc ctgaacagta tggaaaagca 1560 

aactgtgcaa ctactacatt atatcattac ctggtatccc atggattttg tgcttgggac . 1620 

agaccataaa tgatggaaaa tgtcaacaca aaaatagctg aaagccaggt acaactactg 1680 

catttatata tgtaagtttt gtatatcaaa aataattggt ctaaatttcc tagacttaga 1740 

cttgatttct taacattagg gtatcgcata ctcaaatggt agacaatgac cccaactaaa 1800 

tcttcctgat gttacactgc tttatcaaga ggatggactt tttttttttt gagacagaca 1860 

gagtcttgct ctgtcaccca ggctggagtg cagtggcgca atctcgggtc actgcaagct 1920 

ctgcctccca agttcatgcc attctcctgc ctcagccctc ccaagtagct gggactacag 1980 

gcacctgcca ccatgcccag ctaatttttt ttttttcagt agagacaggg tctcaccatg 2040 

ttagccagga tggtcttgat ctgacctcgt gatccgccga cctcggcctc ccaaagtgct 2100 

ggaattacag gcgtgagcca ctgcgcctgg ccaagaatgg acatttttta aaaaaacatc 2160 

agtacttcct accactgctg catgagtata atgctccgga attatcagaa agcataatgc 2220 

agaaatacga attagtggaa cttaatcatg tgccatataa gcttacctaa caaacagtta 2280 

tatccctatt cctcaactga atgtctttca ataaataaga atttatcatt taaaaaaaaa 2340 

aaaaaaaaaa a 2351 

<210> 236 
<211> 1001 
<212> DNA 

<213> Homo sapiens 
<400> 236 

cgcgctggaa ccctgtggcg gcggccatgg ccatatggcg ctgcccgcct ggctgcagcc 60 

aggtatagga agaatgcgta tcttttcatc tattacttaa tccagttctg tggccactct 120 

tggatattta caaatatgac agtcagattc ttttcatttg gaaaaggtaa aactccgaaa 180 

cagttttttt atttttaact tttaatcctt gttttcacct catcctgctt atattaaatt 240 

tctacacacc tcaaccttct accacgggat acagattcaa tggttgacac tttttatgct 300 

attggacttg tgatgcgact ttgccaatcc gtatctctcc tggaactgct gcacatatat 360 

gttggcattg agtcaaacca tcttctccca aggtttttgc agctcacaga aagaataatc 420 

atcctttttg tggtgatcac cagtcaagag gaagtccaag agaaatatgt ggtgtgtgtt 480 

ttattcgtct tttggaatct attggatatg gttaggtaca cttatagcat gttatcagtc 540 

ataggaatat cctatgctgt cttgacatgg ctcagtcaaa cactatggat gccaatttat 600 

cctttgtgtg ttcttgctga agcatttgcc atctatcaat cgctgcctta ttttgaatca 660 

tttggcactt attccaccaa gctgcccttt gacttatcca tctatttccc atatgtgctg 720 

aaaatatatc tcatgatgct ctttataggt atgtatttta cctacagtca tctatactca 780 

kaaagaagag acatcctcgg aatctttccc attaaaaaaa agaagatgtg aagtacagca 840 

ttccagtgtg acacgagaaa agacaggctg tggattcagt gcagtaaata aaacacagga 900 

agtattctgg tggaaaaaaa aaaaaaaaaa aaaaaaaaar aaraaaaaaa aa\-/aaaaaaa 960 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 1001 
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<210> 237 
<211> 669 
<212> DNA 
<213> Homo sapiens 

<400> 237 

ccacgcgtcc ggacactttt tatgctattg gacttgtgat gcgactttgc caatccgtat 

ctctcctgga actgctgcac atatatgttg gcattgagtc aaaccatctt ctcccaaggt 

ttttgcagct cacagaaaga ataatcatcc tttttgtggt gatcaccagt caagaggaag 

tccaagagaa atatgtggtg tgtgttttat tcgccttttg gaatctattg gatatggtta 

ggtacactta tagcatgtta tcagtcatag gaatatccta tgctgtcttg acatgggctc 

agtcaaacac tatggatgcc aatttatcct ttgtgtgttc ttgctgaagc atttgccatc 

tatcaatcgc tggcttattt tgaatcattt ggcacttatt ccaccaagct gccctttgac 

ttatccatct atttcccata tgtgctgaaa atatatctca tgatgctctt tataggtatg 

tattttacct acagtcatct atactcagaa agaagagaca tcctcggaat ctttcccatt 

aaaaaaaaga agatgtgaag tacagcattc cagtgtgaca cgagaaaaga caggctgtgg 

attcagtgca gtaaataaaa cacaggaagt attctggtgg aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaa 

<210> 238 
<211> 417 
<212> DNA 
<213> Homo sapiens 

<400> 238 

ccacgcgtcc gctcctctag aggctccaca tgaagtccca gtgctacagt cctagttatt 60 

ttgccttctt ctgcctggtt ttctttcaga tcacctcagc cagttctcag acacttaggg 120 

gacatgttct ctgcaggacc actctgaggg actcttctgc atattgctga cctgagagga 180 

tggcctcaga gctgacttgg gcaatcctcc ccaacaggaa ggggagacat tgcctgccac 240 

tgaggaaaca ggtcatgaag gtggagataa gctgcaaggg gcgaagcaac tttatgtcag 300 

tggaaaacgt gtctctttaa agctgctatg tgaacagctt ttacagtcat taaatttacc 360 

taaactaagg ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 417 

<210> 239 

<211> 1949 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> (1130) . . (1130) 
<223> n equals a,t,g, or c 

<220> 

<221> mis c_feature 
<222> (1948) . . (1948) 
<223> n equals a,t,g, or c 

<400> 239 

gctacgccgt gggcacttcc tcaacgacct gtgcgcgtcc atgtggttca cctacctgct 60 

gctctacctg cactcggtgc gcgcctacag ctcccgcggc gcgggctgct gctgctgctg 120 

ggccaggtgg cgacgggctg tgcacaccgc tcgtgggcta cgaggccgac cgcgccgcca 180 

gctgctgcgc ccgctacggc ccgcgcaagg cctggcacct ggtcggcacc gtctgcgtcc 240 

tgctgtcctt ccccttcatc ttcagcccct gcctgggctg tggggcggcc acgccgagtg 300 

ggctgccctc ctctactacg gcccgttcat cgtgatcttc cagtttggct gggcctccac 360 

acagatctcc cacctcagcc tcatcccgga gctcgtcacc aacgaccatg agaaggtgga 420 

gctcacggca ctcaggtatg cgttcaccgt ggtggccaac atcaccgtct acggcgccgc 480 

ctggctcctg ctgcacctgc agggctcgtc gcgggtggag cccacccaag acatcagcat 540 

cagcgaccag ctggggggcc aggacgtgcc cgtgttccgg aacctgtccc tgctggtggt 600 

gggtgtcggc gccgtgttct cactgctatt ccacctgggc acccgggaga ggcgccggcc 660 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
669 
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gcatgcggag gagccaggcg agcacacccc cctgttggcc cc tgccacgg cccagcccct 720 

gctgctctgg aagcactggc tccgggagcc ggctttctac caggtgggca tactgtacat 780 

gaccaccagg ctcatcgtga acctgtccca gacctacatg gccatgtacc tcacctactc 840 

gctccacctg cccaagaagt tcatcgcgac cattcccctg gtgatgtacc tcagcggctt 900 

cttgtcctcc ttcctcatga agcccatcaa caagtgcatt gggaggaaca tgacctactt 960 

ctcaggcctc ctggtgatcc tggcctttgc cgcctgggtg gcgctggcgg agggactggg 1020 

tgtggccgtg taygcagcgg ctgtgctgct gggtgctggc tgtgccacca tcctcgtcac 1080 

ctcgctggcc atgacggccg acctcatcgg tccccacacg aacagcggan ckttcgtgta 1140 

cggctccatg agcttcttgg ataaggtggc caatgggctg gcagtcatgg ccatccagag 1200 

cctgcaccct tgcccctcag agctctgctg cagggcctgc gtgagctttt accactgggc 1260 

gatggtggct gtgacgggcg gcgtgggcgt ggccgctgcc ctgtgtctct gtagcctcct 1320 

gctgtggccg acccgcctgc gacgctggga ccgtgatgcc cggccctgac tcctgacagc 1380 

ctcctgcacc tgtgcaaggg aactgtgggg acgcacgagg atgcccccca gggccttggg 1440 

gaaaagcccc cactgcccct cactcttctc tggaccccca ccctccatcc tcacccagct 1500 

cccgggggtg gggtcgggtg agggcagcag ggatgcccgc cagggacttg caaggacccc 1560 

ctgggttttg agggtgtccc attctcaact ctaatccatc ccagccctct ggaggatttg 1620 

gggtgcccct ctcggcaggg aacaggaagt aggaatccca gaagggtctg ggggaaccct 1680 

aaccctgagc tcagtccagt tcacccctca cctccagcct gggggtctcc agacactgcc 1740 

agggccccct caggacggct ggagcctgga ggagacagcc acggggtggt gggctgggcc 1800 

tggaccccac cgtggtgggc agcagggctg cccggcaggc ttggtggact ctgctggcag 1860 

caaataaaga gatgacggca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1920 

aaaaaaaaaa aggggggggg gctagttnt 1949 

<210> 240 

<211> 1487 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> miscJEeature 

<222> (78) . . (78) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (948).. (948) 
<223> n equals a,t,g, or c 

<400> 240 

ccgctgctga taactatggc atcccccggg cctgcaggaa ttcggcacgg agctacggcg 60 

ccgcctggct cctggtgnca cctgcaggct cgtcgcgggt ggagcccacc caagacatca 120 

gcatcagcga ccagctgggg ggccaggacg tgcccgtgtt ccggaacctg tccctgctgg 180 

tggtgggtgt cggcgccgtg ttctcactgc tattccacct gggcacccgg gagaggcgcc 240 

ggccgcatgc ggasgagcca ggcgagcaca cccccctgtt ggcccctgcc acggcccagc 300 

ccctgctgct ctggaagcac tggctccggg agcsggcttt ctaccaggtg ggcatactgt 360 

acatgaccac caggctcatc gtgaacctgt cccagaccta catggccatg tacctcacct 420 

actcgctcca cctgcccaag aagttcatcg cgaccattcc cctggtgatg tacctcagcg 480 

gcttcttgtc ctccttcctc atgaagccca tcaacaagtg cattgggagg aacatgacct 540 

acttctcagg cctcctggtg atcctggcct ttgccgcctg ggtggcgctg gcggagggac 600 

tgggtgtggc cgtgtacgca gcggctgtgc tgctgggtgc tggctgtgcc accatcctcg 660 

tcacctcgct ggccatgacg gccgacctca tcggtcccca cacgaacagc ggagckttcg 720 

tgtacggctc catgagcttc ttggataagg tggccaatgg gctggcagtc atggccatcc 780 

agagcctgca cccttgcccc tcagagctct gctgcagggc ctgcgtgagc ttttaccact 840 

gggcgatggt ggctgtgacg ggcggcgtgg gcgtggccgc tgccctgtgt ctctgtagcc 900 

tcctgctgfcg gccgacccgc ctgcgacgct gatgagacct gcacgcantg gctcacagca 960 

gcacgatttg tgacagcccg aggcggagaa caccgaacac ccagtgaagg tgaggggatc 1020 

agcacggcgc ggccacccac gcacccacgc gctggaatga gactcagcca caaggaggtg 1080 

cgaagctctg acccaggcca cagtgcggat gcaccttgag gatgtcacgc tcagtgagag 1140 

acaccagaca cagaagggta cgctgtgatc ccacttctat gaaatgtcca ggacagacca 1200 

atccacagaa tcagggagag gattcgtggg tgccgggact ggggaggggg acctgggggt 1260 

gactaggtga ca taatgggg acagggctgc cttctgggtg atgagaatgt tctggaatca 1320 
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gatgggatgg ctgcacggcg tggtgaaggt actgaacgcc acctcactgt aagacggtag 1380 
attttgtatt ttaccacaat aaacaaaaca aaacaaaacc aaaaaaaaaa aaaaaaaaaa 1440 



<210> 241 

<211> 1525 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> (78) . . (78) 

<223> n equals a,t,g,. or c 

<400> 241 

ccgctgctga taactatggc atcccccggg cctgcaggaa ttcggcacgg agctacggcg 60 

ccgcctggct cctgctgnca cctgcaggct cgtcgcgggt ggagcccacc caagacatca 120 

gcatcagcga ccagctgggg ggccaggacg tgcccgtgtt ccggaacctg tccctgctgg 180 

tggtgggtgt cggcgccgtg ttctcactgc tattccacct gggcacccgg gagaggcgcc 240 

ggccgcatgc ggasgagcca ggcgagcaca cccccctgtt ggcccctgcc acggcccagc 300 

ccctgctgct ctggaagcac tggctccggg agcsggcttt ctaccaggtg ggcatactgt 360 

acatgaccac caggctcatc gtgaacctgt cccagaccta catggccatg tacctcacct 420 

actcgctcca cctgcccaag aagttcatcg cgaccattcc cctggtgatg tacctcagcg 480 

gcttcttgtc ctccttcctc atgaagccca tcaacaagtg cattgggagg aacatgacct 540 

acttctcagg cctcctggtg atcctggcct ttgccgcctg ggtggcgctg gcggagggac 600 

tgggtgtggc cgtgtacgca gcggctgtgc tgctgggtgc tggctgtgcc accatcctcg 660 

tcacctcgct ggccatgacg gccgacctca tcggtcccca cacgaacagc ggactktcgt 720 

gtacggctcc atgagcttct tggataaggt ggccaatggg ctggcagtca tggccatcca 780 

gagcctgcac ccttgcccct cagagctctg ctgcagggcc tgcgtgagct tttaccactg 840 

ggcgatggtg gctgtgacgg gcggcgtggg cgtggccgct gccctgtgtc tctgtagcct 900 

cc tgctgtgg ccgacccgcc tgcgacgctg ggaccgtgat gcccggcccfc gactcctgac 960 

agcctcctgc acctgtgcaa gggaactgtg gggacgcacg aggatgcccc ccarggcctt 1020 

ggggaaaagc ccccactgcc cctcactctt ctctggaccc ccaccctcca tcctcaccca 1080 

gctcccgggg gtggggtcgg gtgagggcag cagggatgcc cgccagggac ttgcaaggac 1140 

cccctgggtt ttgagggtgt cccattctca actctaatcc atcccagccc tctggaggat 1200 

ttggggtgcc cctctcggca gggaacagga agtaggaatc ccagaagggt ctgggggaac 1260 

cctaaccctg agctcagtcc agttcacccc tcacctccag cctgggggtc tccagacact 1320 

gccagggccc cctcaggacg gctggagcct ggaggagaca gccacggggt ggtgggctgg 1380 

gcctggaccc caccgtggtg ggcagcaggg ctgcccggca ggcttggtgg actctgctgg 1440 

cagcaaataa agagatgacg gcaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1500 

aaaaaaaaaa aaacccaccg tccgc 1525 

<210> 242 

<211> 1050 

<212> DNA 

<213> Homo sapiens 



aaaaaaaagg aattcgatat caagcttatc gataccgtcg acctcga 



1487 



<400> 242 



ccacgcgtcc ggccagccag tccgcccgtc cggagcccgg ctcgctgggg cagcatggcg 
gggtcgccgc tgctctgggg gccgcgggcc gggggcgtcg gccttttggt gctgctgctg 
ctcggcctgt ttcggccgcc ccccgcgctc tgcgcgcggc cggtaaagga gccccgcggc 
ctaagcgcag cgtctccgcc cttggctaga ctggcgctcc tcgccgcttc cggcggtcag 
tgccccgagg tgaggcggcg ggggcggtgc agacctggcg cgggcgctgg cgcatctgct 
ggagccgaac gtcaggagcg ggcgcgggcc gaggcgcaga ggctgaggat cagcaggcgc 
gcgtcctggc gcagctgctg cgcgtctggg gcgccccccg caactctgat ccggctctgg 
gcctggacga cgaccccgac gcgcctgcag cgcagctcgc tcgcgctctg ctccgcgccc 
gccttgaccc tgccgcccta gcagcccagc ttgtccccgc gcccgtcccc gccgcggcgc 
tccgaccccg gcccccggtc tacgacgacg gccccgcggg cccggatgct gaggaggcag 
gcgacgagac acccgacgtg gaccccgagc tgttgaggta cttgctggga cggattcttg 
cgggaagcgc ggactccgag ggggtggcag ccccgcgccg cctccgccgt gccgccgacc 
acgatgtggg ctctgagctg ccccctgagg gcgtgctggg ggcgctgctg cgtgtgaaac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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gcctagagac cccggcgccc caggtgcctg cacgccgcct cttgccaccc tgagcactgc 840 

ccggatcccg tgcaccctgg gacccagaag tgcccccgcc atcccgccac caggactgct 900 

ccccgccagc acgtccagag caacttaccc cggccagcca gccctctcac ccgaggatcc 960 

ctaccccctg gccccacaat aaacatgatc tgaagcagca aaaaaaaaaa aaaaaaaaaa 1020 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1050 

<210> 243 
<211> 647 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (525) . . (525) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (578) . . (578) 
<223> n equals a,t,g, or- c 

<220> 

<221> mis cofeature 
<222> (581) . . (581) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 

<222> (620) . . (620) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (629) . . (630) 
<223> n equals a,t,g, or c 

<400> 243 

gtgaaacgcc tgggctcaag ctgattcacc tgcctccacc tcccacagtg ctgggattac 60 

aaacatgatc ccccacgccc agccaacaca aaacttctga tgctctgttt tctcatctgt 120 

gaactggagc taaggctaag tggtctgtct gtttaataag agtttgaatc agatggcctg 180 

gcatgaagag tcactggcct gagagaatgt caggggcatt tgtaaatgtg taaagggctg 240 

aaaaatcctg agggattatt attattgcta ttgttgttat tattcacaga cacatycaac 300 

agccattgtc tgcctcctta tctgtcatgc tttctgcacg agcgtcagcc tgagcttcaa 360 

tctgtgtgta tatctgcagc ttacgtcctt gcacccctcc agaacccagt ttcatccttg 420 

taggtttttc craagcagga tttgcacaag tggcgtgttt tcttaagtat ttattttgca 480 

ggccatttac tcggcatggc tatttttaca gtgggtaagg agcanggcta aaaataactt 540 

agctcataac cagacaggtt ctgcatttga cattacgngg nattcatttg catcccattt 600 

ggtcgccttt ctggttaacn ggtagaatnn aagaaagctc acccgaa 647 

<210> 244 

<211> 1321 

<212> DNA 

<213> Homo sapiens 

<400> 244 

gcggggggga ggaggagggg gaggagggag cggagatctc ggggctcgga gccggccgcc 60 

gctccgctcc gatcgctgtg gggcttggtt ttttgggggt gggggggcgg gggggctcag 120 

atatggaggc aaatgggagc caaggcacct cgggcagcgc caacgactcc cagcacgacc 180 

ccggtaaaat gtttatcggt ggactgagct ggcagacctc accagatagc cttagagact 240 

attttagcaa atttggagaa attagagaat gtatggtcat gagagatccc actacgaaac 300 

gctccagagg cttcggtttc gtcacgttcg cagacccagc aagtgtagat aaagtattag 360 



J 



146 



WO 02/076488 PCT/US02/08276 



gtcagcccca ccatgagtta gattccaaga cgattgaccc caaagttgca tttcctcgtc 420 

gagcgcaacc caagatggtc acaagaacaa agaaaatatt tgtaggcggg ttatctgcga 480 

acacagtagt ggaagatgta aagcaatatt tcgagyagtt tkgcaaggtg gaagatgcaa 540 

tgctgatgtt tgataaaact accaacaggc acagagggtt tggctttgtc acttttgaga 600 

atgaagatgt tgtggagaaa gtctgtgaga ttcatttcca tgaaatcaat aataaaatgg 660 

tagaatgtaa gaaagctcag ccgaaagaag tcatgttccc acctgggaca agaggccggg 720 

cccggggact gccttacacc atggacgcgt tcatgcttgg catggggatg ctgggtgagt 780 

ctggacagga ccgcaggtca ccatggactg ggagggctat ggaggcctct actcccaact 840 

gggtcaccta ccagtggggc aaactgcttc acctttctaa gcctcagttt ccttgtctgt 900 

agatgaggat gataattccc cgttccaaga cagttgtgat gattaagtgt gggtgtgtgt 960 

gtgtgcatgc atgtgtgtgt gtgtgtgtgt gtgtttgtat ttataatatt gccccatgcc 1020 

tggcttatag 0 gatatgttag actattttct ctcttttcca tctccttcct caaaagaagg 1080 

aaaagtcccc ctctatctgc ctcagccctc tcatctgagt gggagttytt aagatgtaag 1140 

gactcctggc tgacttgact tgtgtgggct aaggctacgt tttctaaaac ttgggagagg 1200 

agggaagtgg taagggtggg cgataatcct gtctatttaa atgattaaca tttttctctt 1260 

gggatatcaa aatttgcatt taaatggatg ttttaaatag cctgttttac tctttatttg 1320 

c 1321 

<210> 245 
<211> 1084 
<212> DNA 

<213> Homo sapiens 
<400> 245 

ggatggcgct acgtctgctg cggagggcgg cgcgcggagc tgcggcggcg gcgctgctga 60 

ggctgaaagc gtctctagca gctgatatcc ccagacttgg atatagttcc tcatcccatc 120 

acaagtacat cccccggagg gcagtgcttt atgtacctgg aaatgatgaa aagaaaataa 180 

agaagattcc atccctgaat gtagattgtg cagtgctcga ctgtgaggat ggagtggctg 240 

caaacaaaaa gaatgaagct cgactgagaa ttgtaaaaac tcttgaagac attgatctgg 300 

gccctactga aaaatgtgtg agagtcaact cagtttccag tggtctggcg gaagaagacc 360 

tagagaccct tttgcaatcc cgggtccttc cttccagcct gatgctacca aaggtggaaa 420 

gtcctgaaga aatccagtgg gcagtgtgtg aagaaaccct gaaggtcggg cctcaagtag 480 

gtctctttct agatgcagtc cgtttttgga ggaraagact ttcgagccac ataggtgcam 540 

caagtartaa agaaaccctg gatawtctct acgcccggca aaagattgtt gtcatagcga 600 

aagcctttgg tctccaagcc gtaratctgg kgkacattga ctttcgagat ggarctkggc 650 

tgcttagaca gtcacgagaa ggagccgcca tgggcttcac tggtaagcag gtgattcacc 720 

ctaaccaaat tgccgtggtc caggagcagt tttctccttc ccctgaaaaa attaagtggg 780 

ctgaagaact gattgctgcc tttaaagaac atcaacaatt aggaaagggg gcctttactt 840 

tccaagggag tatgatcgac atgccattac tgaagcaggc ccagaacact gttacgcttg 900 

ccacctccat caaggaaaaa tgatctgtta aatgaagctg tcatcaggct aaagggtatt 960 

gaagctgcag agggatcaac ttgtgcttgc cagaggacgc caatgaagtt tgaaacacca 102 0 

acaatcagag attttgtttc tgttcctcat taaatcatga gcttttgtgc cgagaaaaaa 1080 

aaaa 1084 

<210> 246 
<211> 1776 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1748) . . (1748) 
<223> n equals a,t,g, or c 

<400> 246 

agccactgtg cccagcctcg gctcaggttt ttmaatacag tcttgacctt ggcattcagt 60 

atcctcacag catggttcta attaactttc tagctctatt tcccttttcc tgctccctct 120 

ctctacaact agtctttctc tgattgcccc gccctcaacc catctaaact agaccccagg 180 

gaagcacctt ggtccccttc ctctctccca ctcaccatcc aaccaatcac cagagcctgt 240 

acattctata ttttcaacat cgattcaatt gtctacttct ttctagcctg ccctctctga 300 

ctgggactcc ttgagccagc ctgatcaccc caatccatcc ctcacactgt gcccatcttt 360 
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ctgaagtagg aatctgatca caccamcctg ctaaaaacac tctggttctc cccacggcat 420 

gtggtgccct tgtatagctg gcaaagcctt gcatggcacg gccccagcct gtgcttcaac 480 

tcaattgccc gactctctcc agctctgctg agccacctaa gtcacagatg gtttctcctc 540 

tcatctctgc tctcttccat gtgccatttc tgtggcttgg aatgttcttc cctcattctc 600 

tttctggccc tttcccgtca caccttagac gtgcatcttc ctctcgaaaa cctctagtga 660 

agcctcccag ggccaggcag taccctcctc tggcttcttc tggatacaga ggaagaatct 720 

gagcatcgat tctccatctc agcaggcctc tgtgtgcctg ctgactccga ctagaccaga 780 

gatccgtaag gacagggatc gagttttttt tcttttaatk cactgcctca aaaatcctct 840 

gtgcattacc tattcatcct cttctctccc ttaacctgaa ccagtgatct tactgtctcc 900 

atcattgttt ttttcttttc ttttcttttc tttttttttt ttgaggtgga gtctggctct 960 

tcacccaggc tggagtgcag tgatgcgatc tcgactcact gcaacctcca tctcctgggt 1020 

tcaagcgatt ctcctgcctc agcctcccca gtagctggga ttacaggcat gcgctaccat 1080 

ccccaactaa tttttgcctc cataattytg ccttttstag aatgtcatac aggtggaatt 1140 

actcagtatg ctgccttttt cagattggct tctttcactt agtaatatgs tygttttttg 1200 

agacagggtc ttgctctgtc gcccaggcta gagtgtggtg gtgcgatctt agctcactga 1260 

aacctccacc tcccaggttc aagtgaytct sctgcctcag cctcccgagt agctgggact 1320 

acaggcacgt gccaccatac ccggctaatt tgtggatttt tagtacagac ggsgtttcgt 1380 

catgttggcc agtgtgytgt tgaattcctg acctcaagtg atccacctgc ctcagcctcc 1440 

caaagtgttg cgattacagg tgtgagccac tgcgccaagc ctcatttagt aataygcatt 1500 

taaactttct ccatgkcttt aatggcttga tagctcattt atttttatca wggaatattt 1560 

cattgtctgg atggaccaca gtttatttct ccattcacct actgaaggac atctcggttg 1620 

cttctaagtt ttggcaatta tgaataaagc tgctataacc atcaagtgca ggtttttgtg 1680 

tggacctatt atcaactaat tcgggtaaat ctcaaggagt gcaattgctg gatccacagt 1740 

aagagtgngt ttagttttaa gtgcttggcc attttc 1776 

<210> 247 
<211> 784 
<212> DNA. 

<213> Homo sapiens 
<220> 

<221> misc__f eature 

<222> (1) . . (1) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 

<222> (6).. (6) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (32) . . (32) 

<223> n equals a,t,g, or c 

<400> 247 

ngagtntaat tgctggatca cacagtaaga gngtggttag ttttaagtgg ctgtgccatt 60 

ttgcattccc accagcaatg aatgagagtt tctgttgctc cacattctca ctaccattcg 120 

gtgttgtcag tgttttgcat tttggccatt ctagtaggtg tttacatggt atctagtcat 180 

ttgaatgggc atatgatgtg gaacatcttt ttttttttat tttwttatta ttatacttta 240 

agttttaggg tacatgtgca caaygtgcag gttwgttaca tatgtataca tgtgccatgy 300 

tggtgtgctg cacccaytaa ctcgtcatyt agcattaggt atatctccya atgctattgg 360 

aacatctttt catgtgttta tttgccatct gtatatcttc cctgatgagt tggggatgca 420 

ttctttccat ctcagagtcc ccagaaacta acatagcagt tggtacagag ttggtgctca 480 

acaaacatca gcttaggaac tatgtcctat gtttttttgt tttttttttt ttttaaaaag 540 

gaatgtgagc tgttcccaaa acgtatgtcc ttcccccatg cctctaccct gcccttccac 600 

aaactttctg atcttcagca cacactaccc aaccatcaag gctgagactt cccgtggcca 660 

gcagtgtctc atgctggctt caagccccac agcactgctt ttttcaactt ctcttgtggt 720 

ttagactgtc tttagcccag caagagaatt cgatatcaag cttatcgata ccgtcgacct 780 

cgag 784 
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<210> 248 
<211> 699 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mi sc_f eature 

<222> (12) . . (12) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (30) . . (30) 

<223> n equals a t t,g, or c 

<220> 

<221> misc_feature 

<222> (46) . . (46) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (66) . . (66) 

<223> n equals a,t,g, or c 

<400> 248 

ggttcatgct tnaaccccat ttgctggatn ctgctttgac cctgtntccc tgtactggtg 60 

ttatgngaat cctggcacac attgctatcc cactctgaga ggmcwtgagc aaagaamccc 120 

cagtrgcaga agccacattg tgctcaggtc ttagttctaa caaacaccat tccccattaa 180 

aaggaaccag gctccttaga gaaatggatg attccagggc tgtggcaggg taggtacaag 240 

atgaacctaa agtgtcgttt tataccagaa agtaagaaag tattaaagtg tttaaaaaag 300 

tgatgggagc atatcacaag gattcagaag ggataccaac tggctaaatc tggaacaatt 360 

tgatcaccaa agtaagtaca ataataaatt ctaagctatt gaagtaaagg catttattat 420 

gtgtagtaat aataaataga taatgagaga gaaatgagga ctcatgctta cagtaaaatg 480 

ccaggagctg actggcataa atgtggaagg aaggctggag tgggaaaatt attattttgc 540 

aaccatcatg gtaattacca gatcagataa ggatcaacag atgccaaatc tagggcaaat 600 

ttgatgagga gcagaatatt tgcactgtct ttgagagttt ctcccagaga tcacttattt 660 

gttgtaaaaa aaaagaaaaa aaaaaaaaaa aaactcgag 699 

<210> 249 
<211> 774 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (618) . . (618) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (715) . . (715) 
<223> n equals a,t,g, or c 

<400> 249 

gtctacctcc gggctgaaac gtcaccatgc ctccccacag acagacggat ggacagatgg 60 

gcctccctgc acctgctctg tgggtgtggg ggctcctgct cagcagcagt ttccagaccc 120 

ttctccctgc tttccccaag ccacccgcct tgaatctggg gtgctctacc agacccatcc 180 

cctcatttct aaagatttga gccactagtc gtgtccctct ccctcagaaa tgccttggtg 240 

acacttggct gctttcaact cttccaccca tctgcctctt ggtctcatct ttaccttctg 300 

ctaaaggtcc tgacccccac ccccgccacg ccatggggca ccccatggtg gtgcgtcctt 360 
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gggagcagct ctgtcccttt ccccgtggcc tttgccccgc ctcctatgac ttcgattccc 420 

acctgtcccc gacccctggg accactgacc gggcccgatc accctgtcac tgccctgtca 480 

tctgcttacc ccacacggtg ctctgctgac ccaggtcttg ctgtctccca ayagccccac 540 

gaggcttycc gtcgctcctg gacactrmag gctgagcccg ctgccccgcc gcctccatga 600 

ggaaggcttt tcctctgnga gccccaggcc accctttccc tcctttaagt aattacttaa 660 

gtcccttgcc agggccctcc cagtaccctt tctaaagaca cccctgcccc agcangctgc 720 

aggctcctgc tccactttcc tctcaggccc tcgtcgctgt ggtgctgcct ttga 774 

<210> 250 
<211> 1396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1187) . . (1187) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (1325) . . (1325) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (1327) . . (1327) 
<223> n equals a,t,g, or c 

<400> 250 

cctcgagcca tgtgtccagg tggggcagat catggagggc gcccaggtgc ggtgctgagg 60 

ctgagcatgc actggctggg gagggtgggc agagcaggag gaaatccctt gttctccgga 120 

gctggagagc cagaaagagc cctgcagcct gggcctcatc atcacacctc gccctcaagg 180 

cctccaggca cagcatccac tgccagcctc tgctcctgcc tctgagggtc tgtctccaag 240 

gtcttttggg ggctgcccca gctcccccaa cacagacagc accagagctg ggcccacctg 300 

tgaggtgctg agttccccat ctcagagact gtgcgagcag aggcagggag gcgcttgatg 360 

tcggatggag aaaggacagg ggaggggtgt ggggctcagg gccccgccag gtgaaggaac 420 

aagcttggaa gggtgtcctt atgtctgaga gttggggaga cacccccaag ccccagatgg 480 

rcctcgaatg ccaggcaggg ccaagctggg cccagaagtg ggawggwtcc cttggctgcc 540 

ccaggaatgc aggttccggg gcaagaatcc agccctggct ggttgaagtc catcccaagt 600 

ctccctcccc acgaggctcc tgcagatgcc aggaatgggg ctggattcca gattccaagc 660 

ctggcsgccc agcccctatc tgggacccca gcccagagcc cccaggcctg gcctccaacc 720 

tggccccagc ctcaggggag ctgaattcag agaatcctgt cctaggagcc agaagcgggg 780 

gagggaggra gggcggccct gtcctgggtg caggcccggg ggctgggggy tgcccgcccg 840 

tctgggtcag ccgcagctgc aaaccggccc tggctgagtc atggsgcctc catctccagg 900 

gcctggcttg aggtggggaa tagcagtgag gttggacatc caggcacctg agggtgggca 960 

gggctccctg cggctggggt ggccagtggc acctggctgt tgcccccctg caccccagcc 1020 

ctttggcccc caagtctctg ccacctccct ggggttctgc tcccatattc ctcacaccca 1080 

gcacagaacc cagcatgtct cctgtagaca cctgcatata aaccctgact cacacacaca 1140 

cacacacaca cacacacgca cacatgcagg ccaggctcct cggccangtc accctaccgg 1200 

cagagctcta gacatttctg gcctctggtg actattcttc aggcagctca ccctgcaagt 1260 

ctttattgag cacctactgt gtgccaggca gtggtacagc aagggcagaa gccccacctc 1320 

caagnanctg aacccctgcc gtggcagaga cagaaaacaa aggcagcacc acggcgacga 1380 

gggctaaaga gaacgc 1396 

<210> 251 

<211> 1397 

<212> DNA 

<213> Homo sapiens 

<400> 251 

tcgacccacg cgtccgctga attgcggccg tatgcgcggc tctgtggagt gcacctgggg 60 
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ttgggggcac tgtgccccca gccccctgct cctttggact ctacttctgt ttgcagcccc 120 

atttggcctg ctgggggaga agacccgcca gctgcttgag tttgacagca ccaacgtgtc 180 

cgatacggca gcaaagcctt tgggaagacc atatcctcca tactccttgg ccgatttctc 240 

ttggaacaac atcactgatt cattggatcc tgccaccctg agtgccacat ttcaaggcca 300 

ccccatgaac gaccctacca ggacttttgc caatggcagc ctggccttca gggtccaggc 360 

cttttccagg tccagccgac cagcccaacc ccctcgcctc ctgcacacag cagacacctg 420 

tcagctagag gtggccctga ttggagcctc tccccgggga aaccgttccc tgtttgggct 480 

ggaggtagcc acattgggcc agggccctga ctgcccctca atgcaggagc agcactccak 540 

cgaacgatga atatgcaccg gccgtcttcc agttggacca gctactgtgg ggctccctcc 600 

catcaggctt tgcacagtgg cgaccagtgg cttactccca gaagccgggg ggccgagaat 660 

cagccctgcc ctgccaagct tcccctcttc atcctgcctt agcatactct cttccccagt 720 

cacccattgt ccgagccttc tttgggtccc agaataactt ctgtgccttc aatctgacgt 780 

tcggggcttc cacaggccct ggctattggg accaacacta cctcagctgg tcgatgctcc 840 

tgggtgtggg cttccctcca gtggacggct tgtccccact agtcctgggc atcatggcag 900 

tggccctggg tgccccaggg ctcatgctgc tagggggcgg cttggttctg ctgctgcacc 960 

acaagaagta ctcagagtac cagtccataa attaaggccc gctctctgga gggaaggaca 1020 

ttactgaacc tgtcttgctg tgcctcgaaa ctctggaggt tggagcatca agttccagcc 1080 

ggccccttca ctcccccatc ttgcttttct gtggaacctc agaggccagc ctcgacttcc 1140 

tggagacccc caggtggggc ttccttcata ctttgttggg ggactttgga ggcgggcagg 1200 

ggacagggct attgataagg tccccttggt gttgccttct tgcatctcca cacatttccc 1260 

ttggatggga cttgcaggcc taaatgagag gcattctgac tggttggctg ccctggaagg 1320 

caagaaaata gatttatttt ttttcamaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1380 

aaaaaaaggg cggccgc 1397 

<210> 252 
<211> 1368 
<212> DNA 

<213> Homo sapiens 
<400> 252 

ctgaattgcg gccgtatgcg cggctctgtg gagtgcacct ggggttgggg gcactgtgcc 60 

cccagccccc tgctcctttg gactctactt ctgtttgcag ccccatttgg cctgctgggg 120 

gagaagaccc gccagctgct tgagtttgac agcaccaacg tgtccgatac ggcagcaaag 180 

cctttgggaa gaccatatcc tccatactcc ttggccgatt tctcttggaa caacatcact 240 

gattcattgg atcctgccac cctgagtgcc acatttcaag gccaccccat gaacgaccct 300 

accaggactt ttgccaatgg cagcctggcc ttcaggtcca ggccttttcc aggtccagcc 360 

gaccagccca accccctcgc ctcctgcaca cagcagacac ctgtcagcta gaggtggccc 420 

tgattggagc ctctccccgg ggaaaccgtt ccctgtttgg gctggaggta gccacattgg 480 

gccagggccc tgactgcccc tcaatgcagg agcagcactc catcgacgat gaatatgcac 540 

cggccgtctt ccagttggac cagctactgt ggggctccct cccatcaggc tttgcacagt 600 

ggcgaccagt ggcttactcc cagaagccgg ggggccgaga atcagccctg ccctgccaag 660 

cttcccctct tcatcctgcc ttagcatact ctcttcccca gtcacccatt gtccgagcct 720 

tctttgggtc ccagaataac ttctgtgcct tcaatctgac gttcggggct tccacaggcc 780 

ctggctattg ggaccaacac tacctcagct ggtcgatgct cctgggtttg ggcttccctc 840 

cagtggacgg cttgtcccca ttagtcctgg gcatcatggc agtggcctgg gtgccccagg 900 

gctcatgctg ctagggggcg gcttggttct gctgctgcac cacaagaagt actcagagta 960 

ccagtccata aattaaggcc cgctctctgg agggaaggac attactgaac ctgtcttgct 1020 

gtgcctcgaa actctggagg ttggagcatc aagttccagc cggccccttc actcccccat 1080 

cttgcttttc tgtggaacct cagaggccag cctcgacttc ctggagaccc ccaggtgggg 1140 

cttccttcat actttgttgg gggactttgg aggcgggcag gggacagggc tattgataag 1200 

gtccccttgg tgttgccttc ttgcatctcc acacatttcc cttggatggg acttgcaggc 1260 

ctaaatgaga ggcattctga ctggttggct gccctggaag gcaagaaaat agatttattt 1320 

tttttcaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1368 

<210> 253 
<211> 1763 
<212> DNA 

<213> Homo sapiens 
<400> 253 

ccacgcgtcc gattcaagtg atcaagattt taaaatatga aaagaaactg gccaaaatgt 60 
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gctttttaat gatattcacc ttcctggtct gttggatgcc ttatatcgtg atctgcttct 120 

tggtggttaa tggtcatggt cacctggtca ctccaacaat atctattgtt tcgtacctct 180 

ttgctaaatc gaacactgta tacaatccag tgatttatgt cttcatgatc agaaagtttc 240 

gaagatccct tttgcagctt ctgtgcctcc gactgctgag gtgccagagg cctgctaaag 300 

acctaccagc agctggaagt gaaatgcaga tcagacccat tgtgatgtca cagaaagatg 360 

gggacaggcc aaagaaaagt gactttcaac tcttcttcca tcatttttat catcaccagt 420 

gatgaatcac tgtcagttga cgacagcgac aaaaccaatg ggtccaaagt tgatgtaatc 480 

caagttcgtc ctttgtagga atgaagaatg gcaacgaaag atggggcctt aaattggatg 540 

ccacttttgg actttcatca taagaagtgt ctggaatacc cgttctatgt aatatcaaca 600 

gaaccttgtg gtccagcagg aaatccgaat tgcccatatg ctcttgggcc tcaggaagag 660 

gttgaacaaa aacaaattct tttaattcaa cgggtgcttt acataatgaa aaaaccactt 720 

gtggcacacg atgggcatct aacatcatca tcttctaatg tgttggagat tttcatttca 780 

aatatatttt ttaaattact ctattttcca aaacacgtaa tgcatttttc tcgaaaatac 840 

cttactgtaa aaataactgt cgcgtacaca tgtgtgaagt agctagaaca tactgaattt 900 

ttttttgtac tgttggactc tattcagtgt catgtcctat atctgatcaa gttatcaagg 960 

agataattct agaatgaaaa agaaaatcct cttgttggaa acaaaagacg ttttatatgt 1020 

gcagtatgac aaagaggagt ttcagagaca actttgaatc cttgtcagcc tggagaccag 1080 

caccagagga atctacaagg caaactccca tatatttgct tcccccaaat tgctgcccct 1140 

acagactcaa agctcttttt ctttgttttg ttgtttctct aaaaatttac tgttctttgt 1200 

cgatgctata taagccaggg agttctaaga cgccagctct ttgagatttg ctcattcccc 1260 

tgtatttccc acatatatat tacatatacc cgctaataaa tttatgtttg tttttctctt 1320 

gtcaatctgt cttttgttat aggggcccca gccaaggaac ctaaagtggg tagaaggaaa 1380 

aattattttt tctttcccta caaactgaac atggattatt agaactcaag gttttcattg 1440 

acaatataga aaagaaacac tgaatcattt tattttattg cccaattttt atttcttata 1500 

tgactctagt gtttcatctt cataattaat catgtttgaa ggatttctga gtgactcagc 1560 

agcctgttaa agaaggatga accaaagaaa acatttcact aaatgtgctt ttaaaaatca 1620 

agtgtattgc tggttctgct gcagtatgta gtcgaagaat aaattagtaa attgcttctg 1680 

agggtctgaa attgaataaa gtaatggctt tgtatttcta taaaaaaaaa aaaaaaaaaa 1740 

aaaaaaaaaa aaaaaaaaaa aaa 1763 

<210> 254 
<211> 1274 
<212> DNA 

<213> Homo sapiens 
<400> 254 

gcccacgcgt ccgctgttgc tcaaaggaaa taggagttgg tgtgcttgtg accaaggggt 60 

tacacttmca gcttttaaaa ttctccttta catgtgctca gtgttttgkt ttgtgttttg 120 

gtttctgttt tttattttaa ttcccacatt gggcacaaga atcagaatat ggatagctag 180 

tttaagaaac ttttgtgggt gcactgtagc atagatgaca gaatttgatg ttccccccat 240 

ctccaattca gttcagggca ttccacagtt aaacagaaat gggaacgtgg ggctcttata 300 

aatgaatggg cgctcacagt tttggttttc agctcttcat gtctgtaagt gtgctttggg 360 

graggctatg tctgtatggt cgattctcag ttatcacatt tgcctctcct cccactacct 420 

tcatgamcat tcagtgctgt tcgcactgca gttagagaga agggacggac agttggtgac 480 

actcagccac attgctactt ttatctgttc tggtaagaag tfcagatagat ggtagattga 540 

agcaattggg tagaattagt tgggggaata tttatgagtt gctgtgtttg ttgattagtt 600 

ccatctcttt cccattttaa ctgagaattg attatatata gctctaagta tataggtatt 660 

taaacaaccc cacaagcggc tgtatcagta acatttatta attccactat agtgagggag 720 

gatttccatt ctaaatacct tattttgagg gatttataaa acttagttgt aaaagagaaa 780 

gcccacatag tgggaataaa ttgcttcagc catttttagt atttgagagc actagggaag 840 

atgtttagta gctgtgtgga tgcctttttt cacaccctgt ctattgaatg ctgcatccat 900 

tcacgaagtt aaatgttaca tgcagttagt ccttaatgtg gactggatct gtacttttgt 960 

tttggattaa aacatttaaa gatttttgaa gtgcagctac tccccacgtg catttgmtac 1020 

acataaaagt catactgtgt gtgcacaaag agtacatgga ttttccagca taytgcttta 1080 

aaaaattata taaactgtta aaatattaac acctcaggct acctgctgta ttctgtccca 1140 

ttgacccctg gaattggatt tactgcaagt gattgataat tcaattatgt ggcttttccc 1200 

ctttaatctt gccatttaaa ttacagtaga aagacaaaat caagtaaaat aaagtgttag 1260 

ataatagaaa gagt 1274 

<210> 255 
<211> 2409 
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<212> DNA 

<213> Homo sapiens 
<400> 255 

ccacgcgtcc gcttcgacga cgacacctgc agaagtgcgg acccgccatg ccgcgccacc 60 

tctcgggact gctcctgctg ctctggccgc tgctgctgct gctgccgccg acccccgccg 120 

cccccggccc cctggcccgc ccgggtttgc ggaggctggg cacgcggggc ccagggggca 180 

gtcccgggcg ccgccctggc tctgctgtcc ccacccgcgc gccctattcc ggggccggcc 240 

agcccggcgg ggcccgaggc gcaggtgttt gcaggagcag gcccttggat ttggtgttca 300 

tcatcgatag ttcccgcagt gtgcggcccc tggagttcac caaagtgaag acctttgtct 360 

cccagataat tgacactctg gacattgggg cggcagatac acgggtggca gtggtgaact 420 

atgctagcac cgtgaagatt gagttccatc tccagaccca ctcagataaa cagtccttga 480 

aacaggctgt ggctcggatc acacccctgt ctacaggcac catgtccggc ctggctatcc 540 

agacagcaat ggatgaggcc ttcacggtgg aggcaggagc tcgggggccc acttccaaca 600 

tccctaaggt ggccatcatc gtgacagatg ggaggcccca ggaccaggtg aatgaggtgg 660 

cggctcgggc ccgggcatct ggtattgaac tctacgccgt gggcgtggac cgggcagaca 720 

tggagtccct caagatgatg gccagcgagc ccctagacga gcacgttttc tatgtggaga 780 

cctacggggt cattgagaaa ctctcctcta gattccagga aaccttttgc gctctggacc 840 

cgtgtgtgct tggcacacac cggtgccagc acgtgtgtgt cagtgatggg gaaggcaagc 900 

accactgtga gtgcagccaa ggctactcct tgaacgccga tcagaagacg tgttcagcta 960 

tcgataagtg tgctctgaac actcacggtt gtgaacacat ctgtgtgaac gacagaactg 1020 

gctcttacca ctgtgagtgc tacgaaggtt acaccctgaa ccaagacagg aagacttgtt 1080 

cggctcaaga ccaatgtgcc tttggtacac atggctgcca gcacatttgt gtaaatgaca 1140 

gagatgggtc ccatcactgt gaatgctacg agggttatac tctgaatgct gacaacaaaa 1200 

cgtgttcagt tcgcagcgag tgtgctgggg gctcgcacgg ctgccagcac ctgtgtgtgg 1260 

acgacgggcc cgcggcctat cactgcgatt gtttccccgg ctacaccctg accgaagacc 1320 

ggaggacgtg cgcagccatt gaagaagcac gaagactcgt ctctacagaa gatgcttgtg 1380 

ggtgtgaagc caccctggcc ttccaggaga gggccagctc atatctgcag agactgaatg 1440 

ccaaactcga tgatattttg ggcaagttgc aagcagatgc gtatggacaa atacatcgtt 1500 

gaattactca gatttttcac ctggatatac ggagagcttg gtctatttaa tatttttgca 1560 

tacttcaatg ttcctgctaa taatttgcca ttgcaaatgc tttaatatta ctggataagt 1620 

agtatgagga tcttctagag aatcagtagg acataaacgt tcacatcctt aagagcaaac 1680 

tttagtgtct ctaagctatg actgtgaaat gattcatggg gaatagaatg aaaagtttgg 1740 

tatctcttta tttaccaatt gagccattta atttttaaat gtttatatta gtaagataac 1800 

cattcttaca atgggaactt tttatctatt ttctcttgat agtatttata gtataaacca 1860 

gttttattat tgagagtgta aattatacaa gtatttacac ataaaaaagt tcatataatt 1920 

gaggtaaata taatttagaa ctgtttcttt aatgctttgt tttttgctca ctttttgctg 1980 

gaatatcact gaagctgtga tcaggggatt ataacacata tcaagatcaa gtgaacacta 2040 

catgaaatat tgtaagaaac acataactaa agactttagt tttgaattaa gtgttataac 2100 

ttcttaccaa gttttggtaa aaaatcctac attatcttta ctgtttcact ttaggattca 2160 

atcaagaaaa ttatatactt ataaatattg atctaaaaag ttaacaacaa acccaatgtc 2220 

gccattttaa agtttaagct taacttttct tcacttacat atttagtata tgtattttat 2280 

ttttccgctt gaaagcttat agctcttagg agaaaaccat cctttaaatt gtgactactc 2340 

attttttctg tttgtattgt ctttagtata ataaaaagtt actatcttta taaaaaaaaa 2400 

aaaaaaaaa 2409 

<210> 256 
<211> 876 
<212> DNA 
<213> Homo sapiens 

<400> 256 

caggtaccgg tccggaattc ccgggtcgac ccacgcgtcc gcttcgacga cgacacctgc 60 

araagtgcgg acccgccatg ccgcgccacc tctcgggact gctcctgctg ctctggccgc 120 

tgctgctgct gctgccgccg acccccgccg cccccggccc cctggcccgc ccgggtttgc 180 

ggaggctggg cacgcggggc ccagggggyw ktcccgkkcg ccgccctgkc tctgctgtcc 240 

ccacccgcgc gccctattcc ggggccggcc agcccggcgg kgcccgaggc gcaggtgttt 300 

gcaggagcag gcccttggat ttggtgttca tcatcgatag ttcccgcagt gtgcggcccc 360 

tggagttcac caaagtgaag acctttgtct cccagataat tgacactctg gacattgggg 420 

cggcagatac acgggtggca gtggtgaact atgctagcac cgtgaagatt garttccawc 480 

tccagaccca ctcagataaa cagtccttga aacaggctgt ggctcggatc acacccctgt 540 
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ctacaggcac catgtccggc ctggctatcc agacagcaat ggatgargcc ttcacggtgg 600 

aggcaggagc tcgggggccc acttycaaca tccctaaggt ggccatcatc gtgacagatg 660 

ggaggcccca ggaccaggtg aatgargtgg cggctcgggc ccgggcatct ggtattgaac 720 

tctacgccgt gggcgtggac csggcaraca tggagtccct tcaagatgaa tggccagcga 780 

agcccctaga cgagcacgtt ttctatgtgg agacctacgg ggtyattgag aaaccttcct 840 

ytagattcca ggaaaccctt ttgcgctctt ggaacc 876 

<210> 257 
<211> 1586 
<212> DNA 

<213> Homo sapiens 
<400> 257 

tttattatac taaagmcaat acaamcagaa aaaatgagta gtcacaattt aaaggatggt 60 

tttctcctaa gagctataag ctttcaagcg gaaaaataaa atacatatac taaatatgta 120 

agtgaagaaa agttaagctt aaactttaaa atggcgacat tgggtttgtt gttaactttt 180 

tagatcaata tttataagta tataattttc ttgattgaat cctaaagtga aacagtaaag 240 

ataatgtagg attttttacc aaaacttggt aagaagttat aacacttaat tcaaaactaa 300 

agtctttagt tatgtgtttc ttacaatatt tcatgtagtg ttcacttgat cttgatatgt 360 

gttataatcc cc tgatcaca gcttcagtga tattccagca aaaagtgagc aaaaarcaaa 420 

gcattaaaga aacagttcta aattatattt acctcaatta tatgaacttt tttatgtgta 480 

aatacttgta taatttacac tctcaataat aaaactggtt tatactataa atactatcaa 540 

gagaaaatag ataaaaagtt cccattgtaa gaatggttat cttactaata taaacattta 600 

aaaattaaat ggctcaattg gtaaataaag agataccaaa cttttcattc tattccccat 660 

gaatcatttc acagtcatag cttagagaca ctaaagtttg ctcttaagga tgtgaacgtt 720 

tatgtcctac tgattctcta gaagatcctc atactactta tccagtaata ttaaagcatt 780 

tgcaatggca aattattagc aggaacattg aagtatgcaa aaatattaaa tagaccaagc 840 

tctccgtata tccaggtgaa aaatctgagt aattcaacga tgtatttgtc catacgcatc 900 

tgcttgcaac ttgcccaaaa tatcatcgag tttggcattc agtctctgca gatatgagct 960 

ggccctctcc tggaaggcca gggtggcttc acacccacaa gcatcttctg tagagacgag 1020 

tcttcgtgct tcttcaatgg ctgcgcacgt cctccggtct tcggtcaggg tgtagccggg 1080 

gaaacaatcg cagygatagg ccgcgggccc gtcgtccaca cacaggtgct ggcagccgtg 1140 

cgagccmmca gcacactcgc tgcgaactga acacgttttg ttgtcagcat tcagagtrka 1200 

assctcgtag matwmacagt gatgggaccc atctctgtca tttacacaaa tgtgctggca 1260 

gmcatgtgta ccaaaggcac attggtcttg agccgaacaa gtcttcctgt cttggttcag 1320 

ggtgtaacct tcgtagcact cacagtggta agagccagtt ctgtcgttca cacagatgtg 1380 

ttcacaaccg tgagtgttca gagcacactt atcgatagct gaacacgtct tctgatcggc 1440 

gttcaaggag tagccttggc tgcactcaca gtggtgcttg ccttccccat cactgacacm 1500 

cacgtgctgg caccggtgtg tgccaarcac acacgggwcc agagcgcaaa aggtttcctg 1560 

gaatctagag gagagtttgg caattt 1586 

<210> 258 

<211> 1011 

<212> DMA 

<213> Homo sapiens 

* 

<220> 

<221> misc_f eature 

<222> (2) . . (2) 

<223> n equals a,t,g, or c 

<400> 258 

cncccttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 60 

ttgcagctta aaagcatttt attaagcatt tttggatgtt gcttcctacc acttaagaat 120 

aaaaaatgca ttttaataaa aacaaatcta tactgaagtc attttccttt gtgagaggaa 180 

atatgaatga aatacattca tacttaaaaa cagagtattt tactgccaaa actttaaata 240 

tctcaagagc ataccacatt ttaaacacat tatggtcatg tagctatttc aatattcctg 300 

ggagtggtgg gcaattagcc tgtctatggc ttaggatctg tttccatgct tgcttcctga 360 

gcttcttcta cctctgagag tttttcatca ttttccaatg tttgctgaag ttcatggatg 420 

gtactaagaa gaacatgaaa ctgtttccgt ctcaattcca gcttatcttc aacactttct 480 

ttaatgtgtg aaagatgctc taattctttt cccagagcct ctagttcctt taatgtctca 540 
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tgcctgtctg gatggtgctg aatcactttt gccaaagcat catattcttg gcgatttttt 600 

cgtattcgtt ttgcttgaag aatttgcttt ttgcactcag caattttttc atgtgctcca 660 

gctatgctac attctatttc cttgtaaatt ttttcataat tttccatttc tctgagattc 720 

atatcatata ctagtaaagt tttgcccatt gaaaattcac attgagacag cgtgctcagc 780 

atacgttggt actggctata tccctcttcc tgggacccag agttgcacca tttaatgaaa 840 

ctcttcacta gcagattaat tctccgatca tctccagcac catctccatc aatgaggaga 900 

cgcttccgta taattcgtcg tcagtcacgg ctcccatggc gtgcgcggcg gcggcggcgg 960 

gcaagctgag gcggcggttg gcggcggcgg cggaggtcaa actcccacaa t 1011 

<210> 259 

<211> 1395 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> (1338) . . (1338) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (1382) . . (1384) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (1390) . . (1390) 
<223> n equals a,t,g, or c 

<400> 259 

tcgacccacg cgtccgggag ccatggcgcc gtccgggccg ctgctgctgg tgctgctcgt 60 

gccgctggcc gccgcgcggc cgggccctac ttccgtcccg gccggggctg ccgcctgccc 120 

ctgcgggggg accagctgtc ggggctgggg cgcaggacct acccccggcc gcacgagtac 180 

ctgtccccat ctgacctgcc caagagctgg gactggcgca acgtgaacgg ggtcaactat 240 

gccagtgcca ccaggaacca gcatatcccc cagtactgtg gctcctgctg ggcccacggc 300 

agcaccagtg ccatggcgga ccggatcaac atcaagagaa agggggcgtg gccctccamc 360 

ctgctgtccg tgcaacamkt cytcgaytgg cgccaacgcg ggytctgtga gggggcaack 420 

acctgccggt gtsgacgtac gcccatgagc amggcatccc ggacgagacc tgcaacaact 480 

accaggctaa ggaccaggaa tgcaacaagt tcaaccagtg tggaacatgc acggaattca 540 

aggagtgcca ctacatccag aactacacgc tctggaaagt gggtgactac ggctccctct 600 

ccggcaggga gaagatgatg gcggaaatct atgccaacgg ccccatcagc tgcggtatca 660 

tggccacgga gaagatggtg aactacacgg gaggcatcta cgcggagtac caggatcagg 720 

cctacataaa ccacgtcatt tctgtggtcg gctggggcgt cagcgacggc acggagtact 780 

gggttgtccg .gaattcgtgg ggggaaccgt ggggggagca cggctggatg aggattgtga 840 

ccagcaccta taaagacggg cagggcgcca gttacaacct cgctgtcgag gacacctgta 900 

cgtttgggga ccccatcgtt taagggacag gtctccccag aagagcagtg ttatcgtgaa 960 

ccataatcag ggggtcctat cgctctgggc actgggttgg ttccaccatg gtctgaaggg 1020 

actggggact ggcatcaaac gtgtctgatg gctgctcgcg gccccgtgcg cccagaaggg 1080 

agaaggggcg cctgtcagca cacagcctgc cgcggcgccg gccgggagcg cgctcctggg 1140 

gaagagtctg caatgggacg gctgagagcc ccgggccggc cactgccctg cccagtgtct 1200 

gcctggccac cgtgtgatcc gcaaggccca aacgatgtga ctgcaagctt ctctgtccct 1260 

gatttggtgt ttcctgtctg gcagctgtgg tccatgatgt ggtgcggaag cccaagcttc 1320 

tcaaagctct tacgttgnct gggattcggt gggggggagt cggggggtgg aaggagaaag 1380 

cnnnccttgn aagat 1395 

<210> 260 
<211> 270 
<212> DNA 
<213> Homo sapiens 

<220> 
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<221> misc_f eature 
<222> (260) . . (260) 
<223> n equals a,t,g, or c 

<400> 260 

tgtcagcaca cagcctgccg cggcgccgcc gggagcgcgc tcctggggaa gagtctgcaa 60 

tgggacggct gagagcctcg ggccggccac tgccctgccc cagtgtctgc ctggccaccg 120 

tgtgatccgc aaggcccaaa cgatgtgact gccaagctcc tctgtccctg atttggtgtt 180 

tcctgtctgc agctgtggtc catgatgtgg tgcggaagcc caggcttctc aaagctctta 240 

cgttgctggg attcggtggn ggggartcgg 270 

<210> 261 
<211> 2324 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<222> (15) . . (15) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (23) . . (23) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (36).. (36) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (92) . . (92) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (95).. (95) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (107) . . (107) 
<223> n equals a,t,g, or c 

<400> 261 

cgccaaaccc gcctnttccc cgngcgtttg ccgtcnttta aatgccagga tcgatccagc 60 

catgataaga tccattgatg agtttggcca anccncaact tagaatncag tgaaaaaaat 120 

gctttatttg tgaaatttgt gatgctattg ctttatttgt aaccmttata agctgcaata 180 

accaagttac camcamcaat tgcattcatt ttatgtttca ggttcagggg raggtgtggr 240 

aggtttttta aagcaagtaa amcctctaca aatgtggtat gsctgattat gatcatgamc 300 

agactgtgag gactgagggg cctgaaatga gccttgggac tgtgaatcta aaatacacaa 360 

mcaattagaa tcactagctc ctgtgtataa tattttcata aatcatactc agtaagcaaa 420 

actctcaagc agcaagcata tgcagytagt ttaacmcatt atacacttaa aaattttata 480 

tttaccttag agctttaaat ctctgtaggt agtttgtcca attatgtcac accacagaag 540- 

taaggttcct tcacaaagat cccaagctag cttataatac gactcactat agggagagag 600 

ctatgacgtc gcatgcacgc gtaagcttgg gcccctcgag ggatcctcta gagcggccgc 660 

cctttttttt tttttttcat cttttattta tttattattt ttttttacta aggcacatga 720 

cgtagaaata ttgaggtaca aaatgcaaat ttctgcataa gatttttaag atattcattt 780 

tggaaaatga aggtgaacat catctcccag aatattcagc ttttagcttg ttttttcttt 840 
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tggaccagtt caaccagcaa cttgtaccta gcgatacagt cttccttgct cttggacggg 900 

acacatctgg ctattttgtc ccagcggtca gaggatcccc ttgggtactg ctgcaacgcc 960 

agttccagaa gtttctgttg attttgagtc cacggctcct ctgcagaccg agctctctct 1020 

tttctcaggc tctcctcgtc gctggactcg ttttgttctg ctatgtcaaa gtccttctgc 1080 

cgcttggctc tggacttctc ctctggctcc ggcttcgctg tagcctccag cagcctggct 1140 

ggcttccgcc tccgaggccg ggcatcagtg gccccggtct cctgctcacc ggagtctccc 1200 

tcctgctcct cctccgctgc caccccctct gcgtcctctc gctgggtgat catgtcatcg 1260 

ggcaaggtgg tggccgtttt gatgggcctg gaattctgaa ctgtcgattt gagttcggag 1320 

agtctaacca ttcctgggga gcaggtcact gaatccttca gttgcttggc tttggttgtc 1380 

acatctgtca cagatcgacc caattcgtgg gcaatctttt cccatcgacc tggagtccct 1440 

cctgggaact taaccatact tcttgtcagt tggctgaggt cctcttctgt ccattcaggt 1500 

gcctgttttt tctgtgttcg gttcctgttt tccaaccaat catccatttg ttcctcaatt 1560 

tcttctatgg aagttccatg atcataagac tgaatatatg tagtttctaa aggtgtgtat 1620 

acaggaaatt caggttttgg ttttttaact ttcttctgtt tttgaagtgt ttcaagttca 1680 

gttctagtca gtgcatcttc cttttccttc aatcttgttt ctttatattt agcataaaac 1740 

tgcccagcat cctggatgag gtgaggtaat gcttttagtg taaggcaaaa ccaaatcccc 1800 

agtttgcatg gaagcaaatc atgccactgt ggtttcatca gcaatctttc atttttttct 1860 

gaagcaccga gttttgatac atccacactc ttgctgccag tctttttttt cttttctctc 1920 

ttttttctac ttagtagttc atccagttgt ttttccaggt agattgacca aaccacagca 1980 

taatgaccca ctgtgagaat aatgaacaag agtaatgcca gctcagcatt gctcattttt 2040 

ctcacccgcc tgtagtagaa tacaggctgt cgccaatctg gaagtccatt gatcagaata 2100 

tcatcatacc tctgccttcg ttcatcatcc tttaaaactt cataaatggc caccaattgt 2160 

ctaaactgag tttctgcatt ttcatcttta ttcttgtctg gatgtaaagt tagtgaaagc 2220 

ttacgatatg cttttctgat gtctgcagat gatgcatcct gctgcacccc gaggaactgg 2280 

tagaagttga gcggacgcgt gggtcgaccc gggaattccg gacc 2324 

<210> 262 

<211> 585 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> (570) . . (570) 

<223> n equals a,t,g, or c 

<400> 262 

gcttcctgca cctggtgacg cttggcgaaa ctgaggtctc atggagaagc cccggagtat 60 

tgaggagacc ccatcttcag aaccaatgga ggaagaggaa gatgacgact tggagctgtt 120 

tggtggctat gatagtttcc ggagttataa cagcagtgtg ggcagtgaga gcagctccta 180 

tctggaggag tcaagtgaag cagaaaatga ggatcgggaa gcaggggaac tgccgacctc 240 

cccgctgcat ttgctcagcc ctgggactcc tcgctccttg gatggcagtg gttctgagcc 300 

agctgtctgt gagatgtgtg gtatcgtggg tacaagggaa gccttcttct ccaagaccaa 360 

gaggttctgc agcgtctcct gctccaggag ctactcctcc aactccaaga aagccagtat 420 

cttggctagg ttacagggaa aaccaccgac caaaaaagcc aaagtcctkc acaaggtgcc 480 

tggtctkcca aaattggagc cttcctccaa tctcaaagga caggacagct ggcaratkgg 540 

acaccaacag gacaagacgc tctggtcttn ggcttcgact ggggg 585 

<210> 263 

<211> 4344 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<222> (754) . . (754) 

<223> n equals a,t,g, or c 

<220> 

<221> mis cofeature 
<222> (2242) . . (2242) 
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V 

<223> n equals a,t,g, or c 
<400> 263 

tgagaatcct tgtggctttg attcggagtg tctgaacagg atgctgatgt ttgagtgcca 60 

cccgcagtgt gtcccgcggg cragttctgc cagaaccagt gcttcaccaa gcgccagtac 120 

ccagagacca agatcatcaa gacagatggc aaagggtggg gcctggtcgc caagagggac 180 

atcagaaagg gagaatttgt taacgagtac gttggggagc tgatcgacga ggaggagtgc 240 

atggcgagaa tcaagcacgc acacgagaac gacatcaccc acttctacat gctcactata 300 

gacaaggacc gtataataga cgctggcccc aaaggaaact actctcgatt tatgaatcac 360 

agctgccagc ccaactgtga gaccctcaag tggacagtga atggggacac tcgtgtgggc 420 

ctgtttgccg tctgtgacat tcctgcaggg acggagctga yttttaacta caacctcgat 480 

tgtctgggca atgaaaaaac ggtctgccgg tgtggagcct ccaattgcag tggattcctc 540 

ggggatagac caaagacctc gacgaccctt tcatcagagg aaaagggcaa aaagaccaag 600 

aagaaaacga rgcggcgcag agcaaaaggg gaagggaaga ggcagtcaga ggacgagtgc 660 

ttccgctgcg gtgatggcgg gcagctggtg ctgtgtgacc gcaagttctg caccaaggcc 720 

taccacctgt cctgcctggg ccttggcaag cggnccttcg ggaagtggga atgtccttgg 780 

catcattgtg acgtgtgtgg caaaccttcg acttcatttt gccacctctg ccccaattcg 840 

ttctgtaagg agcaccagga cgggacagcc ttcagctgca ccccggacgg gcggtcctac 900 

tgctgtgagc atgacttagg ggcggcatcg gtcagaagca ccaagactga gaagcccccc 960 

ccagagccag ggaagccgaa ggggaagagg cggcggcgga ggggctggcg gagagtcaca 1020 

gagggcaaat agcgccaggc ggccgcttgg ccggatccag gggcggtgca gggcggccgg 1080 

ccctgcctgc gggagagggc gagcatgaac tggcccggag gacccagctc gagccgccag 1140 

gacacagacg tacaggcctc ctcgggaggg agcgcctccc caccactgag ccatcctcag 1200 

cagcgtccgc tgcgtctgca ctgatgaccg tctgagccca gctcagcgtt cctggacaaa 1260 

cagcctcact cctcagcgtt accgccacac ttgaatttct ccgaatgtca aggttccctc 1320 

ccactctatt tttttaggtt aaagttaatt ggcatatgga atgttttaat ctcctctgaa 1380 

atgtgtagcg taggcttttc ccaagggtcg ctagaaactc gtcttcgcgt tgcccccttt 1440 

ctggctctca gcgccgtcgc cactcgggag aggctgggtg aggcccgtgt gaggactgac 1500 

cctggattcc tcgaaactgc cattgtgatc attactctgc tctttggaaa tggctgtatc 1560 

atttttttgt actaatgtga attgttcctc agaaacgctt cttttccatc ctagtgagaa 1620 

gctggccctg caggtggtgg cagcaatggt gttgtaagat ttcctcccgt agttttttct 1680 

cctcatggat ttgaatgaaa tgccaataac acgtccactt tcaacgtgta gtttacgcgg 1740 

agcactttcg aggcctggcc gggttgggcc tacttctcac ctgggcctat cttctgaact 1800 

cgctaggttc ttatcaacat ttgggggata actttgtata tttttttcat ttggcttttc 1860 

tttaccagtt tctgattttt attctcaata tatttttgct aaacctattt cacaaatcac 1920 

caccgactga agtgtgtgtt tactgatgcg gccctgagct ccatggcgaa aggagtgact 1980 

ttgcagggcg tgagaccgca gtctgcttag agcacaggaa gtgacaactt agggagcccc 2040 

gtagggcgct gcaggccccg gggaccccag cacgtgggtc taaagagaga cggagtctag 2100 

ctctcctgcc acccagagtg gcttccatct cagcactctg tgggtctggt gatggaagat 2160 

gcagtctctg ctgatcacat gtgccctctg ccagggcacc tactgagagg tgcggtcctg 2220 

ggggtggagg cctgcctggc angtgtgsgt gcctcgtacg tgtgttatgg gcactggtct 2280 

aggccaggta tgacacccac tctyctgtga gatttcactt tagtttttaa aaggtccagt 2340 

tctacagagt gagacctatc tatctgagta ctacatatgt tttaagactt ggttcttttt 2400 

ttgagggatc cttgaccctg ggaagtctgg agcaccctga gaagggggca ccatgtgtgc 2460 

ctttgcccac gtgtcctgag gggctgcttg tctgggaggg agggagagaa cattcagcag 2520 

caggtgcttt tttatggcct tttcttaaaa taacctaagg gggacacatc catcttgcag 2580 

agaagtttac agaactcccc ttgaaaactg ctgctgaggc tcctgttaaa ttttctgtgg 2640 

catcttttat gccttggtaa aaactgcagt gtctttggac ctgagagtgg ctactccgtg 2700 

gttttgtgac ctgtaagcgt ggggttcagg ggtgtgtggc cctgcagggt cccacgcctc 2760 

cctgagcact gactggaagt ttcactggct ggtggctgtc ccttctccca tcagggtccc 2820 

cagcaaagtt aactacacag aggacccagg ggaaacgagc tgtgtagcca ctgacttgct 2880 

cgcgcggccg tggcctctga ggggcactcg ccggttaaga cagggtggga gtagtgcttt 2940 

ccagttcaga ctctaacttc tcccaaagtg tcctaagaaa atactggatc ggctcataga 3000 

tttatgctcc ttatgatgcc ctaacttgga aggttgttct agggacaggc cgggcagtgt 3060 

ccccacacac accttagagt cgaaggcccc agggccccgc tgtcacttgc ccaaaagatc 3120 

ccttccggca ggtaagggac taccaatgct tacgtcaaaa cagcagaatc ggctttgcag 3180 

tgcactttgg ggagcagata ttaacttatt tttgtgttgg acagtagtga aatcttgtga 3240 

tttttaatcg ctttgataat acttccaaat tttatgattt ttctgaagga aataatgcaa 3300 

acattttaaa tatgtttctc cccctttcca aaaactgtta aactaatgag caagtaacac 3360 

taactttgaa tgtctctaca atacccgttg ataactcagt ggagccaggc tttggggtag 3420 

cggccctgag cttgcagggt ttctcgccac tggggctgac cacgccccca gctgtgaccg 3480 



158 



WO 02/076488 PCTAJSO 2/082 76 



tgggtgtggc tggctctcgg ccctgcccag ctttgttctg aggacgtggt gacttcctga 3540 

acatcagctt caatcctcca tcattaatgt gaagcaaaac acaaaaaccg ccccaatccc 3600 

tcaggattcc ttggcatccg aaaccagcat ctgcacctaa acccataccc acccgtgtgc 3660 

gcccacaggg ggatgtgtcc gaatgggcag cttaaaatgt ggtcacctgt gggggaaact 3720 

cttcaggcac ctgaagtgag aacccagctg tccgtcctca ggccggcctt tcttccggcg 3780 

acacccgtcc atggctggct gggtcccctt cgcagtgttt gtctgtcttg acatctaaac 3840 

cccggcgtgt gcagtgccca tcttccagga ctaccttatt ttccagaatt aaacctgttt 3900 

tataattcaa gttaatgcaa atgactgtca gttgccaaat atcttgatcc tatgagtgta 3960 

gttgatgact gtttgttagt cagtagagta aaatgctgtg tccacggggt gtcacagcct 4020 

caccataccc tgttgaggtg tgaaatgccc cgtcagaaat taaatacaaa cttaaatgtg 4080 

cctattggtg tctaaacttc atacaatgta aggtcagatt ccttttagga atactgggtg 4140 

ctgtcaccag gtttgatagt tagacttaaa aacttgaaat tcactttttg gggggaggga 4200 

tatactgaaa tagagagttg agacttgcca gttgggggaa aatagcattt aaaatggaaa 4260 

gctgtgtttg gaaaattgtg tatgagtatt tttgtattaa aaacatttta aaggcttttt 4320 

tcttaaaaaa aaaaaaaaaa aaaa 4344 

<210> 264 

<211> 1258 

<212> DNA 

<213> Homo sapiens 

<400> 264 

aaccctcact aaagggaaca aaagctggag ctccaccgcg gtggcggccg ctggctgacc 60 

ggcctaaaac taaaatgaca tttattccct agctacaaac atcagcgtta ttatgttaat 120 

tataccttgc cctctatcat tataaatggt tgccatggtg tttctaaaaa taagtgtttt 180 

accattaatg tgtagagggc aaacaaagca taaagtacta agggatcatg cttatcctag 240 

ggtctcacag aagagaggac atatttaatt aatcttgtga attacagaac aggttgtggt 300 

ccagacacca agaatcatag gggttttttt ttaaaaaacc taatagaagt agggtgacct 360 

ctctctttgg tctaagagtt ctaaaggaag gtaggcatct gtttaattag ttggttcacc 420 

ctggctttac ctctggttaa tgcttgtgtt aataggaagg aaaaatcact ttatcttttc 480 

ttccaagccc ctccctgcct gacttaccca gactgggatt accagatacc aggtgattta 540 

tgtggagatg atttttcacc tttaaactct aagccaagtg taagaaactc ttgatagcta 600 

tgtctatttt atatcagtca ctgagacttt tttttaagtt tttatttatt attaagacaa 660 

ctttgccaaa aaagtcccct aagcacaact atttacattt ctttatagcc tcttctgatc 720 

tctaacacat atgcagtttt aactgttatt ttcatagtaa ctgatctttt gtctaaggat 780 

ttttacctga aagcacaatg tattgagtct cttgaaaatc atctttcaga tctttttaca 840 

gaatgaactt atgcactgct actgtagtat tctcaaggaa tatatgtaaa cacaaatgta 900 

tgcctgaggt tggtttttgc agaaaacagt ctctgcttct aaaaacttct atgtctagtc 960 

ttccatagga aatcctcact gtttaaccat gtgaggagcc taagtcatta aacggatcat 1020 

gtctgtacat tgtgtaatga atgaaaagca cataaatgta atctactttg aactttgtaa 1080. 

aaatgatgtg tggaggctat tcttgtttct ccatctcaag tcctgtgtgt gcacgtgtgt 1140 

gcaagtgcac atgtgtgtgt gtaataacac attgtaaaga acagaaatta ctttaaaaaa 1200 

taaacagaaa tggagacctg aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1258 

<210> 265 
<211> 1739 
<212> DNA 

<213> Homo sapiens 
<400> 265 

ggcacgagag atcctcagga tatctttagc caaaggaaaa gctccgcatt cccacctggg 60 

gggaaagctg gattgccatg ggcacgaata gtggtgcaga gtccctggcc atcctgaata 120 

tccagaatgg tgtttctgaa gttcttctgc atgagtttct tctgccacct gtgtcaaggc 180 

tacttcgatg gccccctcta cccagagatg tccaatggga ctctgcacca ctacttcgtg 240 

cccgatgggg actatgagga gaacgatgac cccgagaagt gccagctgct cttcagggtg 300 

agtgaccaca ggcgctgctc ccagggggag gggagccagg ttggcagcct gctgagcctc 360 

accctgcggg aggagttcac cgtgctgggc caccaggtgg aaggatgctg ggcgcgtgct 420 

ggagggcatc agcaaaagca tctcctacga cctagacggg gaagagagct atggcaagta 480 

cctgcggcgg gagtcccacc agatcggggg atgcctactc caactcggac aaatccctca 540 

ctgagctgga gagcaagttc aagcagggcc aggaacagga cagccggcag gagagcaggc 600 

tcaacgagga ctttctggga atgctggtcc acaccaggtc cctgctgaag gagacactgg 660 
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acatctctgt ggggctcagg gacaaatacg agctgctggc cctcaccatt aggaccatgg 720 

gacccgacta gtcggctgaa aaatgattat cttaaagtat aggtggaagg atacaaatgc 780 

ttagaaagag ggaatcaaat cagccccgtt ttggaaggtg ggggacagaa aatggggcta 840 

catttccccc atacctacta tttttttata tcccgatttg cactttgaga atacatctaa 900 

ggtcatcttt caaaagagaa aaattggaca cttgagtgac tttgttttta gttttgtttt 960 

tgaacattat ttatgtgatt gttatggaat tgtcacctgg aaagaacaat tttaagcaat 1020 

gtcatttcta gatgggtttc taattctgca gagacacccg tttcagccac atctaaaaga 1080 

gcacagttta tgtggtgcgg aattaaactt ccccatcctg cagattatgt ggaaataccc 1140 

aaagataata gtgcatagct cctttcagcc tctagccttc actcctgggc tccaaaagct 1200 

atcccagttg cctgtttttc aaatgaggtt caaggtgctg ctttgcatgc ctgccaaccc 1260 

atggaagttg tttcttactt cttttctctc ttatttatta accatggtct gagagttgtt 1320 

tttgttctat gtaacagtat tgccacaaaa ctataggcaa atcgtgtttg cagggagatt 1380 

tctgatgcct ctgtgggtgt gtgtaagtta aagtggccac atttaagaag gccaagcttt 1440 

gtagtggttg cacagtcaca ctgatatgct gatttgctct ttctcattgt atgtctatgc 1500 

tttgtcatca gtgctatagt aaattacaaa gaaataggta gattgtatga acatacccac 1560 

aaatgcctat gatttaggtt accaatgtat tctttctcat ttggggtttt gcttctgtct 1620 

gtctgtttat tggaaacttg tacttcaagt agggggaatc ctaattctaa taactcctta 1680 

gctaagtttt attattcagg caataaacat gttttcatgt aaaaaaaaaa aaaaaaaaa 1739 

<210> 266 
<211> 538 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (462) . . (462) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (498) . . (498) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (520) . . (520) 
<223> n equals a,t,g, or c 

<400> 266 

gcttgtaggt actcattgag gtttattgtg taagatgaat gaatgttgca aattcctaaa 60 

catgtgattc agatgcccaa tcttactctg ttactttatg aaaatttttt aaagctatat 120 

gatgttatat caaaatatgt tgttatactt taggataatc ggtgtgttag ccctgaattt 180 

cagcataagt cccatttttt tccatgggag tctaggaaag ctatatgttt attcagcagc 240 

aaaatacagt ttggaactta aataaactat tgatcaattc tggtcttatg ctagaaggaa 300 

taaagcatca agaaaaagaa aagattgctg tcaagaccag gaaaattgac aatagagtat 360 

tagaatgcag aaatgagggg aagtggaaar gccascaagt aggagagaaa aagtgcaggg 420 

acagtagaaa gtgaatgtag gagcttctga cccagcactc angaacgcaa ttcatcccta 480 

aaaagctgtt gcgtctangt tgccagtaac caattaaaan ccgtttgaag tagagtga 538 

<210> 267 
<211> 1346 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> (5) . . (5) 

<223> n equals a,t,g, or c 

<220> 
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<221> misc_feature 

<222> (17) . . (17) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 

<222> (21) . . (21) 

<223> n equals a,t,g, or c 

<220> 

<221> misc__f eature . 

<222> (36).. (36) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc_jE eature 
<222> (107) . . (107) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (150) . . (150) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (323) . . (323) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (1307) . . (1307) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (1337) . . (1337) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (1341) . . (1341) 
<223> n equals a,t,g, or c 

<220> 

<221> misc__feature 
<222> (1343) (1343) 
<223> n equals a,t,g, or c 

< <400> 267 

tcccntcaag gccgggnggc ntgttcttgg cctttncaac ttttgccttt ccactattta 60 

gaatgcaggt gtgtctgtcc tgcaacccag cttctggctc tggaaancca gctgcatcac 120 

ccatgtgcct ggaccttctc cagaccatgn aggcccaggc gagtgactca ctgccattca 180 

gtctccatct ttgggcagat ccaccatgag acataacttc ccagaaatcc agttacaagg 240 

aggaataagt attgaagact taagaaatgc attttgcagc aggtcctcgc tgtactgggg 300 

cagcggtcca ttcatagagc ccngctagaa tagaggtcac aagctcagaa gcttctctaa 360 

ggcaggcagg aaatttaagt cgatactatg atctgcattg tgggctggaa tgaacggaag 420 

gtgcctagtc taaacagctg cttgttgctc agctgttgtt gccgtattgg gaattcaagc 480 

ctaatgatgt ttggtattcc cattttcaaa agaagtcagg aaatgcagat ttctatgtaa 540 

atttttaaaa cttctgaact gtgtatgagc catacaaaat acatttgcag gccagtcgac 600 

atcctctgat ccagaatatc aatttgtgag acaagttgtt ggtgaggcag cattmcatag 660 
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tagttaaaag catacatttt agagccagac tgcccatgtc caaaccctgg tcccatcact 720 

cactmcctty catttcactt ctctttgctt cactttcctc atcagtaaaa taaaaataat 780 

atcagtacct acctcatagg gtttcatgag cattaaataa attaaaaccc ataaagtact 840 

ttcaatgcta tcaggcattt agttacatgg taaataagtg tttaaaacat ttaaaacaaa 900 

agttcaaaga taataarcaa ggaaacagaa aacctgacag gccagctttg gaaccttctt 960 

gatggcagat ctatcaacat ttctcccttt ggctgggatg aaaaggcatt tgggaataaa 1020 

agatcccata aaaataaatg agaagaagtg aaacaccttc attafcggcaa ttttggtgtc 1080 

agagcctaaa agacagaggg atcaaaatat tgcagtacta aaatctgatg gtttcatgta 1140 

gaaatgagat ttaagcttat taaaatgtat ttctttctgg tgttaataac ccttcacaag 1200 

acctgggcaa ttttgaaaga aggaaagaaa atggttctcc ctgtggacaa raagaaacaa 1260 

atgactatta aattttctaa tttggagtga actagggkgr ttccccnagt ttatgtggga 1320 

aggtttcagt gagggtngta ngngaa 1346 

<210> 268 
<211> 912 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mi sc_f eature 
<222> (36) . . (36) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 

<222> (93) . . (93) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (158) ..(158) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 

<222> (592).. (592) 

<223> n equals a,t,g, or c 

<400> 268 

aaaatggttc tccctgtggc aagaagaacc aaatgnctat taaattttct aatttggagt 60 

gaactaagtt gatccctagg tttttgtggg agnccggggt gggggagtcc agtggaaagc 120 

aattgctgga gagtagtcct tgttctttgc tgacaganca ggagcagagt gtggaatgaa 180 

aactcaatag cctcctctat tctcaagaga caattgactt ccatctgttt aaacctcccc 240 

aggggaccct gctcccccca tttccattta ctctcctttc caccaaccta gggtgacatt 300 

aagaaaacca aacccatttg aaacacaagc tcttacacat caaaagtcag gggagaagtc 360 

tggttgacct gtaagccact gcatgaggca caaagatgca aaaaggaact ttcaggaaca 420 

actgctgctc cgaggactct atgtcagata taacatccgc tttggcccaa aagtaggctt 480 

gagccccaga agaggaggaa tgtcmagtat gtttaaaatg tgaaaccttt agttatactt 540 

gctctttact cagaaaggag agagtattcc cttatgccaa cgaggtctct gngagttgtt 600 

tgcactattg gtagcaggtg ctgcctgggg tagctcttat ggtctgtgct tgaagtgtgc 660 

accagctgct gccctggaca tgactgttgg tccctgcata caagcagcca cctttaaaca 720 

gatcaaatga ctcttatgat gacagctgtc tcactrtact ttcaaactgg ttttaatttg 780 

gttacttgca acctaagaca gcaracagca ttttagggat gaattgcgtt cctgaagtgc 840 

atgggtcaga aagctcmtac attcacttty tactgtccct gcactttttc tatcctactt 900 

gctgcccttc ca 912 

<210> 269 

<211> 1177 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 
<222> (1095) . . (1095) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (1115) . . (1115) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (1142) . . (1142) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (1162) . . (1162) 
<223> n equals a,t,g, or c 

<400> 269 

agccaccatg cccggcctag attaaaaatt tgaagacata ttctctacta tgagccaatg 60 

aaattactca ttttgtttct atcccatttg ctgtcccttg cttttggaat tttgtgtctt 120 

agtgtgactg tgattctttc tctccttttg tctttcagca aacggggatt cagcgtccga 180 

tcctttggaa cagggactca cgtgaagctt ccaggaccag ctcccgacaa gcccaatgtt 240 

tatgatttca aaaccacata tgaccagatg tacaatgatc ttcttaggaa agacaaagaa 300 

ctctatacac agaatgggat tttacatatg ctggacagaa ataagagaat caagccccgg 360 

ccagaaagat tccagaactg caaagacctg tttgatctga tcctcacttg cgaagagaga 420 

gtgtatgacc aggtggtgga agate tgaat tccagagaac aggagacctg ccagccygtg 480 

cacgtggtca atgtggacat ccaggacaac cacgaggagg ccaccctggg ggcgtttctc 540 

atctgtgagc tetgecagtg tatccagcac aeggaagaca tggagaacga gatcgacgag 600 

ctgetgeagg agttcgagga gaagagtggc cgcacctttc tgcacaccgt ctgcttctac 660 

tgagcccagc gcccgcatgg agccgcctct ggagcttcct gttgttcata ctttttcctt 720 

cctgacattt gtttttactt acaggtgttc tgctggtgac ggtagcatta cccaaataaa 780 

ctgtgcatat gaaatgggag aggagatgee aaaacgccag atgaaagcaa tcaagtttct 840 

tcttttccac ttttacttat gagcrggata ttgattacaa agtttttctt ctttaaccaa 900 ■ 

aaaggaaaga caacggtttg tgtgcacttc ccgacatacc tgtgtcttcg tgtgcctgcc 960 

ttccctccct cctccccacc gggceggact gtacagagee ctgctgcggc gtgttaggaa 1020 

tgacctggaa ttgtcaataa acagatgetg ctgtcaaaaa aaaaaaaaaa aaaaaaaaaa 1080 

aaaaaaaaaa raaancaaaa aaaaaaaaaa aaggnggggc cgaaggtttt ttccctttgg 1140 

tnggggttat ttttggcttg gnattggect tcgtttt 1177 

<210> 270 

<211> 1775 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 
<222> (820) . . (820) 
<223> n equals a,t,g, or c 



<400> 270 



gcggcgcggg tgggggttgt gcgttttacg caggctgtgg cagcgacgcg gtccccagcc 
tgggtaaaga tggccccatg gcccccgaag ggcctagtcc cagctgtgct ctggggcctc 
agcctcttcc tcaacctccc aggacctatc tggctccagc cctctccacc tccccagtct 
tctcccccgc ctcagcccca tccgtgtcat acctgccggg gactggttga cagctttaac 
aagggcctgg agagaaccat ccgggacaac tttggaggtg gaaacactgc ctgggaggaa 
gagaatttgt ccaaatacaa agacagtgag acccgcctgg tagaggtget ggagggtgtg 
tgcagcaagt cagacttcga gtgccaccgc ctgctggagc tgagtgagga gctggtggag 



60 
120 
180 
240 
300 
360 
420 
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agctggtggt ttcacaagca gcaggaggcc ccggacctct tccagtggct gtgctcagat 480 

tccctgaagc tctgctgccc cgcaggcacc ttcgggccct cctgccttcc ctgtcctggg 540 

ggaacagaga ggccctgcgg tggctacggg cagtgtgaag gagaagggac acgagggggc 600 

agcgggcact gtgactgcca agccggctac gggggtgagg cctgtggcca gtgtggcctt 660 

ggctactttg aggcagaacg caacgccagc catctggtat gttcggcttg ttttggcccc 720 

tgtgcccgat gctcaggacc tgaggaatca aactgtttgc aatgcaagaa gggctgggcc 780 

ctgcatcacc tcaagtgtgt agactgtgcc aaggcctgcn taggctgcat gggggcaggg 840 

ccaggtcgct gtaagaagtg tagccctggc tatcagcagg tgggctccaa gtgtctcgat 900 

gtggatgagt gtgagacaga ggtgtgtccg ggagagaaca agcagtgtga aaacaccgag 960 

ggcggttatc gctgcatctg tgccgagggc tacaagcaga tggaaggcat ctgtgtgaag 1020 

gagcagatcc cagagtcagc aggcttcttc tcagagatga cagaagacga gttggtggtg 1080 

ctgcagcaga tgttctttgg catcatcatc tgtgcactgg ccacgctggc tgctaagggc 1140 

gacttggtgt tcaccgccat cttcattggg gctgtggcgg ccatgactgg ctactggttg 1200 

tcagagcgca gtgaccgtgt gctggagggc ttcatcaagg gcagataatc gcggccacca 1260 

cctgtaggac ctcctcccac ccacgctgcc cccagagctt gggctgccct cctgctggac 1320 

actcaggaca gcttggttta tttttgagag tggggtaagc acccctacct gccttacaga 1380 

gcagcccagg tacccaggcc cgggcagaca aggcccctgg ggtaaaaagt agccctgaag 1440 

gtggatacca tgagctcttc acctggcggg gactggcagg cttcacaatg tgtgaatttc 1500 

aaaagttttt ccttaatggt ggctgctaga gctttggccc ctgcttagga ttaggtggtc 1560 

ctcacagggg tggggccatc acagctccct cctgccagct gcatgctgcc agttcctgtt 1620 

ctgtgttcac cacatcccca caccccattg ccacttattt attcatctca ggaaataaag 1680 

aaaggtcttg gaaagttaaa aaaaaaaaaa aaaaaaaaaa aaaaaactcg agggggggcc 1740 

cgtacccaat cgccctatga tgtagtcgta ttaca 1775 

<210> 271 
<211> 866 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<222> (14).. (14) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 

<222> (27).. (27) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 

<222> (33) . . (33) 

<223> n equals a,t,g, or c 

<220> 

<221> misc__ feature 
<222> (105) . . (105) 
<223> n equals a,t,g, or c 

<400> 271 

cctggcttgc tggncaagcc ttggtgncca tgntgaacaa gttttgtgga agttctgggg 60 

agactccaag aactaccagg aacagggata cgagtgccag gctgnatctc ttgctcctct 120 

gcagagtcag caggcttctt ctcagagatg acagaagacg agttggtggt gctgcagcag 180 

atgttctttg gcatcatcat ctgtgcactg gccacgctgg ctgctaaggg cgacttggtg 240 

ttcaccgcca tcttcattgg ggctgtggcg gccatgactg gctactggtt gtcagagcgc 300 

agtgaccgtg tgctggaggg cttcatcaag ggcagataat cgcggccacc acctgtagga 360 

cctcctccca cccacgctgc ccccagagct tgggctgccc tcctgctgga cactcaggac 420 

agcttggttt atttttgaga gtggggtaag cacccctacc tgccttacag agcagcccag 480 

gtacccaggc ccgggcagac aaggcccctg gggtaaaaag tagccctgaa ggtggatacc 540 

atgagctctt cacctggcgg ggactggcag gcttcacaat gtgtgaattt caaaagtttt 600 

tccttaatgg tggctgctag agctttggcc cctgcttagg attaggtggt cctcacaggg 660 
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gtggggccat cacagctccc tcctgccagc tgcatgctgc cagttcctgt tctgtgttca 720 

ccacatcccc acaccccatt gccacttatt tattcatctc aggaaataaa gaaaggtctt 780 

ggaaagttaa aaaaaaaaaa aaaaaaaaaa aaaaaaactc gagggggggc ccgtacccaa 840 

tcgccctatg atgtagtcgt attaca 866 

<210> 272 

<211> 1237 

<212> DNA 

<213> Homo sapiens 

<400> 212 

agcaaaccca ggaaggtgtg gcgtccccgc ttcgcgccaa gatggtgctg gtgctgcgcc 60 

atcctttgtg tgcccgggaa agggcgttcc gggagccggg tcgggggctc ctgactcgca 120 

ctgggcagca tgacggtgcg ccggctgtca ctgctgtgcc gggacctctg ggcgctgtgg 180 

ctgctgctga aggccggcgc agtgcgtggg gcgcgggcgg gtcctcgcct ccccggaagg 240 

tgttgtgggg cgacatgcgg ggacgccggg cgggggtgga cgttctgggc ccagccctgt 300 

cctcagaagc tgctggggca gaagcccggg gctgggggat gccggggatg ggtgttgggg 360 

tgggtgcctc cgagaccaga ggagccctgt tccttggcag ggaaggtgtg cacgggcctt 420 

gcccgatgga tggtttaggg ccatggccct ggggtccctg gtgagcagtg gggccgcctc 480 

tgcccttggc ctgtgaggga ctgtctgtgc tggtcccaga aggctgggat cacctttcca 540 

ctggctcctt tgttcgaggt ttttcataga caggctatgt ggacaaatga gggcagcgcc 600 

cacgtctggc tggtggaggg gctgcggctc ctccttggag gggacgcctg gccactgctg 660 

tccccacaat ggggccaccc gtggtgcaag gcgtgacaag ctgccctctc taggtaagca 720 

ggacttggga ggcccctggc caagcctgtg gacccggctg ggcggcctct gtggtctcag 780 

gtttgggtgt gtttggtctg gtcagggctc aggggctgct ggtccacact ggccccatcc 840 

tgacaattgg agctttgggg caaggtccct ggagaagggg tcacgtcggg aggaaacagc 900 

ctgggttttg ttgatgcttt tctaagaatg gagtactcgt tttcaagaga tttgtcctaa 960 

ttatattttc cagcgggtac ttatgccaag tattgatgaa taattcataa aataagcatc 1020 

tttgtgaatt ttagtgaatc agaccttaac tatcaacggc aatgaatgaa catctaaagt 1080 

ttccaatttt aaagtaaaga actggctggg tacagcagtt cacgcctgta atcccagcac 1140 

tttgggaggc caaggctaga ggatcgcttg agcccaggag tttgagatca gcctgggcaa 1200 

cataccaaga cctcatctgt taaaaaaaaa aaaaaaa 1237 

<210> 273 

<211> 1681 

<212> DNA 

<213> Homo sapiens 

<400> 273 

cgatggcccc gcggccgctc tagaaagtcc cgtttttttt tttttttttt tttttttttt 60 

ttttagagta cgttctgcat tttatttytg caggcaacac tttgctcacc agcaagaaca 120 

cagcccragg aagggaccca ataacctttc aaaacscaaa ctgctkcctg cggtgagggc 180 

ccagggtcct ccacggagag gacaggcatc ttcctttccc accaggaagg agtcagcccg 240 

gagcctctgc tatgtgcaag gcggtgtgca agcaccggct gcggctcttt gctgtctctt 300 

ctttctcttt ggggctgggc tgggtgtgcg ttctggtgct gatgctttgg cctgtgaggc 360 

tgagcttggc ayctcgaccc gttcaattac agcaacgaag aagccactgc tragygtggt 420 

ctcaggggar gcccggaggc agtgctcggc acccgggaac gtgctcaggc ctcggtgggg 480 

ccaggcaggc agggcgggag ctagcctgaa ggcgcccggg ttctgctgca gcgcatctcg 540 

caccacgtct tcattctcct cctggcagag ggagcacgtg gagtagacga gccgctgcag 600 

ggaagggaaa gtgagcgcgt ggcacagggc tcgctgctgg aaccctgcca gggcatgcag 660 

acgcaccggg ctaggtgtsc ctgccccggg mtcctccagc tgtctgctcg gcatacccga 720 

gccactgcag gaaggatcca gcaggayrta gtggacctca ygrtagcgyg gatcyraggg 780 

ggagaccgcc aggaagtcct cctcagccag ytcacagcar gagacgccag cccrggccag 840 

cagcgtggcc atggatgcca gccgcttggc atccaggtca aaggcaaaga tcttcccttg 900 

gttcttcaga agagcagcca agtgactggt cttattgcct ggggcggcac aggcatcgat 960 

gacatgggag cctggcgggg ggtccagcag catggctggg agacagctgg ccctgtcctg 1020 

cagaatgagg tgtccggccc ggtacagtgg gtgttcatgc agate tgtct gggcgggaaa 1080 

caccagcagc tccggcatca aggggtccag gagaaaatgc ttccccttga gggctcgtaa 1140 

gtcatcgagg ctggaagccc gaccctgata ggagaaacct tgtctcttga aataatcaac 1200 

tacatcatcg gagcaggtct tgagagtgtt cacacgcaca aatcgaggca gctgggaggc 1260 

tggaccaggc ctggatccca cttccaacag gtcctcattc cggctcacac cccgatgaac 1320 
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cttgagccga gccaactcag ccttgagcct cgcctggtgc cggcccaaca gagccttcca 1380 

tcggccccca ccccctcgaa agccctttcc caacaacaac tcatacacta gcaccttggc 1440 

caggtgcggc cgcagcttct tctccgcacg gaggaggccg gcgctggcga tcacagcatc 1500 

cagcacggcg gagtagcgct gcgtttcgca caccagcgcg tacagctgct tcacgttctg 1560 

gaagttgctg gagtacacca accccttgat agagcctggc ggctctccac gccggccaac 1620 

acgcctgcag ctgcagcata cagccccatg ttccgtcgcg ctttacggct ttgtggcaaa 1680 

a 1681 

<210> 274 
<211> 1863 
<212> DNA 

<213> Homo sapiens 
<400> 274 

gactaggccg cgagcttagt cctgggagcc gcctccgtcg ccgccgtcag agccgcccta 60 

tcagattatc ttaacaagaa aaccaactgg aaaaaaaaat gaaattcctt atcttcgcat 120 

ttttcggtgg tgttcacctt ttatccctgt gctctgggaa agctatatgc aagaatggca 180 

tctctaagag gacttttgaa gaaataaaag aagaaatagc cagctgtgga gatgttgcta 240 

aagcaatcat caacctagct gtttatggta aagcccagaa cagatcctat gagcgattgg 300 

cacttctggt tgatactgtt ggacccagac tgagtggctc caagaaccta gaaaaagcca 360 

tccaaattat gtaccaaaac ctgcagcaag atgggctgga gaaagttcac ctggagccag 420 

tgagaatacc ccactgggag aggggagaag aatcagctgt gatgctggag ccaagaattc 480 

ataagatagc catcctgggt cttggcagca gcattgggac tcctccagaa ggcattacag 540 

cagaagttct ggtggtgacc tctttcgatg aactgcagag aagggcctca gaagcaagag 600 

ggaagattgt tgtttataac caaccttaca tcaactactc aaggacggtg caataccgaa 660 

cgcagggggc ggtggaagct gccaaggttg gggctttggc atctctcatt cgatccgtgg 720 

cctccttctc catctacagt cctcacacag gtattcagga ataccaggat ggcgtgccca 780 

agattccaac agcctgtatt acggtggaag atgcagaaat gatgtcaaga atggcttctc 840 

atgggatcaa aattgtcatt cagctaaaga tgggggcaaa gacctaccca gatactgatt 900 

ccttcaacac tgtagcagag atcactggga gcaaatatcc agaacaggtt gtactggtca 960 

gtggacatct ggacagctgg gatgttgggc agggtgccat ggatgatggc ggtggagcct 1020 

ttatatcatg ggaagcactc tcacttatta aagatcttgg gctgcgtcca aagaggactc 1080 

tgcggctggt gctctggact gcagaagaac aaggtggagt tggtgccttc cagtattatc 1140 

agttacacaa ggtaaatatt tccaactaca gtctggtgat ggagtctgac gcaggaacct 1200 

tcttacccac tgggctgcaa ttcactggca gtgaaaaggc cagggccatc atggaggagg 1260 

ttatgagcct gctgcagccc ctcaatatca ctcaggtcct gagccatgga gaagggacag 1320 

acatcaactt ttggatccaa gctggagtgc ctggagccag tctacttgat gacttataca 1380 

agtatttctt cttccatcac tcccacggag acaccatgac tgtcatggat ccaaagcaga 1440 

tgaatgttgc tgctgctgtt tgggctgttg tttcttatgt tgttgcagac atggaagaaa 1500 

tgctgcctag gtcctagaaa cagtaagaaa gaaacgtttt catgcttctg gccaggaatc 1560 

ctgggtctgc aactttggaa aactcctctt cacataacaa tttcatccaa ttcatcttca 1620 

aagcacaact ctatttcatg ctttctgtta ttatctttct tgatactttc caaattctct 1680 

gattctagaa aaaggaatca ttctcccctc cctcccacca catagaatca acatatggta 1740 

gggattacag tgggggcatt tctttatatc acctcttaaa aacattgttt ccactttaaa 1800 

agtaaacact taataaattt ttggaagatc tctgaaaaaa aaaaaaaaaa aaagggcggc 1860 

cgc 1863 

<210> 275 

<211> 1134 

<212> DNA 

<213> Homo sapiens 

<400> 275 

tccatctaca gtcctcacac aggtattcag gaataccagg atggcgtgcc caagattcca 60 

acagcctgta ttacggtgga agatgcagaa atgatgtcaa gaatggcttc tcatgggatc 120 

aaaattgtca ttcagctaaa gatgggggca aagacctacc cagatactga ttccttcaac 180 

actgtagcag agatcactgg gagcaaatat ccagaacagg ttgtactggt cagtggacat 240 

ctggacagct gggatgttgg gcagggtgcc atggatgatg gcggtggagc ctttatatca 300 

tgggaagcac tctcacttat taaagatctt gggctgcgtc caaagaggac tctgcggctg 360 

gtgctctgga ctgcagaaga acaaggtgga gttggtgcct tccagtatta tcagttacac 420 

aaggtaaata tttccaacta cagtctggtg atggagtctg acgcaggaac cttcttaccc 480 
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actgggctgc aattcactgg cagtgaaaag gccagggcat catggaggag gttatgagcc 540 

tgctgcagcc cctcaatatc actcaggtcc tgagccatgg agaagggaca gacatcaact 600 

tttggatcca agctggagtg cctggagcca gtctacttga tgacttatac aagtatttct 660 

tcttccatca ctcccacgga gacaccatga ctgtcatgga tccaaagcag atgaatgttg 720 

ctgctgctgt ttgggctgtt gtttcttatg ttgttgcaga catggaagaa atgctgccta 780 

ggtcctagaa acagtaagaa agaaacgttt tcatgcttct ggccaggaat cctgggtctg 840 

caactttgga aaactcctct tcacataaca atttcatcca attcatcttc aaagcacaac 900 

tctatttcat gctttctgtt attatctttc ttgatacttt ccaaattctc tgcattctag 960 

aaaaaggaat cattctcccc tccctcccac cacatagaat caacatatgg tagggattac 1020 

agtgggggca tttctttata tcacctctta aaaacattgt ttccacttta aaagtaaaca 1080 

cttaataaat ttttggaaga tctctgaaaa aaaaaaaaaa aaaaaaaaaa aaaa 1134 

<210> 276 
<211> 626 
<212> DNA 

<213> Homo sapiens 
<400> 276 

gcccacgcgt ccgcctaaac acagtcacca tgaagctggg ctgtgtcctc atggcctggg 60 

ccctctacct ttcccttggt gtgctctggg tggcccagat gctactggct gccagttttg 120 

agacgctgca gtgtgaggga cctgtctgca ctgaggagag cagctgccac acggaggatg 180 

acttgactga tgcaagggaa gctggcttcc aggtcaaggc ctacactttc agtgaaccct 240 

tccacctgat tgtgtcctat gactggctga tcctccaagg tccagccaag ccagtttttg 300 

aaggggacct gctggttctg cgctgccagg cctggcaaga ctggccactg actcaggtga 360 

ccttctaccg agatggctca gctctgggtc cccccgggcc taacagggaa ttctccatca 420 

ccgtggtaca aaaggcagac agcgggcact accamtgcag tggcatcttc cagagccctg 480 

gtcctgggat cccagaaaca gcatctgttg tggctatcac agtccaagaa ctgtttccag 540 

cgccaattct ccttctacaa ggatggaagg atagtgcaaa gcaggggggc tctcctcaga 600 

attccagatc cccacagctt cagaaa 626 

<210> 277 
<211> 152 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> (41) . . (41) 

<223> n equals a,t,g, or c 

<400> 277 

cagcccagct tcatggtgac tgtgtttagg tctccctcgt nccgaattcc tgcagcccgg 60 

gggatccact agttctagag cggccgccac cgcggtrgag ctccagcttt tgttcccttt 120 

agtgagggtt aatttcgagc ttggcgtaat ca 152 

<210> 278 
<211> 1760 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (1693) . . (1693) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (1748) . . (1748) 
<223> n equals a,t,g, or c 

<400> 278 
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gggaattctc catcaccgtg gtacaaaagg cagacagcgg gcactaccac tgcagtgcat 60 

cttccasagc cctggtcctg ggatcccaga aacagcatct gttgtggcta tcacagtcca 120 

agaactgttt ccagcgccaa ttctcagagc tgtaccctca gctgaacccc aagcaggarg 180 

ccccatgacc ctgagttgtc agacaaagtt gcccctgcag aggtcagctg cccgcctcct 240 

cttctccttc tacaaggatg gaaggatagt gcaaagcagg gggctctcct cagaattcca 300 

gatccccaca gcttcagaag atcactccgg gtcatactgg tgtgaggcag ccactgagga 360 

caaccaagtt tggaaacaga gcccccagct agagatcaga gtgcagggtg cttccagctc 420 

tgctgcacct cccacattga atccagctcc tcagaaatca gctgctccag gaactgctcc 480 

tgaggaggcc cctggctctg cctccgccgc caaccccatc ttctgaggat ccaggctttt 540 

cttctcctct ggggatgcca gatcctcatc tgtatcacca gatgggcctt cttctcaaac 600 

acatgcagga tgtgagagtc ctcctcggtc acctgctcat ggagttgagg gaattatctg 660 

gccaccrgaa gcctgggacc acaaaggcta ctgctgaata gaagtaaaca gttcatccat 720 

gatctcactt aaccacccca ataaatctga ttctttattt tctcttcctg tcctgcacat 780 

atgcataagt acttttacaa gttgtcccag tgttttgtta gaataatgta gttaggtgag 840 

tgtaaataaa tttatataaa gtgagaatta gagtttagct ataattgtgt attctctctt 900 

aacacaacag aattctgctg tctagatcag gaatttctat ctgttatatc gaccagaatg 960 

ttgtgattta aagagaacta atggaagtgg attgaataca gcagtctcaa ctgggggcaa 1020 

ttttgccccc aagaggacat tgggcaatgt ttggagacat tttggtcatt atacttgggg 1080 

ggttggggga tggtgggatg tgtgtgctac tggcatccag taaatagaag ccaggggtgc 1140 

cgctaaacat cctataatgc acagggcagt accccacaac gaaaaataat ctggcccaaa 1200 

atgtcagttg tactgagttt gagaaacccc agcctaatga aaccctaggt gttgggctct 1260 

ggaaatggga ctttgtccyt tctaattatt atctctttcc agcctcattc agctattctt 1320 

actgacatac cagtctttag ctggtgctat ggtctgttct ttagttctag ttfcgtatccc 1380 

ctcaaaagcc attatgttga aatcctaatc cccaaggtga tggcattaag aagtgggcct 1440 

ttgggaagtg attagatcag gagtgcagag ccctcatgat taggattagt gcccttattt 1500 

aaaaaggccc cagagagcta actcaccctt ccaccatatg aggacgtggc aagaagatga 1560 

catgtatgag aaccaaaaaa cagtgtcgcc aaacaccgac tctgtcgttg ccttgatctt 1620 

gaacttccag cctccagaac tatgagaaat aaaattctgt tgtttgtaag ctaatccagt 1680 

tgtgtaattt ggnatagtag cccaaatgga ctaggcagtt ggcctctggc cacatgatga 1740 

gttatggnat gtaaaaatac 1760 

<210> 279 
<211> 880 
<212> DNA 

<213> Homo sapiens 
<400> 279 

ggcacgagac tggatgaaca caaactccac atgtatcttt ctgccttgca gtccttgatc 60 

ccatctctct ttgcattagt gctacagaat gcacctttct ccagcaaagc caagcttcat 120 

ggtgaagtgc cacagataga agtgactagg tttcctcggc ctatgtcgcc tcttcaagat 180 

gtgtccacta ttatcggaag tcgtgagcaa ttggcagtgc tgctgcaact ttatgactac 240 

cagctagaac aagagggtac aacaggctgg gagagtttac tgtgggttgt caatcaattg 300 

ttgccacaac ttatagaaat agttggcaaa attaatgtta cttcaactgc ctgtgtccat 360 

gaattctcca gatttttctg gcgcctttgc cggacatttg gcaaaatttt tacaaacact 420 

aaggtaaaac ctcagttcca ggagatttta agactatctg aagaaaacat tgattcctca 480 

gcaggaaatg gggtcctcac taaagctaca gtccccattt atgcaacagg agtccttacg 540 

tgttatattc aggaagaaga ccgaaaactg ttagttggat tcttagaaga tgtaatgacg 600 

ctgctttcat tatctcatgc tcctcttgat agcctgaagg cttcttttgt ggaattgggt 660 

gcaaacccag cctaccatga gttactatta actgttttgt ggtatggtgt tgtccatact 720 

tcagcactcg tgaggtgtac tgctgctaga atgtttgagg tatgtcaaca catgcctctg 780 

ttggtttcaa ttataatgat tttttttttt ttgcgaagaa gaagggaatt ttttttaata 840 

aaaaggcttt gcatatcaaa aaaaaaaaaa aaaaaaaaaa 880 

<210> 280 
<211> 1106 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (5).. (5) 
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<223> n equals a,t,g, or c 
<220> 

<221> misc__feature 
<222> (857) . • (857) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (1037) . . (1037) 
<223> n equals a,t,g, or c 



PCT/US02/08276 



<220> 

<221> misc_f eature 
<222> (1058) . . (1058) 
<223> n equals a,t,g, or c 

<400> 280 

tccancatta tgggatacat tgatgatcca gacaaatatc atcagggttt tgaattgttg 60 

ctgtcagcct tgggtgatcc ctcagaaaga gtagttagtg ctacacatca agtattttta 120 

ccagcttacg ctgcgtggac tacagaactt ggaaatttac agtctcatct tatacttaca 180 

ctactgaaca agattgaaaa acttctcagg gaaggagaac atggactgga tgaacacaaa 240 

ctccacatgt atctttctgc cttgcagtcc ttgatcccat ctctctttgc attagtgcta 300 

cagaatgcac ctttctccag caaagccaag cttcatggtg aagtgccaca gatagaagtg 360 

actaggtttc ctcggcctat gtcgcctctt caagatgtgt ccactattat cggaagtcgt 420 

gagcaattgg cagtgctgct gcaactttat gactaccagc tagaacaaga gggtacaaca 480 

ggctgggaga gtttactgtg ggttgtcaat caattgttgc cacaacttat agaaatagtt 540 

ggcaaaatta atgttacttc aactgcctgt gtccatgaat tctccagatt tttctggcgc 600 

ctttgccgga catttggcaa aatttttaca aacactaagg taaaacctca gttccaggag 660 

attttaagac tatctgaaga aaacattgat tcctcagcag gaaatggggt cctcactaaa 720 

gctacagtcc ccatttatgc aacaggagtc cttacgtgtt atattcagga agaagaccga 780 

aaactgttag ttggattctt agaagatgta atgacgctgc tttcattatc tcatgctcct 840 

cttgatagcc tgaaggnttc ttttgtggaa ttgggtgcaa accaggccta ccatgagtta 900 

ctattaactg ttttgkggta tggkgtkgkc catacttcag cactcgtgag gtgtactgct 960 

gctagaatgt ttgagctgtt ggtgaagggg gtgaatgaaa ctctggtagc tcagagggtt 1020 

gttcctgctc ttcattnact ctctccagtg gaccctgnaa atctctgtca ggattgccac 1080 

aatttccagc ctttgggact atttat 1106 

<210> 281 
<211> 646 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<222> (19) . . (19) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (544) . . (544) 
<223> n equals a,t,g, or c 

<400> 281 

cagatgccag ggacttggnc ttcccccggt tgaaccacag gttccaagaa acctgcaggg 60 

tccagcctcc cccccatccc cagtyttccc caccctggcc cggccctcca ggtgcagaaa 120 

catgcaggcc cctctccagg actgtgggag gagtgtgtcc ctcagactgg cctgtgtcct 180 

ggctcctctt accacctctt ccagaggttg tcacctgcag ctgccccagg ataaaggcaa 240 

ggccagarag gactcctgaa ctcctgtgtg cctggggtgg caggggcaaa catagccaac 300 

tggtggcctg agcggggcca tggtgargac acccttggtg gcttgtccca catcaagctg 360 

ggargtgaca cttaggatgc atttttcaat attttagtgt ttgaataacg ggctawcttg 420 
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agaaaaaaat aatttgaatc acacatcaca ccaaaaataa attctaggtg gattttaaca 480 

ctttccaaaa attattatta gtttagagac agggtctcac tccgtcgcct aggctggagt 540 

gcangggtat gatcatggtt cactgcaacc ttaaactccc tggcctcata tgatcccccc 600 

gggctccagc ccctccaaag ttactgggaa actaccaaac atgccc 646 

<210> 282 

<211> 1590 

<212> DNA 

<213> Homo sapiens 

<400> 282 

tttttttttt tttgtttaaa tgatacaact taattttatt aggacagacg ctggcggcca 60 

ccagaagttt gagcctcttt ggtagcagga ggctggaaga aaggacagaa gtagctctgg 120 

ctgtgatggg gatcttactg ggcctgctac tcctggggca cctaacagtg gacacttatg 180 

gccgtcccat cctggaagtg ccagagagtg taacaggacc ttggaaaggg gatgtgaatc 240 

ttccctgcac ctatgacccc ctgcaaggct acacccaagt cttggtgaag tggctggtac 300 

aacgtggctc agaccctgtc accatctttc tacgtgactc ttctggagac catatccagc 360 

aggcaaagta ccagggccgc ctgcatgtga gccacaaggt tccaggagat gtatccctcc 420 

aattgagcac cctggagatg gatgaccgga gccactacac gtgtgaagtc acctggcaga 480 

ctcctgatgg caaccaagtc gtgagagata agattactga gctccgtgtc cagaaacact 540 

cctcaaagct actcaagacc aagactgagg cacctacaac catgacatac cccttgaaag 600 

caacatctac agtgaagcag tcctgggact ggaccactga catggatggc taccttggag 660 

agaccagtgc tgggccagga aagagcctgc ctgtctttgc catcatcctc atcatctcct 720 

tgtgctgtat ggtggttttt accatggcct atatcatgct ctgtcggaag acatcccaac 780 

aagagcatgt ctacgaagca gccagggcac atgccagaga ggccaacgac tctggagaaa 840 

ccatgagggt ggccatcttc gcaagtggct gctccagtga tgagccaact tcccagaatc 900 

tgggcaacaa ctactctgat gagccctgca taggacagga gtaccagatc atcgcccaga 960 

tcaatggcaa ctacgcccgc ctgctggaca cagttcctct ggattatgag tttctggcca 1020 

ctgagggcaa aagtgtctgt taaaaatgcc ccattaggcc aggatctgct gacataattg 1080 

cctagtcagt ccttgccttc tgcatggcct tcttccctgc tacctctctt cctggatagc 1140 

ccaaagtgtc cgcctaccaa cactggagcc gctgggagtc actggctttg ccctggaatt 1200 

tgccagatgc atctcaagta agccagctgc tggatttggc tctgggccct tctagtatct 1260 

ctgccggggg cttctggtac tcctctctaa ataccagagg gaagatgccc atagcactag 1320 

gacttggtca tcatgcctac agacactatt caactttggc atcttgccac cagaagaccc 1380 

gagggaggct cagctctgcc agctcagagg accagctata tccaggatca tttctctttc 1440 

ttcagggcca gacagctttt aattgaaatt gttatttcac aggccagggt tcagttctgc 1500 

tcctccacta taagtctaat gttctgactc tctcctggtg ctcaataaat atctaatcat 1560 

aacagcaaaa aaaaaaaaaa aaaactcgag 1590 

<210> 283 
<211> 1179 
<212> DNA 

<213> Homo sapiens 
<400> 283 

gggctgcagg aattcggcac gagtttaaag ggtgactcgt cccacttgtg ttctctctcc 60 

tggtgcagag ttgcaagcaa gtttatcgga gtatcgccat gaagttcgtc ccctgcctcc 120 

tgctggtgac cttgtcctgc ctggggactt tgggtcaggc cccgaggcaa aagcaaggaa 180 

gcactgggga ggaattccat ttccagactg gagggagaga ttcctgcact atgcgtccca 240 

gcagcttggg gcaaggtgct ggagaagtct ggcttcgcgt tcgactgccg caacacagac 300 

cagacctact ggtgtgagta cagggggcag cccagcatgt gccaggcttt cgctgctgac 360 

cccaaatctt actggaatca agccctgcag gagctgaggc gccttcacca tgcgtgccag 420 

ggggccccgg tgcttaggcc atccgtgtgc agggaggctg gaccccaggc ccatatgcag 480 

caggtgactt ccagcctcaa gggcagccca gagcccaacc agcagcctga ggctgggacg 540 

ccatctctga ggcccaaggc cacagtgaaa ctcacagaag caacacagct gggaaaggac 600 

tcgatggaag agctgggaaa agccaaaccc accacccgac ccacagccaa acctacccag 660 

cctggaccca ggcccggagg gaatgaggaa gcaaagaaga aggcctggga acattgttgg 720 

aaacccttcc aggccctgtg cgcctttctc atcagcttct tccgagggtg acaggtgaaa 780 

gacccctaca gatctgacct ctccctgaca gacaaccatc tctttttata ttatgccgct 840 

ttcaatccaa cgttctcaca ctggaagaag agagtttcta atcagatgca acggcccaaa 900 

ttcttgatct gcagcttctc tgaagtttgg aaaagaaacc ttcctttctg gagtttgcag 960 
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agttcagcaa tatgataggg aacaggtgct gatgggccca agagtgacaa gcatacacaa 1020 

ctacttatta tctgtagaag ttttgctttg ttgatctgag ccttctatga aagtttaaat 1080 

atgtaacgca ttcatgaatt tccagtgttc agtaaatagc agctatgtgt gtgcaaaata 1140 

aaagaatgat ttcagaaaaa aaaaaaaaaa aaaactcga 1179 

<210> 284 
<211> 819 
<212> DNA 

<213> Homo sapiens 
<400> 284 

gaattcggca cgaggagaat catgggcctc tggctgggca tgctggcctg tgtcttcctg 60 

gcaactgctg cctttgttgc ttatactgcc cggctggact ggaagcttgc tgcagaggag 120 

gctaagaaac attcaggccg gcagcagcag cagagagcag agagcactgc aaccagacct 180 

gggcctgaga aagcagtcct atcttcagtg gctacaggca gttcccctgg cattaccttg 240 

acaacgtatt caaggtctga gtgccacgtg gacttcttca ggactccaga ggaggcccac 300 

gccctttcag ctcctaccag cagactatca gtgaaacagc tggtcatccg ccgtggggct 360 

gctctggggg cggcgtcagc acactgatgg tggggctcac ggtcaggatc ctagccacca 420 

ggcactagca aagaagcttg gaaatagaaa gccaggagtg gctgtcccca gtatgcaaac 480 

acaccacggt ctgccctgca aaaacaccaa tggggtctag tgcaggtgga cactttgaac 540 

cactcctcaa aaaaagaact ttggctgaty ccttgtggtg acactcagag gggtctgaac 600 

agacttgaca attctgttct ggtcaagctg gagttttctt ctgtgacttg gactgctcta 660 

cagaagacat cagccaactg cacgagtcag agtccaggga ttgtcactat tattaataat 720 

gtaaatggct tcaaatggga cactgcagat aammycacaa aaaccactgt tatattaaag 780 

attacacatt tcctggaaaa aaaaaaaaaa aaaactcga 819 

<210> 285 

<211> 1792 

<212> DNA 

<213> Homo sapiens 

<400> 285 

ggcacgaggt tgtttgagtt tggtttggag caaaactgag gtagtcctaa catttctggg 60 

actgaatcca ggcaagagaa agaagaaaaa gaagaagaaa aagaggagga aaaagtggat 120 

tacacaatga catggagaat gggaccccgt ttcactatgc tgttggccat gtggctagtg 180 

tgtggatcag aaccccaccc ccatgccact attagaggca gccacggagg acggaaagtg 240 

cctttggttt ctccggacag cagtaggcca gctcggtttc tgaggcacac tgggaggtct 300 

cgcggaattg agagatccac tctggaggaa ccaaaccttc agcctctcca gagaaggagg 360 

agtgtgcccg tgttgagact agctcgccca acagagccgc cagcccgctc ggacatcaat 420 

ggggccgccg tgagacctga gcaaagacca gcagccaggg gctctccgcg tgagatgatc 480 

agagatgagg ggtcctcagc tcggtcaaga atgttgcgtt tcccttcggg gtccagctct 540 

cccaacatcc ttgccagctt tgcagggaag aacagagtat gggtcatctc agcccctcat 600 

gcctcggaag gctactaccg cctcatgatg agcctgctga aggacgatgt gtactgtgag 660 

ctggcggaga ggcacatcca acagattgtg ctcttccacc aggcaggaga ggaaggaggc 720 

aaggtgagaa ggatcaccag cgagggccag atcctggagc agcccctgga ccctagcctc 780 

atccctaagc tgatgagctt cctgaagctg gagaagggca agtttggcat ggtgctgctg 840 

aagaagacgc tgcaggtgga ggagcgctat ccatatcccg ttaggctgga agccatgtac 900 

gaggtcatcg accaaggccc catccgtagg atcgagaaga tcaggcagaa gggctttgtc 960 

cagaaatgta aggcctctgg tgtagagggc caggtggtgg cggaggggaa tgacggtgga 1020 

gggggagcag gaaggccaag cctgggcagc gagaagaaga aagaggaccc aaggagagca 1080 

caagtcccac caaccagaga gagtcgggtg aaggtcctga gaaaactggc cgccactgca 1140 

ccagcttttc cccaacctcc ctcaaccccc agagccacca cccttcctcc tgccccagcc 1200 

acaacagtga ctcggtccac gtcccgggcg gtaacagttg ctgcaagacc tatgaccacc 1260 

actgcctttc ccaccacgca gaggccctgg accccctcac cctcccacag gccccctaca 1320 

accactgagg tgatcactgc caggagaccc tcagtttcag agaatcttta ccctccatcc 1380 

cggaaggatc agcacaggga gaggccacag acaaccagga ggcccagcaa ggccaccagc 1440 

ttggagagct tcacaaatgc ccctcccacc accatctcag aacccagcac aagggctgct 1500 

ggcccaggcc gtttccggga caaccgcatg gacaggcggg aacatggcca ccgagaccca 1560 

aatgtggtgc caggtcctcc caagccagca aaggagaaac ctcccaaaaa gaaggcccag 1620 

gacaaaattc ttagtaatga gtatgaggaa gtatgacctc agccggccta ctgcctctca 1680 

gctggaggac gagctgcagg tggggaatgt tccccttaaa aaagcaaagg agtctaaaaa 1740 
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gcatgaaaag cttgagaaac cagagaagga gaagaaaaaa aaaaaaaaaa aa 

<210> 286 
<211> 1673 
<212> DNA 

<213> Homo sapiens 



1792 



<400> 286 

ggcacgagag 

cagaacccca 

tttctccgga 

ttgagagatc 

ccgtgttgag 

ccgtgagacc 

aggggtcctc 

tccttgccag 
aaggctacta 
agaggcacat 
gaaggatcac 
agctgatgag 
cgctgcaggt 
tcgaccaagg 
gtaaggcctc 
caggaaggcc 
caccaaccag 
ttccccaacc 
tgactcggtc 
tttcccacca 
tgagggtgat 

gaaggatcag 
tggagagctt 
gcccaggccg 
atgtggtgcc 
acaaaattct 
agctggagga 
agcatgaaaa 



aatgggaccc 
cccccatgcc 
cagcagtagg 
cactctggag 
actagctcgc 
tgagcaaaga 
agctcggtca 
ctttgcaggg 
ccgcctcatg 
ccaacagatt 
cagcgagggc 
cttcctgaag 
ggaggagcgc 
ccccatccgt 
tggtgtagag 
aagccagggc 
agagagtcgg 
tccctcaacc 
cacgtcccgg 
cgcagaggcc 
cactgccagg 
cacagggaga 
cacaaatgcc 
tttccgggac 
aggtcctccc 
tagtaatgag 
cgagctgcag 
gcttgagaaa 



cgtttcacta 
actattagag 
ccagctcggt 
gaaccaaacc 
ccaacagagc 
ccagcagcca 
agaatgttgc 
aagaacagag 
atgagcctgc 
gtgctcttcc 
cagatcctgg 
ctggagaagg 
tatccatatc 
aggatcgaga 
ggccaggtgg 
agcgagaaga 
gtgaaggtgc 
cccagagcca 
gcgggaaaca 
ctggaccccc 
agaccctcag 
ggccacagac 
cctcccacca 
aaccgcatgg 
aagccagcaa 
tatgaggaga 
gtggggaatg 
ccagagaagg 



tgctgttggc 
gcagccacgg 
ttctgaggca 
ttcagcctct 
cgccagcccg 
ggggctctcc 
gtttcccttc 
tatgggtcat 
tgaaggacga 
accaggcagg 
agcagcccct 
gcaagtttgg 
ccgttaggct 
agatcaggca 
tggcggaggg 
agaaagagga 
tgagaaaact 
ccacgcttac 
gatgctgcaa 
tcacccttcc 
tttccagaga 
aaccaggagg 
ccatctcaga 
acaggcggga 
aggagaaacc 
agtatgacct 
ttccccttaa 
agaagaaaaa 



catgtggcta 
aggacggaaa 
cactgggagg 
ccagagaagg 
ctcggacatc 
gcgtgagatg 

ggggtccagc 

ctcagcccct 
tgtgtactgt 
agaggaagga 
ggaccctagc 
catggtgctg 
ggaagccatg 
gaagggcttt 
gaatgacggt 
cccaaggaga 
ggccgccact 
tcctgcccca 
gacctatgac 
cacaggcccc 
atctttaccc 
cccagcaagg 
acccagcaca 
acatggccac 
tcccaaaaag 
cagccggcct 
aaaagcaaag 
aaaaaaaaaa 



gtgtgtggat 
gtgcctttgg 
tctcgcggaa 
aggagtgtgc 
aatggggccg 
atcagagatg 
tctcccaaca 
catgcctcgg 
gagcbggcgg 
ggcaaggtga 
ctcatcccta 
ctgaagaaga 
tacgaggtca 
gtccagaaat 
ggagggggag 
gcacaagtcc 
gcaccagctt 
gccacaacag 
caccactggc 
ctacaaccac 
tccattcccg 
cccaccagct 
agggctgctg 
cgagacccaa 
aaggcccagg 
actgcctctc 
gagtctaaaa 
aaa 



<210> 287 

<211> 2084 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (775) . . (775) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (2080) . . (2080) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (2083) . . (2083) 
<223> n equals a,t,g, or c 

<400> 287 

ggcagagggc catttcctgc aaagagccaa acccccattc ctctgtgccc ctcctctccc 
accaagtgct ttataaaaat agctcttgtt accggaaata actgttcatt tttcactcct 
ccctcctagg tcacactttt cagaaaaaga atctgcatcc tggaaaccag aagaaaaata 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1673 



60 
120 
180 
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tgagacgggg aatcatcgtg tgatgtgtgt sctgcctttg gctgagtgtg tggagtcctg 240 

ctcaggtgtt aggtacagtg tgtttgatcg tggtggcttg aggggaaccg cttgttcaga 300 

gctgtgactg cggctgcact gcagagaagc tgcccttggc tgctcgtagc gccgggcctt 360 

ctctcctcgt catcatccag agcagccagt gtccgggagg cagaaggtac cggggcagct 420 

actggaggac tgtgcgggcc tgcctgggct gccccctccg ccgtggggcc ctgttgctgc 480 

tgtccatcta tttctactac tccctcccaa atgcggtcgg cccgcccttc acttggatgc 540 

ttgccctcct gggccttctc gcaggcactg aacatcctcc tgggcctcaa gggcctggcc 600 

ccagctgaga tctctgcagt gtgtgaaaaa gggaatttca acgtggccca tgggctggca 660 

tggtcatatt acatcggata tctgcggctg atcctgccag agctccaggc ccggattcga 720 

acttacaatc agcattacaa caacctgcta cggggtgcag tgagccagcg gtgtnatatt 780 

ctcctcccat tggactgtgg ggtgcctgat aacctgagta tggctgaccc caacattcgc 840 

ttcctggata aactgcccca gcagaccggt gaccgtgctg gcatcaagga tcgggtttac 900 

agcaacagca tcfcatgagct tctggagaac gggcagcggg cgggcacctg tgtcctggag 960 

tacgccaccc ccttgcagac tttgtttgcc atgtcacaat acagtcaagc tggctttagc 1020 

ggggaggata ggcttgagca ggccaaactc ttctgccgga cacttgagga catcctggca 1080 

gatgcccctg agtctcagaa caactgccgc ctcattgcct accaggaacc tgcagatgac 1140 

agcagcttct cgctgtccca ggaggttctc cggcacctgc ggcaggagga aaaggaagag 1200 

gttactgtgg gcagcttgaa gacctcagcg gtgcccagta cctccacgat gtcccaagag 1260 

cctgagctcc tcatcagtgg aatggaaaag cccctccctc tccgcacgga tttctcttga 1320 

gacccagggt caccaggcca gagcctccag tggtctccaa gcctctggac tgggggctct 1380 

cttcagtggc tgaatgtcca gcagagctat ttccttccac agggggcctt gcagggaagg 1440 

gtccaggact tgacatctta agatgcgtct tgtccccttg ggccagtcat ttcccctctc 1500 

tgagcctcgg tgtcttcaac ctgtgaaatg ggatcataat cactgcctta cctccctcac 1560 

ggttgttgtg aggactgagt gtgtggaagt ttttcataaa ctttggatgc tagtgtactt 1620 

agggggtgtg ccaggtgtct ttcatggggc cttccagacc cactccccac ccttctcccc 1680 

ttcctttgcc cggggacgcc gaactctctc aatggtatca acaggctcct tcgccctctg 1740 

gctcctggtc atgttccatt attggggagc cccagcagaa gaatggagag gaggaggagg 1800 

ctgagtttgg ggtattgaat cccccggctc ccaccctgca gcatcaaggt tgctatggac 1860 

tctcctgccg ggcaactctt gcgtaatcat gactatctct aggattctgg caccacttcc 1920 

ttccctggcc ccttaagcct agctgtgtat cggcaccccc accccactag agtactccct 1980 

ctcacttgcg gtttccttat actccacccc tttctcaacg gtcctttttt aaagcacatc 2040 

tcagattaaa aaaaaaaaaa aaaaaaaaaa agggggggcn gent 2084 

<210> 288 
<211> 720 
<212> DNA 
<213> Homo sapiens 

<400> 288 

ggcacgagat ttctcacaat gacaaattct caaatattgc taatagtact gtggattttc 60 

ctacattggt aaattgaagg aattgctaaa tgetgaatte agcaaccagt ttgagattgt 120 

tgaaaataaa gattgtttct ttttcaatgc aagttcacag atcactggag ttctagctac 180 

agtttgttct agaccagagg ttgeagatat ttttgtccta taaagagaca catggttaat 240 

atttttggct ttgtgagttg tatagttttc gttgtagctg ttcagctctg ctacatgaag 300 

caaccataga ccatacctta acaagtggtc acttttgagt accaataaaa ctttatttag 360 

aaataacaga gggctggatt tggtcctagt ttgetgaace cttttctaga tgaaggctcc 420 

tettgecaag actggctccc taccttggct gacaaattct cactttggga cttagtcatt 480 

gttgetgetc tctgttattt tgcatgtctt ttctcatgtt taggtgctgt gtcttaatac 540 

ttttttctta catttaattt aacaatcatt actgageget ggtatgtcta gtttcttttc 600 

tcttctttcc tccttttctt ttcttttttt ctttttcttt atttgaaggc tctcactctg 660 

tcactccagc ctgggtggca gaccaggacc ctgtctctaa aaaaaaaaaa aaaaaaaaaa 720 

<210> 289 
<211> 738 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (646) . . (646) 
<223> n equals a,t,g, or c 
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<220> 

<221> mi sc_f eature 
<222> (670) . . (670) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (696) . . (696) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (707) . . (707) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (718) . . (718) 
<223> n equals a,t,g, or c 



<400> 


289 














catcagacca 


cacccagcag 


tcagaaaaga 


ggtgcagggg 


cccgggctgg gacagtgaag 


60 


agtgctgggc 


agtctgtggt 


cctctgtatc 


tcaacttttt 


catcttaaaa 


aaacaaatag 


120 


ggttgtgtgt 


gtggctggtg gtcataaggt 


cctttctggc 


tctaataacc 


tgagcttctg 


180 


ttatgaagct 


gggaccctta 


gagcctcagg 


atgatcctct 


gtttgtttgt 


gaagccccaa 


240 


tcaggtgcta 


agcaccatag 


tggcacttag 


ctgaagctcc 


tctgtaactc 


ctgtgggccc 


300 


tgccttgccc 


acccccgaca 


gctgctgcag 


tgctcctgag 


cagcacaggc 


ctgatggagc 


360 


ttctggagaa 


gatgctggcc 


ctcaccttgg 


caaaggcaga 


ttctcccagg 


actgcactcc 


420 


tctgctctgc 


ctggctgctc 


actgcctcct 


tctctgccca 


gcagcacaag 


ggcagtttgc 


480 


aggttcacca 


gacactctct 


gtggaaatgg 


accargtatt 


gaaggctctc 


agctttccaa 


540 


agaaaaaggc 


tgcactactc 


tcaactgcca 


tcttatgctt 


cctgcggaca 


gccctgcgac 


600 


aaagcttttc 


ctctgcctgg 


aaccctggtg 


cccttaaggg 


cccagncact 


gcagccacca 


660 


aggacactgn 


cctaacttca 


ctgcgaatgt 


ccaagnccgg 


ccctggncat 


tgggctgnaa 


720 


aaacctcctg 


gtgcaaaa 










738 


<210> 


290 














<211> 


935 














<212> 


DNA 














<213> 


Homo 


sapiens 












<220> 
















<221> 


misc. 


_f eature 












<222> 


(6) . 


. (6) 












<223> 


n equals a,t,g, 


or c 










<220> 
















<221> 


misc. 


_f eature 












<222> 


(14) 


(14) 












<223> 


n equals a,t,g, 


or c 










<220> 
















<221> 


misc. 


.feature 












<222> 


(16) 


. . (16) 












<223> 


n equals a,t,g, 


or c 










<220> 
















<221> 


misc. 


_f eature 












<222> 


(50) 


. - (50) 












<223> 


n equals a,t,g, 


or c 
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<220> 

<221> misc_f eature 

<222> (95).. (95) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (101) . . (101) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (139) . . (139) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (176) . . (176) 
<223> n equals a,t,g, or c 

<400> 290 

ggccancttt tttntngggg aaaaaatggg acccaaaagt tatttgaaan gggctttttc 60 

aagctttttc ccaaaaggaa aaaaagggtt gccantaatt nttcaaggat tgcccatctt 120 

taatgctttc cttggggana agccttgcga caaaagcttt ttccttctgc cctggnagcc 180 

ctggtgcctt caggggccca gccactgcca gccaccaagg acactgtcct agctccactg 240 

cgaatgtcgc aagtccggtc cctggtcatt gggctgcaga acctcctggt gcagaaggac 300 

cctctattgt cccaggcctg tgttggctgc ctggaggcct tgcttgacta cctggatgcc 360 

cggagcccag acattgctct ccacgtggcc tcccagcctt ggaatcggtt tttgctgttt 420 

accctcttgg atgctggaga gaattccttc ctcagacctg agattttgag gctcatgacc 480 

ctgtttatgc ggtaccggag tagcagtgtc ctctctcatg aagaggtggg tgatgttctg 540 

caaggtgtgg ctttggctga cctgtctacc ctctcgaaca ccacactcca ggccctgcat 600 

ggcttcttcc agcagctcca gagcatggga cacctggctg accacagcat ggcccagacc 660 

ctgcaggcct ccttggaggg ccttccccct agcacctcct caggccagcc acccctgcag 720 

gacatgctct gcctgggagg ggtggctgta tccctgtccc acatcagaaa ctgatcctca 780 

ggacttgaag gcccagaagt ggagagagaa tgagacctgg agacaaaggg cataattgtt 840 

ggggaaatgg atgacagctg aagctattca tatggagcca tatactctat tgttgaaata 900 

gaataaggaa ataaaatgat acactcacaa aaaaa 935 

<210> 291 
<211> 871 
<212> DNA 

<213> Homo sapiens 



<400> 291 



ggcacgaggg aacccagaag atgctgcctc tcctgatcat ctgtctcctg cctgccattg 
aagggaagaa ctgcctccgc tgctggccag aactgtctgc cttgatagac tatgacctgc 
agatcctctg ggtgacccca gggccaccca cagaactttc tcaaagtatt cactccttgt 
tcctagagga taataatttt ctcaaaccct ggtaccttga tcgtgaccat ttggaagaag 
aaacagccaa attcttcact caagtacacc aagccattaa aacgttacga gatgataaaa 
cagtacttct ggaagagatc tacacgcaca agaatctctt tactgagagg ctgaataaga 
tatctgatgg gctgaaggag aaggagcccc acccctctcc atgaatgcct tcccggctcc 
atctcctact tgcaccccag aaccccttgg cttctgtctg cctccccagc acctcagttt 
ctctaccttc tcaccctccc tggcagcctg caatgagtcc tgtgccagga accggcggac 
ctccctgtgg gctgtgagtc tcagcagtgc tctactcctg gccatagctg gagatgtttc 
ttttactggc aaaggaagaa ggaggcagta aaggaacagg gcagcccgca tgtcttccag 
aagtgaacag aggccgcagc taccaccgtc acaaagttca ctcatctctg ggtcccggtg 
accccatccc cccataccct ccatcctggg tcctggggcc ccaaagctct gaggcctagg 
agactgcgct gtctcgtggt ttgcctactc ctacaccttt gtaaagagtc tcttcattaa 
aacccctctt cataaaaaaa aaaaaaaaaa a 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
871 



<210> 292 
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<211> 881 
<212> DNA. 
<213> Homo 



sapiens 



<400> 292 

gaattcggca 

ccattgaagg 

acctgcagat 

ccttgttcct 

aagaagaaac 

ataaaacagt 

ataagatatc 

cggctccatc 

cagtttctct 

cggacctccc 

gtttctttta 

tccagaagtg 

cggtgacccc 

ctaggagact 

attaaaaccc 



cgagggaacc cagaagatgc 
gaagaactgc ctccgctgct 
cctctgggtg accccagggc 
agaggataat aattttctca 
agccaaattc ttcactcaag 
acttctggaa gagatctaca 
tgatgggctg aaggagaagg 
tcctacttgc accccagaac 
accttctcac ctccctggca 
tgtgggctgt gagtctcagc 
ctggcaaagg aagaaggagg 
aacagaggcc gcagctacca 
atccccccat accctccatc 
gcgctgtctc gtggtttgcc 
ctcttcataa aaaaaaaaaa 



tgcctctcct gatcatctgt 
ggccagaact gtctgccttg 
cacccacaga actttctcaa 
aaccctggta ccttgatcgt 
tacaccaagc cattaaaacg 
cgcacaagaa tctctttact 
gagccccacc cytctccatg 
cccttggctc tgtctgcctc 
gcctgcaatg agtcctgtgc 
agtgctctac tcctggccat 
cagtaaagga acagggcagc 
ccgtcacaaa gttcactcat 
ctgggtcctg gggccccaaa 
tactcctaca cctttgtaaa 
aaaaaactcg a 



ctcctgcctg 
atagactatg 
agtattcact 
gaccatttgg 
ttacgagatg 
gagaggctga 
aatgccttcc 
cccagcacct 
caggaaccgg 
agctggagat 
ccgcatgtct 
ctctgggtcc 
gctctgaggc 
gagtctcttc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
881 



<210> 293 
<211> 1598 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (1067) . . (1067) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (1069) . . (1069) 
<223> n equals a,t,g, or c 

<220> 

<221> misc„feature 
<222> (1577) . . (1577) 
<223> n equals a,t,g, or c 

<400> 293 

aggaaagaac aaaggttatt tcctggagaa aagacaattt attcaacacc aacragggac 60 

tcatcatatg ggcacaactc tggtgtcctt ctatggagaa aacctcaagt aaagttttat 120 

tctgcctttr aaaatgcttc caaaagtaga ccctgtcccc acacaggtca agactacaga 180 

gaaggctttg tagaaatgtg tcacctatgt acacctgcta cttacacatt tcctcttttg 240 

gaaaaatgag atacttagaa taacargaaa attaagacat actggcctgg tgccagcaga 300 

tggcttttct atagacaaac taggttagtg tggaagatat aggttaaaat aaactatgct 360 

gttttattta tcttcccaac ctgattggca gctagacttt tttagggtct catttaatgg 420 

ccctgttttt ttcattatta tatttaatga tagggcagga tttcgtatgc aagctcttgt 480 

ttctcaggct gcctgcagaa gaagtcgcta taaattatct gttgtctaca tggtacaagg 540 

cccattgact catctgatgc ttgttttgtt aatttcttta atatttttat cacggggcag 600 

tgggagggct tgggctttta gccacagctg ttttaagact tctgatctcc tgccctgcag 660 

gaataggtgg gaagtcattg aatttttaca ctatagtaat ttgcattccc acataagttt 720 

gagtgttacg aaaacattcc tttaaaggga tctgtgctac acaaaatatg ccaggacctc 780 

acagacaaag ccattgctag aaatgtcatt ccaatgatca gatctggaaa caggctgcca 840 

taaccacttt tccttcttgt agactcagct cacctgtata tttaaactgt tcttggcatc 900 

ttgaaacacc tatttctact caggtactca ttgtcctgtt actgattcac ctttctgatc 960 

cttttcaacc agttttcccc caagggggga aattttactt aacctctagt atttgaacaa 1020 

ctcaatattt gaattgttgc cccatttgct tttacctgta ctgtatncnt ggtcatctca 1080 

aatggcgtct aaacccagct actttgcatt ccagaagttt ccattccctc caattccacc 1140 
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taatttttca tctgtcctag ttactggctc tttcttcatg tcttatttct cttgctttgg 1200 

gagcttaaaa gattttacaa gacctaattt tgggttcctt ccttggagcc atagttaccc 1260 

tgccaagaag agtagaaaat gggttcaact cctgtttcgc tccaccaaca cctctgtgag 1320 

tctcatcatc agctgagcga tgatgcctta caggttgcat agcactggaa ctttcctaga 1380 

gtaacggctc tgctgccagg gtttctctgg gctcattctt ccactgactt aattatgatc 1440 

tatgcctaac agagccccag tacaactatt ttgcagaatg gctgttaccc tagaattact 1500 

atagcacata ttgagatata gttgtactcc ctagtagata ggaactgacc ccaacaataa 1560 

actttgataa taaaganaaa aaaaaaaaaa actcgtag 1598 

<210> 294 
<211> 530 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<222> (517) . . (517) 
<223> n equals a,t,g, or c 

<400> 294 

gcctggcaga gagactctga aatgagggat tagaggtgtt caaggagcaa gagcttcagc 60 

ctgaagacaa gggagcagtc cctgaagacg cttctactga gaggtctgcc atggcctctc 120 

ttggcctcca acttgtgggc tacatcctag gccttctggg gcttttgggc acactggttg 180 

ccatgctgct ccccagctgg aaaacaagtt cttatgtcgg tgccagcatt gtgacagcag 240 

ttggcttctc caagggcctc tggatggaat gtgccacaca cagcacaggc atcacccagt 300 

gtgacatcta tagcaccctt ctgggcctgc ccgctgacat ccaggctgcc caggccatga 360 

tggtgacatc cagtgcaatc tcctccctgg cctgcattat ctctgtggtg ggcatgagat 420 

gcacagtctt ctgccaggaa tcccgagcca aagacagagt ggcggtagca ggtggagtct 480 

ttttcatcct tggaagcctc ctgggattca ttcctgntgc ctggaatctt 530 

<210> 295 

<211> 1046 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> mi sc__f eature 

<222> (14) . . (14) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 

<222> (33) . . (33) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (441) . . (441) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (460) . . (460) 
<223> n equals a,t,g, or c 

<400> 295 

gagaagtcag cctngcagag agactctgaa atnagggatt agaggtgttc aaggagcaag 60 

agcttcagcc tgcaagacaa gggagcagtc cctgaagacg cttctactca ccactggtgc 120 

ctgacagcat gaaatttgag attggagagg ctctttactt gggcattatt tcttccctgt 180 

tctccctgat akctggaatc atcctctgct tttcctgctc atsccagaga aatcgctcca 240 

actactacga tgcctaccaa gcccaacctc ttgccacaag gagctctcca aggcctggtc 300 
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aacctcccaa agtcaagagt gagttcaatt cctacagcyt gacagggtat gtgtgaagaa 360 

ccaggggcca garctggggg ktggctgggt ctgtgaaaaa cagtggacag caccccgagg 420 

ccacaggtga gggacattac nactggatcg tgtcagaagn tgctgctgag gatagactga 480 

ctttggccat tggattgagc aaaggcagaa atgggggcta gtgtaacagc atgcaggttg 540 

aattgccaag gatgctcgcc atgccagcct ttctgttttc ctcaccttgc tgstcccctg 600 

ccctaagtcc ccaaccctca acttgaaacc ccattccctt aagccaggac tcagaggatc 660 

cctttgccct ctggtttacc tgggactcca tccccaaacc cactaatcac atcccactga 720 

ctgaccctct gtgatcaaag accctctctc tggctgaggt tggctcttag ctcattgctg 780 

gggatgggaa ggagaagcag tggcttttgt gggcattgct ctaacctact tctcaagctt 840 

ccctccaaag aaactgattg gccctggaac ctccatccca ctcttgttat gactccacag 900 

tgtccagact aatttgtgca tgaactgaaa taaaaccatc ctacggtatc cagggaacag 960 

aaagcaggat gcaggatggg aggacaggaa ggcagcctgg gacatttaaa aaaaaaaaaa 1020 

aaaaaactcg agggggggcc cggtac 1046 

<210> 296 
<211> 819 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> (786).. (786) 

<223> n equals a,t,g, or c 

<220> 

<221> mis cofeature 
<222> (819).. (819) 
<223> n equals a,t,g, or c 

<400> 296 

aattcggcac gagctcaaag agtaagaatc caagtgtgtg acattacata gctttgcatc 60 

tatggaaacc taaatcataa ttgtttccac tgcccaatta tgttcctttt cataacattt 120 

actattctgg ctatatttat catagaacct aggaacctta gagttgacct gaatctaatt 180 

aaatttcaga cctcctggcc aaagacccta gtggaagagc aaaactaaat caacatatta 240 

ccaatctcaa gtatttctct gaggacccag accactgact ttttgttgtc attttcaggt 3 00 

tgatcctata actgtatgtt ctacaatatc tgtgctccac cagctcagtg aggaatcaac 360 

ggaatatcaa aagtaaatat tggtcaccat ataccttttg gtactagtct acgaaataat 420 

tggctgagga actgtttcat attaaagaaa agctaaaagc aatgtgtgat cttagattag 480 

acctatgatt ggaatgtatg tatattttat atacaaaata ttgaggaaat tgacaaaatt 540 

taaatacaga atatggatta gataatagga atgtatcaag gtcaatattt aaaaagataa 600 

tttcaacttt tattttattc agtgggtaca tgtgcagact ttgttttaca tagtacccaa 660 

cagtttttca acgcttatcc cccaccctct agtaatctgc agwgwctatt attgycatct 720 

tcgtggctat tgtacatggg atccatactt 4 gattttgctc tcaacatgaa cattattggt 780 

gtaganaaat gccactaagt tttkgtacgt tggcttttn 819 

<210> 297 
<211> 501 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_ feature 

<222> (1)..(2) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 

<222> (12) . . (12) 

<223> n equals a,t,g, or c 

<400> 297 
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nnccccccaa tnatattttt ccaaattaat tccaacatag gaaggattcc accttcctag 60 

tatgttttca aattgtttca aacctgacct ctttttgatt gctctacctt ccaaaagaaa 120 

agaagggaac actaattttc ttycctgatt tacttcattg ttttcttctg ttagattaac 180 

tttacctata aaagattgtc tcttgacttt atatatatat atatgtgtgt gtgtgtgtgt 240 

gtgtgtgtgt gtgtgtgtgt gtgtgtgtat ttgaagagga catgtgcctc cataaaagga 300 

aataaaatga gagaatacat tattgatttt gtgaaatcaa aatatttgaa ttatggtttc 360 

tcaatattca aaaactcttg cagtttctgt acttatttct tctgatgcat agagtttcgg 420 

ggactacata tgtttcacaa ccaaagatat ccacttgaaa taaaaacatt ataaagttaa 480 

aaaaaaaaaa aaaaactcga g 501 

<210> 298 

<211> 3306 

<212> DNA 

<213> Homo sapiens 

<400> 298 

ccacgcgtcc ggcccagggc tgtctgtctc caaagcccaa ccataactca catccccatt 60 

ccagctcctc tgggtgagtc tgttccccct cagcctcact ttccttatcc tgtcaaatga 120 

aggatttgga atgacttaag ttattcaagc aacaaacact tactgaattg tcttgccact 180 

tccagggtga cattatggag ttctgtgatt ctgcaagagg ccagagggga caaggtcaag 240 

tgggtgttca cctggcccct catcttcctc ctgtgcgtca ccattcccaa ctgcagcaag 300 

ccccgctggg agaagttctt catggtcacc ttcatcaacg ccacgctgtg gatcgctgtg 360 

ttctcctaca tcatggtgtg gctggtgact attatcggat acacacttgg gatcccggat 420 

gtcatcatgg gcattacttt cctggcagca ggacaagtgt tccagactgc atggccagcc 480 

taattgtggc gagacaaggc cttggggaca tggcagtctc caacaccata gaagcaacgt 540 

gtttgacatc ctggtaggac ttggtgtacc gtggggcctg cagaccatgg ttgttaatta 600 

tggatcaaca gtgaagatca acagccgggg gctggtctat tccgtggtcc tgttgctggg 660 

ctctgtcgct ctcaccgtcc tcggcatcca cctaaacaag tggcgactgg accggaagct 720 

gggtgtctac gtgctggttc tctacgccat cttcttgtgc ttctccataa tgatagagtt 780 

taacgtcttt accttcgtca acttgccgat gtgccgggaa gacgattagc gctgagtcgc 840 

ggcccctggg agctgatctg gacaccctgt gacactggcg tcctcctctc ccctccttcc 900 

cccaccacag gtctctcctg cataggcagc cactgtccgt tctttcacac actggaagga 960 

agagccatcg tggtctttgt ctggccacag ccaagctgct gggcatcctc ctcctccttg 1020 

gagttccacc cctgcaaggc tggatttggg ggccattatc tgagcagctt caaagacccc 1080 

tgagctgcca accacggaga tgtgccaagc atctcatctc tcctgcacac tttagtcaga 1140 

aggacttctg catgcagttt gtctttctgt tctgcaggca gcttcagaat tgaggtcatt 1200 

tgtgagcaca agatctcata gggcaggtgc aaaataggaa tgttgttctc aagtgtcacc 1260 

tccagcccag aggtggttcc ttaggcagca tgtgctcctg ggagcctctg acttttgctg 1320 

gaagcaccca cagtttggaa ggggcaagac ctcaacctgt tggggtttag ggcccatgat 1380 

ggcagacatt ctaccccttt tcctggaaaa actggaagaa tgaaaataat ttttttctgt 1440 

ggaagagaga aaatgagtga atattcttct cacttttatt gatgcattca gagaataagc 1500 

aatgaaatat taaaaaatga aacatcatat aggtcatcat acttgaaaat tatcattcca 1560 

tatgaaagga tcatgataca caccaaaaaa gtaatgatcg taaagacaca aatcctctgt 1620 

atgccatctt gcattggcac tgaggtgttt ggtttggaat agggaaaaag agacaggatc 1680 

tcgctgtgtt ccccaggtag gtcttgaact cctggcctca agtgatcctc ctgccttgac 1740 

ctcccaaagt gctggattac aagcgtgagc ccctgcaccc ggcccaagca gttgcttctt 1800 

tttttctctt tttttttttt tttgagatgg agcctcactc tgttgcccag gctggagtgc 1860 

agtggcgcga tctccactca ctgcaagctc cgcctcccgg gttcatgcca ttctcctgcc 1920 

tcagcctccc gagtagctgg gactacaggc gcctgccacc acacccagct aattttttgt 1980 

atttttggta cagacagggt ttcaccgtgt tagccaggat ggtcttgatc tctgatctcg 2040 

gatccgccac cccggcctcc aaagtgctgg attacaagcg tgagccaccg ggccccgcca 2100 

agcagttgct tcttatgcaa catgttgggt gggacttgtc cacgggccag gccaataaaa 2160 

ttctfcaatcc tgcagagagc agtaccctca tcaccccatc actggaaaac aaatgtttaa 2220 

gctatcaaga gagggaatgt gcagcttggt tctagatgca tggtttggag gatctacctt 2280 

ggcctaaagg gaatgtccca aacaacagag ccttctttgc tgcactccag aattctctac 2340 

acagaatttc ccaagtccat tcaggacaga cgcgcagtcc tctttcaatg gaagaagaga 2400 

ggacttttcc cctcctgaaa aatgactgga gtgtgaacaa ggcagctctg tttttctaaa 2460 

taagttgttc ttgtgagttt tttctggcca ctgggcatct ctgccctcac ttttcatccc 2520 

tgccctctaa gctgcagacc ccatgaccac actgtctgct tccttgagct tcccgcacga 2580 

ggcttgcacc tgggggacct ggagaccctg cggacagaac tgtggctgag ccactgtggc 2640 

caactcttgg ggagctccac agtgggggtt gctggtctgt gaggctgagt ctccatttca 2700 
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gagcacacac tccctggcag ggcgcctccg cctgtgtctc ctgcccagca gccgccagca 2760 

gggaatagtt gctggtgtct gagcacaaag agagctttga ttacctagag aggaaaaagg 2820 

ctgtcagcca gatgcagcca ggcccagggg tagatacagg agttgctaag gaaggggccg 2880 

agccaggaga ggccaggcag atccacaaag cccaagggga tgcaggctgg gtgtggtttc 2940 

tgagggaacc taccaaatag caggtagatg gaatcagagg actcttgtgt cctgaaagaa 3000 

cctccttaaa aacaactaaa accaagaact tctggggctg ttcacacatt gttcaagtca 3060 

ccccaagatc gttctggcac gctgagctga acaccaccat ctttgttcat tctctctcta 3120 

atgggcaaag caggatcatc gagttgaaaa gttgtaaata atgaggatat ttatcccgct 3180 

atttattttt tcaataactg tgacctcctg cactgtgaat gctctgtgac atgagattct 3240 

tagtttaata aaactgtcat taaatttgaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3300 

aaaaaa 3306 

<210> 299 
<211> 2194 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> (441) . . (441) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eat ure 
<222> (987) . . (987) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (2034) (2034) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (2041) . . (2041) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (2121) . . (2121) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (2169) . . (2169) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (2184) . . (2184) 
<223> n equals a,t,g, or c 

<400> 299 

ggcccagggc tgtctgtctc caaagcccaa ccataactca catccccatt ccagctcctc 60 

tgggtgagtc tgttccccct cagcctcact ttccttatcc tgtcaaatga aggatttgga 120 

atgacttaag ttattcaagc aacaaacact tactgaattg tcttgccact tccagggtga 180 

cattatggag ttctgtgatt ctgcaagagg ccagagggga caaggtcaag tgggtgttca 240 

cctggcccct catcttcctc ctgtgcgtca ccattcccaa ctgcagcaag ccccgctggg 300 

agaagttctt catggtcacc ttcatcamcg ccacgctgtg gatcgctgtg ttctcctaca 360 

tcatggtgtg gctggtgact attatcggat acacacttgg gatcccggat gtcatcatgg 420 

gcattamttt cctggcagca nggacaagtg ttccagactg catggccagc ctaattgtgg 480 
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cgagacaagg ccttggggac atggcagtct ccaacacyat aaraagcaac gtgtttgaca 540 

tcctggtagg acttggtgta ccgtggggcc tgcagaccat ggttgttaat tatggatcaa 600 

cagtgaagat caacagccgg gggctggtct attccgtggt cctgttgctg ggctctgtcg 660 

ctctcaccgt cctcggcatc cacctaaaca agtggcgact ggaccggaag ctgggtgtct 720 

acgtgctggt tctctacgcc atcttcttgt gcttctccat aatgatagag tttaacgtct 780 

ttaccttcgt caacttgccg atgtgccggg aagacgatta gcgctgagtc gcggcccctg 840 

ggagctgatc tggacaccct gtgacactgg cgtcctcctc tcccctcctt cccccaccac 900 

aggtctctcc tgcataggca gccactgtcc gttctttcac acactggaag gaagagccat 960 

cgtggtcttt gtctggccac aggccangct gctgggcatc ctcctcctcc ttggagttcc 1020 

acccctgsaa ggcygatttg ggggccatta tctgagcagc ttcaaagacc cctgarctgc 1080 

caaccacgga gatgtgccaa gcatctcatc tctcctgcac actttagtca gaaggacttc 1140 

tgcatgcagt ttgtctttct gttctgcagg cagcttcaga attgaggtca tttgtgagca 1200 

caagatctca tagggcaggt gcaaaatagg aatgttgttc tcaagtgtca cctccagccc 1260 

agaggtggtt ccttaggcag catgtgctcc tgggagcctc tgacttttgc tggaagcacc 1320 

cacagtttgg aaggggcaag acctcaacct gttggggttt agggcccatg atggcagaca 1380 

ttctacccct tttcctggaa aaactggaag aatgaaaatm atttttttct gtggaagaga 1440 

gaaaatgagt gaatatyctt ctcactttta ttgatgcatt cagagaataa gcaatgaaat 1500 

attaaaaaat gaaacatcat ataggtcatc atacttgaaa attatcattc catatgaaag 1560 

gatcatgata cacaccaaaa aagtaatgat cgtaaagaca caaatcctct gtatgccatc 1620 

ttgcattggc actgaggtgt ttggtttgga atagggaaaa agagacagga tctcgctgtg 1680 

ttccccaggt aggtcttgaa ctcctggcct caagtgatcc tcctgccttg acctcccaaa 1740 

gtgctggatt acaagcgtga gcccctgcac ccggcgccaa gcagttgctt ctttttttct 1800 

cttttttttt ttttttgaga tggagcctca ctctgttgcc caggctggag tgcagtggcg 1860 

cgatctccac tcactgcaag ctccgcctcc cgggttcatg ccattctcct gcctcagcct 1920 

cccgagtagc tgggactaca ggcgcctgcc accacaccca gctaattttt tgtatttttg 1980 

gtacagacag ggtttcaccg tgttagccag gatggtcttg atctctgatc tcgngatccg 2040 

nccaccccgg ccttccaaag tgcttggatt acaagcgtga gccacccggg ccccgccaag 2100 

caagttgctt cttatgcaac natgttgggt tggggacttg gtccacgggg cccaggccca 2160 

ataaaaatnc tttaatccct gcanaagagg ccag 2194 

<210> 300 
<211> 207 
<212> PRT 
<213> Homo sapiens 

<400> 300 

Met lie Lys His Val Ala Trp Leu lie Phe Thr Asn Cys lie Phe Phe 
15 10 15 

Cys Pro Val Ala Phe Phe Ser Phe Ala Pro Leu He Thr Ala He Ser 

20 25 30 

He Ser Pro Glu He Met Lys Ser Val Thr Leu He Phe Phe Pro Leu 
35 40 45 

Pro Ala Cys Leu Asn Pro Val Leu Tyr Val Phe Phe Asn Pro Lys Phe 
50 55 60 

Lys Glu Asp Trp Lys Leu Leu Lys Arg Arg Val Thr Lys Lys Ser Gly 
65 ' 70 75 80 

Ser Val Ser Val Ser He Ser Ser Gin Gly Gly Cys Leu Glu Gin Asp 

85 90 95 

Phe Tyr Tyr Asp Cys Gly Met Tyr Ser His Leu Gin Gly Asn Leu Thr 

100 105 110 

Val Cys Asp Cys Cys Glu Ser Phe Leu Leu Thr Lys Pro Val Ser Cys 
115 120 125 

Lys His Leu He Lys Ser His Ser Cys Pro Ala Leu Ala Val Ala Ser 
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130 

Cys Gin Arg Pro Glu 
145 

His Ser Asp Tyr Ala 

165 

Asp Gin Val Gin Ala 

180 

Phe Pro Leu Val Arg 
195 



135 

Gly Tyr Trp Ser Asp Cys 
150 155 

Asp Glu Glu Asp Ser Phe 

170 

Cys Gly Arg Ala Cys Phe 

185 

Tyr Ala Tyr Asn Leu Pro 
200 



140 

Gly Thr Gin Ser Ala 

160 

Val Ser Asp Ser Ser 

175 

Tyr Gin Ser Arg Gly 
190 

Arg Val Lys Asp 
205 



<210> 301 
<211> 114 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (13) 

<223> Xaa equals any amino acid 
<400> 301 

Met Ala Gly Pro Arg Ala Ser Thr Gly Pro Arg Pro Xaa Cys Leu Val 
15 10 15 

Leu Phe Leu Phe Asn Phe lie Phe Cys Phe Met Ser Val Cys Pro Pro 

20 25 30 

Thr Pro Thr Pro Phe Ser Val Lys Trp Gly Ala Leu Gly Glu Ser Leu 
35 40 45 

Leu Pro Pro Ser Leu Ser Gin Asp Leu Pro Pro Arg His Gin Pro Ser 
50 55 60 

Leu Trp Thr Arg Gin Arg Ala Asp Arg Val Gly Arg Gly Leu Arg Val 
65 70 75 80 

Ala Arg Ala Ser Pro Pro Ala Asn Gly Pro Leu Leu Arg Pro Pro Val 

85 90 95 

Ser Pro Cys Pro Phe Leu Lys Gin Asn Ala Leu Val Cys Lys Pro Leu 

100 105 110 

Asp Ala 



<210> 302 
<211> 49 
<212> PRT 

<213> Homo sapiens 
<400> 302 

Met Arg Leu Cys Ser Phe Thr Lys Val Pro Met Asn Leu Phe Leu Asn 
15 10 15 



182 



« 



* 



I 



WO 02/076488 PCT/US02/08276 



Val He Leu Leu Lys Phe Tyr Asn Phe Leu Phe Ser Leu He Leu Gly 

20 25 30 

Lys Ser Cys Leu Ala Ser Leu Gly Leu Cys Lys Asn Asn Lys Cys Leu 
35 40 45 

Ser 



<210> 303 
<211> 62 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (16) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (54) 

<223> Xaa equals any amino acid 
<400> 303 

Met Val Thr Gly Phe Phe Phe He Leu Met Thr Val Leu Trp Phe Xaa 
15 10 15 

Arg Glu Pro Gly Phe Val Pro Gly Trp Asp Ser Phe Phe Glu Lys Lys 

20 25 30 

Gly Tyr Arg Thr Asp Ala Thr Val Ser Val Phe Leu Gly Phe Leu Leu 
35 40 45 

Phe Leu He Pro Ala Xaa Glu Ala Leu Leu Trp Glu Lys Glu 
50 55 60 



<210> 304 
<211> 122 
<212> PRT 

<213> Homo sapiens 
<400> 304 

Met Cys Tyr Leu Leu Leu Leu Leu lie Gin Thr Ala Glu Leu Leu He 
15 10 15 

His Pro Gin Gly Leu Gin Ala Val Ser Asn Gly Glu Ser Ala Leu Lys 

20 25 30 

Gly Thr Arg Pro Thr Phe Ser Ser Pro Phe He Leu Val Thr Glu Gly 
35 40 45 

Arg Lys Glu Trp Glu Gly Val Phe Leu Ser Ser Gly Trp Lys Gly Asn 
50 55 60 

Thr Leu Ser Asn Tyr Tyr He Ser Leu Val Phe Tyr Tyr Ser Arg He 
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65 70 75 80 

Leu Gin Pro Tyr Phe Tyr Cys Leu Trp Gly Lys Leu Glu Met Val Thr 

85 90 ' 95 

Leu lie Arg Ser Val Trp Arg Gly lie Asn Gly Gly Asp Lys lie Gin 

100 105 110 

Leu Val Leu Glu Asn Val Lys Val Leu Lys 
115 120 



<210> 305 

<211> 563 

<212> PRT 

<213> Homo sapiens 

. <400> 305 

Met Trp Ala Val Leu Arg Leu Ala Leu Arg Pro Cys Ala Arg Ala Ser 
1 5 10 15 

Pro Ala Gly Pro Arg Ala Tyr His Gly Asp Ser Val Ala Ser* Leu Gly 

20 25 30 

Thr Gin Pro Asp Leu Gly Ser Ala Leu Tyr Gin Glu Asn Tyr Lys Gin 
35 40 45 

Met Lys Ala Leu Val Asn Gin Leu His Glu Arg Val Glu His He Lys 
50 55 60 

Leu Gly Gly Gly Glu Lys Ala Arg Ala Leu His He Ser Arg Gly Lys 
65 70 75 80 

Leu Leu Pro Arg Glu Arg He Asp Asn Leu He Asp Pro Gly Ser Pro 

85 90 95 

Phe Leu Glu Leu Ser Gin Phe Ala Gly Tyr Gin Leu Tyr Asp Asn Glu 

100 105 110 

Glu Val Pro Gly Gly Gly He He Thr Gly He Gly Arg Val Ser Gly 
115 120 125 

Val Glu Cys Met Xle He Ala Asn Asp Ala Thr Val Lys Gly Gly Ala 
130 135 140 

Tyr Tyr Pro Val Thr Val Lys Lys Gin Leu Arg Ala Gin Glu He Ala 
145 150 155 160 

Met Gin Asn Arg Leu Pro Cys He Tyr Leu Val Asp Ser Gly Gly Ala 

165 170 175 

Tyr Leu Pro Arg Gin Ala Asp Val Phe Pro Asp Arg Asp His Phe Gly 

180 185 190 

Arg Thr Phe Tyr Asn Gin Ala He Met Ser Ser Lys Asn lie Ala Gin 
195 200 205 

He Ala Val Val Met Gly Ser Cys Thr Ala Gly Gly Ala Tyr Val Pro 
210 215 220 
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Ala Met Ala Asp Glu Asn lie lie Val Arg Lys Gin Gly Thr lie Phe 
225 230 235 240 

Leu Ala Gly Pro Pro Leu Val Lys Ala Ala Thr Gly Glu Glu Val Ser 

245 250 255 

Ala Glu Asp Leu Gly Gly Ala Asp Leu His Cys Arg Lys Ser Gly Val 

260 . 265 270 

Ser Asp His Trp Ala Leu Asp Asp His His Ala Leu His Leu Thr Arg 
275 280 285 

Lys Val Val Arg Asn Leu Asn Tyr Gin Lys Lys Leu Asp Val Thr lie 
290 295 300 

Glu Pro Ser Glu Glu Pro Leu Phe Pro Ala Asp Glu Leu Tyr Gly He 
305 310 315 320 

Val Gly Ala Asn Leu Lys Arg Ser Phe Asp Val Arg Glu Val He Ala 

325 330 335 

Arg He Val Asp Gly Ser Arg Phe Thr Glu Phe Lys Ala Phe Tyr Gly 

340 345 350 

Asp Thr Leu Val Thr Gly Phe Ala Arg He Phe Gly Tyr Pro Val Gly 
355 360 365 

He Val Gly Asn Asn Gly Val Leu Phe Ser Glu Ser Ala Lys Lys Gly 
370 375 380 

Thr His Phe Val Gin Leu Cys Cys Gin Arg Asn He Pro Leu Leu Phe 
385 390 395 400 

Leu Gin Asn He Thr Gly Phe Met Val Gly Arg Glu Tyr Glu Ala Glu 

405 410 415 

Gly He Ala Lys Asp Gly Ala Lys Met Val Ala Ala Val Ala Cys Ala 

420 425 430 

Gin Val Pro Lys He Thr Leu He He Gly Gly Ser Tyr Gly Ala Gly 
435 440 445 

Asn Tyr Gly Met Cys Gly Arg Ala Tyr Ser Pro Arg Phe Leu Tyr He 
450 455 460 

Trp Pro Asn Ala Arg He Ser Val Met Gly Gly Glu Gin Ala Ala Asn 
465 470 475 480 

Val Leu Ala Thr He Thr Lys Asp Gin Arg Ala Arg Glu Gly Lys Gin 

485 490 495 

Phe Ser Ser Ala Asp Glu Ala Ala Leu Lys Glu Pro He He Lys Lys 

500 505 510 

Phe Glu Glu Glu Gly Asn Pro Tyr Tyr Ser Ser Ala Arg Val Trp Asp 
515 520 525 

Asp Gly He He Asp Pro Ala Asp Thr Arg Leu Val Leu Gly Leu Ser 
530 535 540 

Phe Ser Ala Ala Leu Asn Ala Pro He Glu Lys Thr Asp Phe Gly lie 
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545 550 • 555 560 

Phe Arg Met 



<210> 306 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 306 

Met Val Gin Phe Glu Val lie Phe Leu Leu Phe Gly Leu Cys Phe Ser 
15 10 15 

Ser Ser Ser Ser Arg Leu Val Gly Ser Gin Val Glu Asn Phe Ser Pro 

20 25 30 

Thr Pro Cys lie Phe Gin Ala Phe Arg Cys Ser Ser Leu Ala lie lie 
35 40 45 

Ser Met Ser Leu Ser 
50 



<210> 307 
<211> 421 
<212> PRT 

<213> Homo sapiens 
<400> 307 

Met Thr Val Phe Phe Lys Thr Leu Arg Asn His Trp Lys Lys Thr Thr 
15 10 15 

Ala Gly Leu Cys Leu Leu Thr Trp Gly Gly His Trp Leu Tyr Gly Lys 

20 25 30 

His Cys Asp Asn Leu Leu Arg Arg Ala Ala Cys Gin Glu Ala Gin Val 
35 40 45 

Phe Gly Asn Gin Leu lie Pro Pro Asn Ala Gin Val Lys Lys Ala Thr 
50 55 60 

Val Phe Ser lie Leu Gin Leu Ala Lys Glu Lys Pro Gly Leu Tyr Leu 
65 70 75 80 

Lys Lys Met Leu Pro Asp Phe Thr Phe lie Trp His Gly Cys Asp Tyr 

85 90 95 

Cys Lys Thr Asp Tyr Glu Gly Gin Ala Lys Lys Leu Leu Glu Leu Met 

100 105 110 

Glu Asn Thr Asp Val He He Val Ala Gly Gly Asp Gly Thr Leu Gin 
115 120 125 

Glu Val Val Thr Gly Val Leu Arg Arg Thr Asp Glu Ala Thr Phe Ser 
130 135 140 

Lys He Pro He Gly Phe He Pro Leu Gly Glu Thr Ser Ser Leu Ser 
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145 150 155 160 

His Thr Leu Phe Ala Glu Ser Gly Asn Lys Val Gin His lie Thr Asp 

165 170 175 

Ala Thr Leu Ala He Val Lys Gly Glu Thr Val Pro Leu Asp Val Leu 

180 185 190 

Gin He Lys Gly Glu Lys Glu Gin Pro Val Phe Ala Met Thr Gly Leu 
195 200 205 

Arg Trp Gly Ser Phe Arg Asp Ala Gly Val Lys Val Ser Lys Tyr Trp 
210 215 220 

Tyr Leu Gly Pro Leu Lys lie Lys Ala Ala His Phe Phe Ser Thr Leu 
225 230 235 240 

Lys Glu Trp Pro Gin Thr His Gin Ala Ser He Ser Tyr Thr Gly Pro 

245 250 255 

Thr Glu Arg Pro Pro Asn Glu Pro Glu Glu Thr Pro Val Gin Arg Pro 

260 265 270 

Ser Leu Tyr Arg Arg He Leu Arg Arg Leu Ala Ser Tyr Trp Ala Gin 
275 280 285 

Pro Gin Asp Ala Leu Ser Gin Glu Val Ser Pro Glu Val Trp Lys Asp 
290 295 300 

Val Gin Leu Ser Thr He Glu Leu Ser He Thr Thr Arg Asn Asn Gin 
305 310 315 320 

Leu Asp Pro Thr Ser Lys Glu Asp Phe Leu Asn He Cys He Glu Pro 

325 330 335 

Asp Thr He Ser Lys Gly Asp Phe He Thr He Gly Ser Arg Lys Val 

340 . 345 350 

Arg Asn Pro Lys Leu His Val Glu Gly Thr Glu Cys Leu Gin Ala Ser 
355 360 365 

Gin Cys Thr Leu Leu He Pro Glu Gly Ala Gly Gly Ser Phe Ser He 
370 375 380 

Asp Ser Glu Glu Tyr Glu Ala Met Pro Val Glu Val Lys Leu Leu Pro 
385 390 395 400 

Arg Lys Leu Gin Phe Phe Cys Asp Pro Arg Lys Arg Glu Gin Met Leu 

405 410 415 

Thr Ser Pro Thr Gin 

420 



<210> 308 
<211> 242 
<212> PRT 

<213> Homo sapiens 
<400> 308 
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Met Gin Leu Gly Ser Val Leu Leu Thr Arg Cys Pro Phe Tip Gly Cys 
15 10 15 

Phe Ser Gin Leu Met Leu Tyr Ala Glu Arg Ala Glu Ala Arg Arg Lys 

20 25 30 

Pro Asp lie Pro Val Pro Tyr Leu Tyr Phe Asp Met Gly Ala Ala Val 
35 40 45 

Leu Cys Ala Ser Phe Met Ser Phe Gly Val Lys Arg Arg Trp Phe Ala 
50 55 . 60 

Leu Gly Ala Ala Leu Gin Leu Ala He Ser Thr Tyr Ala Ala Tyr He 
65 70 75 80 

Gly Gly Tyr Val His Tyr Gly Asp Trp Leu Lys Val Arg Met Tyr Ser 

85 90 95 

Arg Thr Val Ala He He Gly Gly Phe Leu Val Leu Ala Ser Gly Ala 

100 105 110 

Gly Glu Leu Tyr Arg Arg Lys Pro Arg Ser Arg Ser Leu Gin Ser Thr 
115 120 125 

Gly Gin Val Phe Leu Gly He Tyr Leu He Cys Val Ala Tyr Ser Leu 
130 135 140 

Gin His Ser Lys Glu Asp Arg Leu Ala Tyr Leu Asn His Leu Pro Gly 
145 150 155 160 

Gly Glu Leu Met He Gin Leu Phe Phe Val Leu Tyr Gly He Leu Ala 

165 170 175 

Leu Ala Phe Leu Ser Gly Tyr Tyr Val Thr Leu Ala Ala Gin He Leu 

180 185 190 

Ala Val Leu Leu Pro Pro Val Met Leu Leu He Asp Gly Asn Val Ala 
195 200 205 

Tyr Trp His Asn Thr Arg Arg Val Glu Phe Trp Asn Gin Met Lys Leu 
210 215 220 

Leu Gly Glu Ser Val Gly He Phe Gly Thr Ala Val He Leu Ala Thr 
225 230 235 240 

Asp Gly 



<210> 309 
<211> 189 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (94) 

<223> Xaa equals any amino acid 
<400> 309 
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Met Ala Leu Leu Ser Arg Pro Ala Leu Thr Leu Leu Leu Leu Leu Met 
15 10 15 

Ala Ala Val Val Arg Cys Gin Glu Gin Ala Gin Thr Thr Asp Trp Arg 

20 25 30 

Ala Thr Leu Lys Thr lie Arg Asn Gly Val His Lys lie Asp Thr Tyr 
35 40 45 

Leu Asn Ala Ala Leu Asp Leu Leu Gly Gly Glu Asp Gly Leu Cys Gin 
50 55 60 

Tyr Lys Cys Ser Asp Gly Ser Lys Pro Phe Pro Arg Tyr Gly Tyr Lys 
65 70 75 8.0 

Pro Ser Pro Pro Asn Gly Cys Gly Ser Pro Leu Phe Gly Xaa His Leu 

85 90 95 

Asn lie Gly lie Pro Ser Leu Thr Lys Cys Cys Asn Gin His Asp Arg 

100 105 110 

Cys Tyr Glu Thr Cys Gly Lys Ser Lys Asn Asp Cys Asp Glu Glu Phe 
115 120 125 

Gin Tyr Cys Leu Ser Lys lie Cys Arg Asp Val Gin Lys Thr Leu Gly 
130 135 140 

Leu Thr Gin His Val Gin Ala Cys Glu Thr Thr Val Glu Leu Leu Phe 
145 150 155 160 

Asp Ser Val lie His Leu Gly Cys Lys Pro Tyr Leu Asp Ser Gin Arg 

165 170 175 

Ala Ala Cys Arg Cys His Tyr Glu Glu Lys Thr Asp Leu 

180 185 



<210> 310 
<211> 64 
<212> PRT 

<213> Homo sapiens 
<400> 310 

Met Pro JjGU Phe Leu Phe Val Ala His Leu lie Ser Leu Leu Leu Ala 
15 10 15 

Phe Arg Arg Pro Pro Ala Ser Gin lie Thr Pro Arg Ala Trp Thr Thr 

20 25 30 

Glu He Ala Ser Cys Glu Ser Val Glu Met Val Lys Ala Leu Ser Ser 
35 40 45 

Leu Arg Ser Arg Ala Gin Val Asn Ala Asp Phe Pro Gly His Leu Cys 
50 55 60 
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<210> 311 
<211> 49 
<212> PRT 

<213> Homo sapiens 
<400> 311 

Met Asn Leu Leu Gly Met lie Phe Ser 
1 5 

Leu Lys Trp Cys Ala Trp Val Ala Val 

20 25 

Ala Asn Ser Arg Ser Ser Glu Asp Thr 
35 40 

Met 



PCTAJS02/08276 



Met Cys Gly Leu Met Leu Lys 
10 15 

Tyr Cys Ser Phe lie Ser Phe 

30 

Lys Gin Met Met Ser Ser Phe 

45 



<210> 312 
<211> 59 
<212> PRT 

<213> Homo sapiens 
<400> 312 

Met Asn Ser Thr Leu Cys Val Val Leu Ser Leu Met Cys Met Asn Ser 
15 10 15 

Thr Leu Cys Val Val Leu Ser Leu Thr His Ser Cys Pro Ser Pro Gin 

20 25 30 

Val Pro Lys Val His Tyr Met lie Phe Met Pro Leu His Leu His Ser 
35 40 45 

Leu Ala Leu Thr Gin Leu lie lie lie Tyr Lys 
50 55 



<210> 313 
<211> 240 
<212> PRT 

<213> Homo sapiens 
<400> 313 

Met Gly Asn Cys Gin Ala Gly His Asn Leu His Leu Cys Leu Ala His 
15 10 15 

His Pro Pro Leu Val Cys Ala Thr Leu lie Leu Leu Leu Leu Gly Leu 

20 25 30 

Ser Gly Leu Gly Leu Gly Ser Phe Leu Leu Thr His Arg Thr Gly Leu 
35 40 45 

Arg Ser Pro Asp He Pro Gin Asp Trp Val Ser Phe Leu Arg Ser Phe 
50 55 60 

Gly Gin Leu Thr Leu Cys Pro Arg Asn Gly Thr Val Thr Gly Lys Trp 
65 70 75 80 
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Arg Gly Ser His Val Val Gly Leu Leu Thr Thr Leu Asn Phe Gly Asp 

85 90 95 

Gly Pro Asp Arg Asn Lys Thr Arg Thr Phe Gin Ala Thr Val Leu Gly 

100 105 110 

Ser Gin Met Gly Leu Lys Gly Ser Ser Ala Gly Gin Leu Val Leu He 
115 120 125 

Thr Ala Arg Val Thr Thr Glu Arg Thr Ala Gly Thr Cys Leu Tyr Phe 
130 135 140 

Ser Ala Val Pro Gly He Leu Pro Ser Ser Gin Pro Pro He Ser Cys 
145 150 155 160 

Ser Glu Glu Gly Ala Gly Asn Ala Thr Leu Ser Pro Arg Met Gly Glu 

165 170 175 

Glu Cys Val Ser Val Trp Ser His Glu Gly Leu Val Leu Thr Lys Leu 

180 185 190 

Leu Thr Ser Glu Glu Leu Ala Leu Cys Gly Ser Arg Leu Leu Val Leu 
195 200 205 * 

Gly Ser Phe Leu Leu Leu Phe Cys Gly Leu Leu Cys Cys Val Thr Ala 
210 215 220 

Met Cys Phe His Pro Arg Arg Glu Ser His Trp Ser Arg Thr Arg Leu 
225 230 235 240 



<210> 314 

<211> 39 

<212> PRT 

<213> Homo sapiens 

<400> 314 

Met Leu Leu Leu Leu Lys Thr Leu Phe Val Thr Phe Trp Ser Thr Asn 
15 10 15 

Leu Ser He Thr Phe Ser Asn Tyr Asn Val Lys Leu Tyr Gin Trp Gin 

20 25 30 

Ser Tyr He Val Asn Gly Ser 
35 



<210> 315 
<211> 174 
<212> PRT 

<213> Homo sapiens 
<400> 315 

Met Glu Ala Pro Gly Pro Arg Ala Leu Arg Thr Ala Leu Cys Gly Gly 
15 10 15 
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Cys Cys Cys Leu Leu Leu Cys Ala Gin Leu Ala Val Ala Gly Lys Gly 

20 25 30 

Ala Arg Gly Phe Gly Arg Gly Ala Leu lie Arg Leu Asn He Trp Pro 
35 40 45 

Ala Val Gin Gly Ala Cys Lys Gin Leu Glu Val Cys Glu His Cys Val 
50 55 60 

Glu Gly Asp Arg Ala Arg Asn Leu Ser Ser Cys Met Trp Glu Gin Cys 
65 70 75 80 

Arg Pro Glu Glu Pro Gly His Cys Val Ala Gin Ser Glu Val Val Lys 

85 90 95 

Glu Gly Cys Ser He Tyr Asn Arg Ser Glu Ala Cys Pro Ala Ala His 

100 105 110 

His His Pro Thr Tyr Glu Pro Lys Thr Val Thr Thr Gly Ser Pro Pro 
115 120 125 

Val Pro Glu Ala His Ser Pro Gly Phe Asp Gly Ala Ser Phe He Gly 
130 135 140 

Gly Val Val Leu Val Leu Ser Leu Gin Ala Val Ala Phe Phe Val Leu 
145 150 155 160 

His Phe Leu Lys Ala Lys Asp Ser Thr Tyr Gin Thr Leu He 

165 170 



<210> 316 
<211> 61 
<212> PRT 

<213> Homo sapiens 
<400> 316 

Met Tyr Leu Phe Leu Lys Thr Leu Leu Ser Phe Ser Thr Leu Met Met 
15 10 15 

Thr Thr Ala Leu Ser Phe Met Val He Thr Val Leu Trp Val Leu Leu 

20 25 30 

Leu His Leu Leu Ala Asn He Cys He Pro Arg Lys Cys Ser Phe Ala 
35 40 45 

Cys Phe Tyr lie Asn Gly He Leu Leu His Ala Val Phe 
50 55 60 



<210> 317 
<211> 319 
<212> PRT 

<213> Homo sapiens 
<400> 317 

Met Ser Trp Cys Cys Leu Trp Leu Cys Leu Ser Ser Val Gly Arg Thr 
15 10 15 
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Gly Ser Ala Gly Pro Ser Leu Pro Phe Ser Glu Leu Cys Ser Leu Gly 

20 25 30 

Leu Leu Arg Leu Arg Pro Val Phe Ser Pro Leu His Ser Gly Pro Gly 
35 40 45 

Lys Pro Ala Gin Phe Leu Ala Gly Glu Ala Glu Glu Val Asn Ala Phe 
50 55 60 

Ala Leu Gly Phe Leu Ser Thr Ser Ser Gly Val Ser Gly Glu Asp Glu 
65 70 75 80 

Val Glu Pro Leu His Asp Gly Val Glu Glu Ala Glu Lys Lys Met Glu 

85 90 95 

Glu Glu Gly Val Ser Val Ser Glu Met Glu Ala Thr Gly Ala Gin Gly 

100 105 110* 

Pro Ser Arg Val Glu Glu Ala Glu Gly His Thr Glu Val Thr Glu Ala 
115 120 125 

Glu Gly Ser Gin Gly Thr Ala Glu Ala Asp Gly Pro Gly Ala Ser Ser 
130 135 140 

Gly Asp Glu Asp Ala Ser Gly Arg Ala Ala Ser Pro Glu Ser Ala Ser # 
145 150 155 160 

Ser Thr Pro Glu Ser Leu Gin Ala Arg Arg His His Gin Phe Leu Glu 

165 170 175 

Pro Ala Pro Ala Pro Gly Ala Ala Val Leu Ser Ser Glu Pro Ala Glu 

180 185 190 

Pro Leu Leu Val Arg His Pro Pro Arg Pro Arg Thr Thr Gly Pro Arg 
195 200 205 

Pro Arg Gin Asp Pro His Lys Ala Gly Leu Ser His Tyr Val Lys Leu 
210 215 220 

Phe ( Ser Phe Tyr Ala Lys Met Pro Met Glu Arg Lys Ala Leu Glu Met 
225'' 230. 235 240 

Val Glu Lys Cys Leu Asp Lys Tyr Phe Gin His Leu Cys Asp Asp Leu 

245 250 255 

Glu Val Phe Ala Ala His Ala Gly Arg Lys Thr Val Lys Pro Glu Asp 

260 265 270 

Leu Glu Leu Leu Met Arg Arg Gin Gly Leu Val Thr Asp Gin Val Ser 
275 280 285 

Leu His Val Leu Val Glu Arg His Leu Pro Leu Glu Tyr Arg Gin Leu 
290 295 300 

Leu lie Pro Cys Ala Tyr Ser Gly Asn Ser Val Phe Pro Ala Gin 
305 310 315 



<210> 318 
<211> 336 
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<212> PRT 

<213> Homo sapiens 
<400> 318 

Met lie Ser Tyr lie Val Leu Leu Ser lie Leu Leu Trp Pro Leu Val 
15 10 15 

Val Tyr His Glu Leu He Gin Arg Met Tyr Thr Arg Leu Glu Pro Leu 

20 25 30 

Leu Met Gin Leu Asp Tyr Ser Met Lys Ala Glu Ala Asn Ala Leu His 
35 40 45 

His Lys His Asp Lys Arg Lys Arg Gin Gly Lys Asn Ala Pro Pro Gly 
50 55 60 

Gly Asp Glu Pro Leu Ala Glu Thr Glu Ser Glu Ser Glu Ala Glu Leu 
65 70 75 80 

Ala Gly Phe Ser Pro Val Val Asp Val Lys Lys Thr Ala Leu Ala Leu 

85 90 95 

Ala lie Thr Asp Ser Glu Leu Ser Asp Glu Glu Ala Ser lie Leu Glu 

100 105 ' 110 

Ser Gly Gly Phe Ser Val Ser Arg Ala Thr Thr Pro Gin Leu Thr Asp 
115 120 125 

Val Ser Glu Asp Leu Asp Gin Gin Ser Leu Pro Ser Glu Pro Glu Glu 
130 135 140 

Thr Leu Ser Arg Asp Leu Gly Glu Gly Glu Glu Gly Glu Leu Ala Pro 
145 150 155 160 

Pro Glu Asp Leu Leu Gly Arg Pro Gin Ala Leu Ser Arg Gin Ala Leu 

165 170 175 

Asp Ser Glu Glu Glu Glu Glu Asp Val Ala Ala Lys Glu Thr Leu Leu 

180 185 190 

Arg Leu Ser Ser Pro Leu His Phe Val Asn Thr His Phe Asn Gly Ala 
195 200 205 

Gly Ser Pro Gin Asp Gly Val Lys Cys Ser Pro Gly Gly Pro Val Glu 
210 215 220 

Thr Leu Ser Pro . Glu Thr Val Ser Gly Gly Leu Thr Ala Leu Pro Gly 
225 230 235 240 

Thr Leu Ser Pro Pro Leu Cys * Leu Val Gly Ser Asp Pro Ala Pro Ser 

245 250 255 

Pro Ser He Leu Pro Pro Val Pro Gin Asp Ser Pro Gin Pro Leu Pro 

260 265 270 

Ala Pro Glu Glu Glu Glu Ala Leu Thr Thr Glu Asp Phe Glu Leu Leu 
275 280 285 

Asp Gin Gly Glu Leu Glu Gin Leu Asn Ala Glu Leu Gly Leu Glu Pro 
290 295 300 
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Glu Thr Pro Pro Lys Pro Pro Asp Ala Pro Pro Leu Gly Pro Asp lie 
305 310 315 320 

His Ser Leu Val Gin Ser Asp Gin Glu Ala Gin Ala Val Ala Glu Pro 

325 330 335 



<210> 319 
<211> 272 
<212> PRT 

<213> Homo sapiens 
<400> 319 

Met Trp Gly Asn Lys Phe Gly Val Leu Leu Phe Leu Tyr Ser Val Leu 
15 10 15 

Leu Thr Lys Gly lie Glu Asn He Lys Asn Glu He Glu Asp Ala Ser 

20 25 30 

Glu Pro Leu He Asp Pro Val Tyr Gly His Gly Ser Gin Ser Leu He 
35 40 45 

Asn Leu Leu Leu Thr Gly His Ala Val Ser Asn Val Trp Asp Gly Asp 
50 55 60 

Arg Glu Cys Ser Gly Met Lys Leu Leu Gly He His Glu Gin Ala Ala 
65 70 75 80 

Val Gly Phe Leu Thr Leu Met Glu Ala Leu Arg Tyr Cys Lys Val Gly 

85 90 95 

Ser Tyr Leu Lys Ser Pro Lys Phe Pro He Trp He Val Gly Ser Glu 

100 105 110 

Thr His Leu Thr Val Phe Phe Ala Lys Asp Met Ala Leu Val Ala Pro 
115 120 125 

Glu Ala Pro Ser Glu Gin Ala Arg Arg Val Phe Gin Thr Tyr Asp Pro 
130 135 140 

Glu Asp Asn Gly Phe He Pro Asp Ser Leu Leu Glu Asp Val Met Lys 
145 150 155 160 

Ala Leu Asp Leu Val Ser Asp Pro Glu Tyr He Asn Leu Met Lys Asn 

165 170 175 

Lys Leu Asp Pro Glu Gly Leu Gly He He Leu Leu Gly Pro Phe Leu 

180 185 190 

Gin Glu Phe Phe Pro Asp Gin Gly Ser Ser Gly Pro Glu Ser Phe Thr 
195 200 205 

Val Tyr His Tyr Asn Gly Leu Lys Gin Ser Asn Tyr Asn Glu Lys Val 
210 215 220 

Met Tyr Val Glu Gly Thr Ala Val Val Met Gly Phe Glu Asp Pro Met 
225 230 235 240 
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Leu Gin Thr Asp Asp Thr Pro lie Lys Arg Cys Leu Gin Thx Lys Trp 

245 250 255 



Pro Tyr lie Glu Leu Leu Trp Thr Thx Asp Arg Ser Pro Ser Leu Asn 

260 265 270 



<210> 320 
<211> 89 
<212> PRT 

<213> Homo sapiens 
<400> 320 

Met Phe Lys Asp Tyr Pro Pro Ala lie Lys Pro Ser Tyr Asp Val Leu 
15 10 15 

Leu Leu Leu Leu Leu Leu Val Leu Leu Leu Gin Ala Gly Leu Asn Thr 

20 25 30 

Gly Thr Ala lie Gin Cys Val Arg Phe Lys Val Ser Ala Arg Leu Gin 
35 40 45 

Gly Ala Ser Trp Asp Thr Gin Asn Gly Pro Gin Glu Arg Leu Ala Gly 
50 55* 60 

Glu Val Ala Arg Ser Pro Leu Lys Glu Phe Asp Lys Glu Lys Ala Trp 
65 70 75 80 

Arg Ala Val Val Val Gin Met Ala Gin 

85 



<210> 321 
<211> 51 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (23) 

<223> Xaa equals any amino acid 
<400> 321 

Met Ala Gin His His Leu Leu Ser lie Leu Leu Ala lie Leu Ser Cys 
15 10 15 

Ser Ser Gin Pro Arg Gin Xaa Arg Gly Ser Gly Ala Leu Pro Cys Glu 

20 25 30 

Val Cys Ser Ala Val Leu Leu Thr Cys Leu Arg Lys He Ser Gly Ser 
35 40 45 

Leu Cys Val 
50 
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<210> 322 
<211> 74 
<212> PRT 

<213> Homo sapiens 
<400> 322 

Met Leu His Leu Ala Ala Met Trp Trp Ala Cys Val Thr Thr Leu Val 
15 10 15 

Phe Thr Leu Val Ser Lys Leu Phe lie Pro Leu Lys Ser Ser Met Asp 

20 25 30 

Gly Glu Met Ser Leu Asp Pro His Ser Cys Val Leu Val Cys lie Cys 
35 40 45 

Phe Pro Leu Arg Phe Val Phe Val Ser Cys Phe Glu Leu Tyr Leu Val 
50 55 60 

Gin Ser lie Val Lys Leu Ser Gin Gin Leu 
65 70 



<210> 323 
<211> 127 
<212> PRT 

<213> Homo sapiens 
<400> 323 

Met Gly Gin Val Trp Arg Val Pro Pro Leu Leu Leu Ser Val Gin Val 
1 5 10 15 

Phe Leu Thr Met Ala His Ala Phe His Gin Ala Pro Glu Leu Gin Trp 

20 25 30 

Leu Gly Leu Trp Phe Trp Val Arg Leu Phe Ala Gly Gly Asp Gly Gly 
35 -40 45 

Leu His Leu Asn lie Ser Ser Val Thr Leu Pro Leu Leu His Gly Lys 
50 55 60 

Gin Leu Ser Arg Glu Val Pro Ser Cys Gin Gly Lys Pro Arg Leu Gly 
65 70 75 80 

Arg Pro Pro Tyr Lys Glu Pro Gin Asp Cys Ser His Gly Cys His Leu 

85 90 95 

Ser Trp Lys Gly Arg Phe Met Gly Phe Pro Gly Thr Pro Arg Leu Ser 

100 105 110 

Trp Pro Arg Gly Lys Arg Trp Leu Leu Gin Glu Phe Asp Leu Ser 
115 120 125 



<210> 324 
<211> 215 
<212> PRT 

<213> Homo saoiens 
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<220> 

<221> SITE 
<222> (83) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (141) 

<223> Xaa equals any amino acid 
<400> 324 

Met Tyr Gly Lys Ser Ser Thr Arg Ala Val Leu Leu Leu Leu Gly lie 
1 5 10 15 

Gin Leu Thr Ala Leu Trp Pro lie Ala Ala Val Glu lie Tyr Thr Ser 

20 25 30 

Arg Val Leu Glu Ala Val Asn Gly Thr Asp Ala Arg Leu Lys Cys Thr 
35 40 45 

Phe Ser Ser Phe Ala Pro Val Gly Asp Ala Leu Thr Val Thr Trp Asn 
50 55 60 

Phe Arg Pro Leu Asp Gly Gly Pro Glu Gin Phe Val Phe Tyr Tyr His 
65 70 75 80 

lie Asp Xaa Phe Gin Pro Met Ser Gly Arg Phe Lys Asp Arg Val Ser 

85 90 95 

Trp Asp Gly Asn Pro Glu Arg Tyr Asp Ala Ser He Leu Leu Trp Lys 

100 105 110 

Leu Gin Phe Asp Asp Asn Gly Thr Tyr Thr Cys Gin Val Lys Asn Pro 
115 120 125 

Pro Asp Val Asp Gly Val He Gly Asp He Arg Leu Xaa Val Val His 
130 135 140 

Thr Val Arg Phe Ser Glu He His Phe Leu Ala Leu Ala He Gly Ser 
145 150 155 160 

Ala Cys Ala Leu Met lie He He Val He Val Val Val Leu Phe Gin 

165 170 175 

His Tyr Arg Lys Lys Arg Trp Ala Glu Arg Ala His Lys Val Val Glu 

180 185 190 

He Lys Ser Lys Glu Glu Glu Arg Leu Asn Gin Glu Lys Lys Val Ser 
195 200 205 

Val Tyr Leu Glu Asp Thr Asp 
210 215 



<210> 325 
<211> 47 
<212> PRT 

<213> Homo sapiens 
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<400> 325 

Met Phe Tyr Pro Pro Cys Pro Phe Phe Pro Gin Leu Cys Phe Cys He 
15 10 15 

Phe Phe Leu Gly Lys Cys Lys Leu Ser Leu Ser Phe Met Thr Cys Glu 

20 25 30 

He Ser Val Ser Leu Glu Phe Val Arg Arg Arg Gly Asn His Ala 
35 40 45 



<210> 326 
<211> 100 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (36) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (51) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (83) 

<223> Xaa equals any amino acid 
<400> 326 

Met Gly Met He Leu Val Leu Ala Ser Phe Leu Ala His Pro Val Glu 
15 10 15 

Ala Leu Ala Gin Ala Val Ala Leu Gly Gin Gin Gin Leu Ala Leu Leu 

20 25 30 

Gly Val Gin Xaa His Ala Val Glu Gly Phe Leu Gin Leu Gin Xaa Cys 
35 40 45 

Phe Ala Xaa Leu Phe Val Phe Glu Gly Ala Leu Leu Ala His Leu Gly 
50 55 60 

His Phe Phe Val Glu Pro Gly Ala Ala Gin Gly Gin Leu Leu Asp Leu 
65 70 75 80 

Gly Leu Xaa Arg Arg Glu Leu Gly Phe Gin Phe Ala Leu Leu Ala Arg 

85 90 95 

Phe Val Leu Gin 

100 
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<210> 327 
<211> 40 
<212> PRT 

<213> Homo sapiens 
<400> 327 

Met lie lie Leu His lie Val Val Cys Leu Phe Thr lie Ser lie lie 
15 10 15 

Glu Glu Gin Lys Glu Glu He Leu Cys Ser Thr Lys Ser Gin Ala Glu 

20 25 30 

Lys Thr Val Thr His He Glu Gin 
35 40 



<210> 328 
<211> 108 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (62) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (63) 

<223> Xaa equals any amino acid 
<400> 328 

Met Gly Ala Ala Lys Val Trp Gly Glu Val Gly Arg Trp Leu Val He 
15 10 15 

Ala Leu He Gin Leu Ala Lys Ala Val Leu Arg Met Leu Leu Leu Leu 

20 25 30 

Trp Phe Lys Ala Gly Leu Gin Thr Ser Pro Pro He Val Pro Leu Asp 
35 40 45 

Arg Glu Thr Arg His Ser Pro Arg Met Val Thr Thr Ala Xaa Xaa Thr 
50 55 60 

Met Ser Ser Pro Thr Trp Gly Ser Gly Gin Thr Gly Trp Cys Glu Pro 
65 70 75 80 

Ser Arg Thr Arg Arg Pro Cys Thr Pro Gly Thr Gly Glu Leu Pro Ser 

85 90 95 

Ser Gly Arg Asp Gly Ser Ser Ser He Thr Arg Ser 

100 105 



<210> 329 
<211> 941 
<212> PRT 

<213> Homo sapiens 
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<400> 329 

Met Val Phe Leu Pro Leu Lys Trp Ser Leu Ala Thr Met Ser Phe Leu 
15 10 15 



Leu Ser Ser Leu Leu Ala Leu Leu Thr Val Ser Thr Pro Ser Trp Cys 

20 25 30 

Gin Ser Thr Glu Ala Ser Pro Lys Arg Ser Asp Gly Thr Pro Phe Pro 
35 40 45 

Trp Asn Lys lie Arg Leu Pro Glu Tyr Val lie Pro Val His Tyr Asp 
50 55 60 

Leu Leu lie His Ala Asn Leu Thr Thr Leu Thr Phe Trp Gly Thr Thr 
65 70 75 80 

Lys Val Glu lie Thr Ala Ser Gin Pro Thr Ser Thr lie He Leu His 

85 90 95 

Ser His His Leu Gin He Ser Arg Ala Thr Leu Arg Lys Gly Ala Gly 

100 105 110 

Glu Arg Leu Ser Glu Glu Pro Leu Gin Val Leu Glu His Pro Pro Gin 
115 120 125 

Glu Gin He Ala Leu Leu Ala Pro Glu Pro Leu Leu Val Gly Leu Pro 
130 135 140 

Tyr Thr Val Val He His Tyr Ala Gly Asn Leu Ser Glu Thr Phe His 
145 150 155 160 

Gly Phe Tyr Lys Ser Thr Tyr Arg Thr Lys Glu Gly Glu Leu Arg He 

165 170 175- 

Leu Ala Ser Thr Gin Phe Glu Pro Thr Ala Ala Arg Met Ala Phe Pro 

180 185 190 

Cys Phe Asp Glu Pro Ala Phe Lys Ala Ser Phe Ser He Lys He Arg 
195 200 205 

Arg Glu Pro Arg His Leu Ala He Ser Asn Met Pro Leu Val Lys Ser 
210 215 220 

Val Thr Val Ala Glu Gly Leu He Glu Asp His Phe Asp Val Thr Val 
225 230 235 240 

Lys Met Ser Thr Tyr Leu Val Ala Phe He He Ser Asp Phe Glu Ser 

245 250 255 

Val Ser Lys He Thr Lys Ser Gly Val Lys Val Ser Val Tyr Ala Val 

260 265 270 

Pro Asp Lys Met Asn Gin Ala Asp Tyr Ala Leu Asp Ala Ala Val Thr 
275 280 285 

Leu Leu Glu Phe Tyr Glu Asp Tyr Phe Ser He Pro Tyr Pro Leu Pro 
290 295 300 

Lys Gin Asp Leu Ala Ala He Pro Asp Phe Gin Ser Gly Ala Met Glu 
305 310 315 320 
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Asn Trp Gly Leu Thr Thr Tyr Arg Glu Ser Ala Leu Leu Phe Asp Ala 

325 330 335 

Glu Lys Ser Ser Ala Ser Ser Lys Leu Gly He Thr Met Thr Val Ala 

340 345 350 

His Glu Leu Ala His Gin Trp Phe Gly Asn Leu Val Thr Met Glu Trp 
355 360 365 

Trp Asn Asp Leu Trp Leu Asn Glu Gly Phe Ala Lys Phe Met Glu Phe 
370 375 380 

Val Ser Val Ser Val Thr His Pro Glu Leu Lys Val Gly Asp Tyr Phe 
385 390 395 400 

Phe Gly Lys Cys Phe Asp Ala Met Glu Val Asp Ala Leu Asn Ser Ser 

405 410 415 

His Pro Val Ser Thr Pro Val Glu Asn Pro Ala Gin He Arg Glu Met 

420 425 430 

Phe Asp Asp Val Ser Tyr Asp Lys Gly Ala Cys He Leu Asn Met Leu 
435 440 445 

Arg Glu Tyr Leu Ser Ala Asp Ala Phe Lys Ser Gly He Val Gin Tyr 
450 455 460 

Leu Gin Lys His Ser Tyr Lys Asn Thr Lys Asn Glu Asp Leu Trp Asp 
465 470 475 480 

Ser Met Ala Ser He Cys Pro Thr Asp Gly Val Lys Gly Met Asp Gly 

485 490 495 

Phe Cys Ser Arg Ser Gin His Ser Ser Ser- Ser Ser His Trp His Gin 

500 505 510 

Glu Gly Val Asp Val Lys Thr Met Met Asn Thr Trp Thr Leu Gin Arg 
515 520 525 

Gly Phe Pro Leu He Thr He Thr Val Arg Gly Arg Asn Val His Met 
530 535 540 

Lys Gin Glu His Tyr Met Lys Gly Ser Asp Gly Ala Pro Asp Thr Gly 
545 550 555 560 

Tyr Leu Trp His Val Pro Leu Thr Phe He Thr Ser Lys Ser Asp Met 

565 570 575 

Val His Arg Phe Leu Leu Lys Thr Lys Thr Asp Val Leu lie Leu Pro 

580 585 590 

Glu Glu Val Glu Trp He Lys Phe Asn Val Gly Met Asn Gly Tyr Tyr 
595 600 605 

He Val His Tyr Glu Asp Asp Gly Trp Asp Ser Leu Thr Gly Leu Leu 
610 615 620 

Lys Gly Thr His Thr Ala Val Ser Ser Asn Asp Arg Ala Ser Leu He 
625 630 635 640 
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Asn Asn Ala Phe Gin Leu Val Ser lie Gly Lys Leu Ser Xle Glu Lys 

645 650 655 

Ala Leu Asp Leu Ser Leu Tyr Leu Lys His Glu Thr Glu lie Met Pro 

660 665 670 

Val Phe Gin Gly Leu Asn Glu Leu He Pro Met Tyr Lys Leu Met Glu 
675 680 685 

Lys Arg Asp Met Asn Glu Val Glu Thr Gin Phe Lys Ala Phe Leu He 
690 695 700 

Arg Leu Leu Arg Asp Leu He Asp Lys Gin Thr Trp Thr Asp Glu Gly 
705 710 715 720 

Ser Val Ser Glu Arg Met Leu Arg Ser Glu Leu Leu Leu Leu Ala Cys 

725 730 735 

Val His Asn Tyr Gin Pro Cys Val Gin Arg Ala Glu Gly Tyr Phe Arg 

740 745 750 

Lys Trp Lys Glu Ser Asn Gly Asn Leu Ser Leu Pro Val Asp Val Thr 
755 760 765 

Leu Ala Val Phe Ala Val Gly Ala Gin Ser Thr Glu Gly Trp Asp Phe 
770 775 780 

Leu Tyr Ser Lys Tyr Gin Phe Ser Leu Ser Ser Thr Glu Lys Ser Gin 
785 790 795 800 

He Glu Phe Ala Leu Cys Arg Thr Gin Asn Lys Glu Lys Leu Gin Trp 

805 810 815 

Leu Leu Asp Glu Ser Phe Lys Gly Asp Lys He Lys Thr Gin Glu Phe 

820 825 830 

Pro Gin He Leu Thr Leu He Gly Arg Asn Pro Val Gly Tyr Pro Leu 
835 840 845 

Ala Trp Gin Phe Leu Arg Lys Asn Trp Asn Lys Leu Val Gin Lys Phe 
850 855 860 

Glu Leu Gly Ser Ser Ser He Ala His Met Val Met Gly Thr Thr Asn 
865 870 875 880 

Gin Phe Ser Thr Arg Thr Arg Leu Glu Glu Val Lys Gly Phe Phe Ser 

885 890 895 

Ser Leu Lys Glu Asn Gly Ser Gin Leu Arg Cys Val Gin Gin Thr He 

900 905 910 

Glu Thr He Glu Glu Asn He Gly Trp Met Asp Lys Asn Phe Asp Lys 
915 920 925 

He Arg Val Trp Leu Gin Ser Glu Lys Leu Glu Arg Met 
930 935 940 



<210> 330 
<211> 267 
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<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (172) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (175) 

<223> Xaa equals any amino acid 
<400> 330 

Met Ser Glu He Arg Gly Lys Pro He Glu Ser Ser Cys Met Tyr Gly 
15 10 15 

Thr Cys Cys Leu Trp Gly Lys Thr Tyr Ser He Gly Phe Leu Arg Phe 

20 25 30 

Cys Lys Gin Ala Thr Leu Gin Phe Cys Val Val Lys Pro Leu Met Ala 
35 40 45 

Val Ser Thr Val Val Leu Gin Ala Phe Gly Lys Tyr Arg Asp Gly Asp 
50 55 60 

Phe Asp Val Thr Ser Gly Tyr Leu Tyr Val Thr He He Tyr Asn He 
65 70 75 80 

4 

Ser Val Ser Leu Ala Leu Tyr Ala Leu Phe Leu Phe Tyr Phe Ala Thr 

85 90 95 

Arg Glu Leu Leu Ser Pro Tyr Ser Pro Val Leu Lys Phe Phe Met Val 

100 105 110 

Lys Ser Val He Phe Leu Ser Phe Trp Gin Gly Met Leu Leu Ala He 
115 120 125 

Leu Glu Lys Cys Gly Ala He Pro Lys He His Ser Ala Arg Val Ser 
130 135 140 

Val Gly Glu Gly Thr Val Ala Ala Gly Tyr Gin Asp Phe He He Cys 
145 150 155 160 

Val Glu Met Phe Phe Ala Ala Leu Ala Leu Arg Xaa Ala Phe Xaa Tyr 

165 170 175 

Lys Val Tyr Ala Asp Lys Arg Leu Asp Ala Gin Gly Arg Cys Ala Pro 

180 185 190 

Met Lys Ser He Ser Ser Ser Leu Lys Glu Thr Met Asn Pro His Asp 
195 200 205 

He Val Gin Asp Ala He His Asn Phe Ser Pro Ala Tyr Gin Gin Tyr 
210 215 220 

Thr Gin Gin Ser Thr Leu Glu Pro Gly Pro Thr Trp Arg Gly Gly Ala 
225 230 235 240 

His Gly Leu Ser Arg Ser His Ser Leu Ser Gly Ala Arg Asp Asn Glu 

245 250 255 
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Lys Thr Leu Leu Leu Ser Ser Asp Asp Glu Phe 

260 265 



<210> 331 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 331 

Met Leu Val Leu Met Thr Thr Cys lie Leu Ala Ala Val Cys Val His 
15 10 15 

Thr Ala Gin Cys Ala Pro Asp Ser Arg Met Asp Asn Asp Cys Pro Ser 

20 25 30 

His Gin Ala Gin He His Phe Arg Ala Ser Glu Val Arg Arg Gly Trp 
35 40 45 

Thr Phe Asn His Asp 
50 



<210> 332 
<211> 52 
<212> PRT 

<213> Homo sapiens 
<400> 332 

Met His Cys His Ser Ala Leu Gly Pro Met Ser Thr Pro Val Leu Pro 
15 10 15 

Phe Ser Gly He Gly Leu Ala Phe Leu Cys Leu Cys Leu Ala Ala Ser 

20 25 30 

Met Val Asp Leu Lys Cys Leu Gly Met Asn Ser Thr Leu Leu Gin Pro 
35 40 45 

Ser He Lys Glu 
50 



<210> 333 
<211> 87 
<212> PRT 

<213> Homo sapiens 
<400> 333 

Met Gly Leu His Leu Arg Pro Tyr Arg Val Gly Leu Leu Pro Asp Gly 
1.5 10 15 

Leu Leu Phe Leu Leu Leu Leu Leu Met Leu Leu Ala Asp Pro Ala Leu 

20 25 30 

Pro Ala Gly Arg His Pro Pro Val Val Leu Val Pro Gly Asp Leu Gly 
35 40 45 
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Asn Gin Leu Glu Ala Lys Leu Asp Lys Pro Thr Val Val His Tyr Leu 
50 55 60 

Cys Ser Lys Lys Thr Glu Ser Tyr Phe Thr lie Trp Leu Asn Leu Glu 
65 70 75 80 

Leu Leu Leu Pro Val His His 

85 



<210> 334 

<211> 40 

<212> PRT 

<213> Homo sapiens 

<400> 334 

Met Gly Pro Ser Gin Arg Glu Val 
1 5 

Phe Leu Leu Pro Leu Leu Leu Leu 

20 

Phe Gly Phe Lys Trp Leu Lys Asp 
35 40 



Thr Val Gin Trp His Arg Ala Leu 
10 15 

Ser Thr Arg Thr Glu Thr Lys Asn 
25 30 



<210> 335 
<211> 525 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (210) 

<223> Xaa equals any amino acid 
<400> 335 

Met Leu Ala Phe Pro Leu Leu Leu Thr Gly Leu lie Ser Phe Arg Glu 
15 10 15 

Lys Arg Leu Gin Asp Val Gly Thr Pro Ala Ala Arg Ala Arg Ala Phe 

20 25 30 

Phe Thr Ala Pro Val Val Val Phe His Leu Asn He Leu Ser Tyr Phe 
35 40 45 

Ala Phe Leu Cys Leu Phe Ala Tyr Val Leu Met Val Asp Phe Gin Pro 
50 55 60 

Val Pro Ser Trp Cys Glu Cys Ala He Tyr Leu Trp Leu Phe Ser Leu 
65 70 75 80 

Val Cys Glu Glu Met Arg Gin Leu Phe Tyr Asp Pro Asp Glu Cys Gly 

85 90 95 

Leu Met Lys Lys Ala Ala Leu Tyr Phe Ser Asp Phe Trp Asn Lys Leu 

100 105 110 

Asp Val Gly Ala He Leu Leu Phe Val Ala Gly Leu Thr Cys Arg Leu 
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115 120 125 

lie Pro Ala Thr Leu Tyr Pro Gly Arg Val He Leu Ser Leu Asp Phe 
130 135 140 

He Leu Phe Cys Leu Arg Leu Met His He Phe Thr He Ser Lys Thr 
145 150 155 160 

Leu Gly Pro Lys He He He Val Lys Arg Met Met Lys Asp Val Phe 

165 170 175 

Phe Phe Leu Phe Leu Leu Ala Val Trp Val Val Ser Phe Gly Val Ala 

180 185 190 

Lys Gin Ala He Leu He His Asn Glu Arg Arg Val Asp Trp Leu Phe 
195 200 205 

Arg Xaa Ala Val Tyr His Ser Tyr Leu Thr He Phe Gly Gin He Pro 
210 215 220 

Gly Tyr He Asp Gly Val Asn Phe Asn Pro Glu His Cys Ser Pro Asn 
225 230 235 240 

Gly Thr Asp Pro Tyr Lys Pro Lys Cys Pro Glu Ser Asp Ala Thr Gin 

245 250 255 

Gin Arg Pro Ala Phe Pro Glu Trp Leu Thr Val Leu Leu Leu Cys Leu 

260 265 270 

Tyr Leu Leu Phe Thr Asn lie Leu Leu Leu Asn Leu Leu He Ala Met 
275 280 285 

Phe Asn Tyr Thr Phe Gin Gin Val Gin Glu His Thr Asp Gin He Trp 
290 295 300 

Lys Phe Gin Arg His Asp Leu He Glu Glu Tyr His Gly Arg Pro Ala 
305 310 315 320 

Ala Pro Pro Pro Phe He Leu Leu Ser His Leu Gin Leu Phe He Lys 

325 330 335 

Arg Val Val Leu Lys Thr Pro Ala Lys Arg His Lys Gin Leu Lys Asn 

340 345 350 

Lys Leu Glu Lys Asn Glu Glu Ala Ala Leu Leu Ser Trp Glu He Tyr 
355 360 365 

Leu Lys Glu Asn Tyr Leu Gin Asn Arg Gin Phe Gin Gin Lys Gin Arg 
370 375 380 

Pro Glu Gin Lys He Glu Asp He Ser Asn Lys Val Asp Ala Met Val 
385 390 395 400 

Asp Leu Leu Asp Leu Asp Pro Leu Lys Arg Ser Gly Ser Met Glu Gin 

405 410 415 

Arg Leu Ala Ser Leu Glu Glu Gin Val Ala Gin Thr Ala Arg Ala Leu 

420 425 430 

His Trp He Val Arg Thr Leu Arg Ala Ser Gly Phe Ser Ser Glu Ala 
435 440 445 
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Asp Val Pro Thr Leu Ala Ser Gin 
450 455 

Glu Pro Gly Gly Arg Lys Lys Thr 
465 470 

Val Asn Ala Arg His Leu Leu Tyr 

485 

Pro Val Pro Asn Glu Lys Val Pro 

500 

Asp Pro Pro Phe Tyr Thr Ala Glu 
515 520 



Lys Ala Ala Glu Glu Pro Asp Ala 

460 

Glu Glu Pro .Gly Asp Ser Tyr His 

475 480 

Pro Asn Cys Pro Val Thr Arg Phe 
490 495 

Trp Glu Thr Glu Phe Leu lie Tyr 
505 510 

Arg Lys Asp Ala Ala 

525 



<210> 336 
<211> 937 
<212> PRT 

<213> Homo sapiens 
<400> 336 

Met Gin Asn Ser Gly Lys Thr Lys Phe Lys Arg Thr Ser lie Asp Arg 
15 10 15 

Leu Met Asn Thr Leu Val Leu Trp lie Phe Gly Phe Leu lie Cys Leu 

20 25 30 

Gly lie lie Leu Ala lie Gly Asn Ser lie Trp Glu Ser Gin Thr Gly 
35 40 45 

Asp Gin Phe Arg Thr Phe Leu Phe Trp Asn Glu Gly Glu Lys Ser Ser 
50 55 60 

Val Phe Ser Gly Phe Leu Thr Phe Trp Ser Tyr lie lie lie Leu Asn 
65 70 75 80 

Thr Val Val Pro lie Ser Leu Tyr Val Ser Val Glu Val lie Arg Leu 

85 90 95 

Gly His Ser Tyr Phe lie Asn Trp Asp Arg Lys Met Tyr Tyr Ser Arg 

100 105 110 

Lys Ala lie Pro Ala Val Ala Arg Thr Thr Thr Leu Asn Glu Glu Leu 
115 120 125 

Gly Gin lie Glu Tyr lie Phe Ser Asp Lys Thr Gly Thr Leu Thr Gin 
130 135 140 

Asn lie Met Thr Phe Lys Arg Cys Ser lie Asn Gly Arg lie Tyr Gly 
145 150 155 160 

Glu Val His Asp Asp Leu Asp Gin Lys Thr Glu lie Thr Gin Glu Lys 

165 170 175 

Glu Pro Val Asp Phe Ser Val Lys Ser Gin Ala Asp Arg Glu Phe Gin 

180 185 190 

Phe Phe Asp His Asn Leu Met Glu Ser lie Lys Met Gly Asp Pro Lys 



208 



WO 02/076488 PCTAJS02/08276 



195 200 205 

Val His Glu Phe Leu Arg Leu Leu Ala Leu Cys His Thr Val Met Ser 
210 215 220 

Glu Glu Asn Ser Ala Gly Glu Leu lie Tyr Gin Val Gin Ser Pro Asp 
225 230 235 240 

Glu Gly Ala Leu Val Thr Ala Ala Arg Asn Phe Gly Phe He Phe Lys 

245 250 255 

Ser Arg Thr Pro Glu Thr He Thr He Glu Glu Leu Gly Thr Leu Val 

260 265 270 

Thr Tyr Gin Leu Leu Ala Phe Leu Asp Phe Asn Asn Thr Arg Lys Arg 
275 280 285 

Met Ser Val He Val Arg Asn Pro Glu Gly Gin He Lys Leu Tyr Ser 
290 295 300 

Lys Gly Ala Asp Thr He Leu Phe Glu Lys Leu His Pro Ser Asn Glu 
305 310 315 320 

Val Leu Leu Ser Leu Thr Ser Asp His Leu Ser Glu Phe Ala Gly Glu 

325 330 335 

Gly Leu Arg Thr Leu Ala He Ala Tyr Arg Asp Leu Asp Asp Lys Tyr 

340 345 350 

Phe Lys Glu Trp His Lys Met Leu Glu Asp Ala Asn Val Ala Thr Glu 
355 360 365 

Glu Arg Asp Glu Arg He Ala Gly Leu Tyr Glu Glu He Glu Arg Asp 
370 375 380 

Leu Met Leu Leu Gly Ala Thr Ala Val Glu Asp Lys Leu Gin Glu Gly 
385 390 395 400 

Val He Glu Thr Val Thr Ser Leu Ser Leu Ala Asn He Lys He Trp 

405 410 415 

Val Leu Thr Gly Asp Lys Gin Glu Thr Ala He Asn He Gly Tyr Ala 

420 425 430 

Cys Asn Met Leu Thr Asp Asp Met Asn Asp Val Phe Val He Ala Gly 
435 " 440 445 

Asn Asn Ala Val Glu Val Arg Glu Glu Leu Arg Lys Ala Lys Gin Asn 
450 455 460 

Leu Phe Gly Gin Asn Arg Asn Phe Ser Asn Gly His Val Val Cys Glu 
465 470 475 480 

Lys Lys Gin Gin Leu Glu Leu Asp Ser He Val Glu Glu Thr He Thr 

485 490 495 

Gly Asp Tyr Ala Leu He He Asn Gly His Ser Leu Ala His Ala Leu 

500 505 510 

Glu Ser Asp Val Lys Asn Asp Leu Leu Glu Leu Ala Cys Ket Cys Lys 
515 520 525 
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Thr Val lie Cys Cys Arg Val Thr Pro Leu Gin Lys Ala Gin Val Val 
530 535 540 

Glu Leu Val Lys Lys Tyr Arg Asn Ala Val Thr Leu Ala lie Gly Asp 
545 550 555 560 

Gly Ala Asn Asp Val Ser Met lie Lys Ser Ala His lie Gly Val Gly 

565 570 575 

lie Ser Gly Gin Glu Gly Leu Gin Ala Val Leu Ala Ser Asp Tyr Ser 

580 585 590 

Phe Ala Gin Phe Arg Tyr Leu Gin Arg Leu Leu Leu Val His Gly Arg 
595 600 605 

Trp Ser Tyr Phe Arg Met Cys Lys Phe Leu Cys Tyr Phe Phe Tyr Lys 
610 615 620 

Asn Phe Ala Phe Thr Leu Val His Phe Trp Phe Gly Phe Phe Cys Gly 
625 630 635 640 

Phe Ser Ala Gin Thr Val Tyr Asp Gin Trp Phe lie Thr Leu Phe Asn 

645 650 655 

lie Val Tyr Thr Ser Leu Pro Val Leu Ala Met Gly lie Phe Asp Gin 

660 665 670 

Asp Val Ser Asp Gin Asn Ser Val Asp Cys Pro Gin Leu Tyr Lys Pro 
675 680 685 

Gly Gin Leu Asn Leu Leu Phe Asn Lys Arg Lys Phe Phe He Cys Val 
690 695 700 

Met His Gly He Tyr Thr Ser Leu Val Leu Phe Phe He Pro Tyr Gly 
705 710 715 720 

Ala Phe Tyr Asn Val Ala Gly Glu Asp Gly Gin His He Ala Asp Tyr 

725 730 735 

Gin Ser Phe Ala Val Thr Met Ala Thr Ser Leu Val He Val Val Ser 

740 745 750 

Val Gin He Ala Leu Asp Thr Ser Tyr Trp Thr Phe He Asn His Val 
755 760 765 

Phe He Trp Gly Ser He Ala He Tyr Phe Ser He Leu Phe Thr Met 
770 775 780 

His Ser Asn Gly He Phe Gly He Phe Pro Asn Gin Phe Pro Phe Val 
785 790 795 800 

Gly Asn Ala Arg His Ser Leu Thr Gin Lys Cys He Trp Leu Val He 

805 810 815 

Leu Leu Thr Thr Val Ala Ser Val Met Pro Val Val Ala Phe Arg Phe 

820 825 830 

Leu Lys Val Asp Leu Tyr Pro Thr Leu Ser Asp Gin He Arg Arg Trp 
835 840 845 
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Gin Lys Ala Gin Lys Lys Ala Arg Pro Pro Ser Ser Arg Arg Pro Arg 
850 855 860 

Thr Arg Arg Ser Ser Ser Arg Arg Ser Gly Tyr Ala Phe Ala His Gin 
865 870 875 880 

Glu Gly Tyr Gly Glu Leu lie Thr Ser Gly Lys Asn Met Arg Ala Lys 

885 890 895 

Asn Pro Pro Pro Thr Ser Gly Leu Glu Lys Thr His Tyr Asn Ser Thr 

900 905 910 

Ser Trp lie Glu Asn Leu Cys Lys Lys Thr Thr Asp Thr Val Ser Ser 
915 920 925 

Phe Ser Gin Asp Lys Thr Val Lys Leu 
930 935 



<210> 337 
<211> 122 
<212> PRT 

<213> Homo sapiens 
<400> 337 

Met He Gly Gly He Thr Cys He Leu Ser Leu He Cys Ala Leu Ala 
15 10 15 

Leu Ala Tyr Leu Asp Gin Arg Ala Glu Arg He Leu His Lys Glu Gin 

20 25 30 

Gly Lys Thr Gly Glu Val He Lys Leu Thr Asp Val Lys Asp Phe Ser 
35 40 45 

Leu Pro Leu Trp Leu He Phe He He Cys Val Cys Tyr Tyr Val Ala 
50 55 60 

Val Phe Pro Phe He Gly Leu Gly Lys Val Phe Phe Thr Glu Lys Phe 
65 70 75 80 

Gly Phe Ser Ser Gin Ala Ala Ser Ala He Asn Ser Val Val Tyr Val 

85 90 95 

He Ser Ala Pro Met Ser Pro Val Phe Gly Leu Leu Val Asp Lys Thr 

100 105 110 

Gly Lys Asn He He Trp Val Leu Cys Ala 
115 120 



<210> 338 

<211> 46 

<212> PRT 

<213> Homo sapiens 

<400> 338 

Met Pro Trp Leu Lys Ser Leu Leu His Phe Ser Leu Phe Leu Val Val 
15 10 15 



211 



WO 02/076488 PCT/USO 2/082 76 



Phe Ser Thr Leu Ala Val Lys Ser Leu Gly Val Pro Val Ala Ala Gly 

20 25 30 

Ser Pro Phe Cys He Val Asp Val Leu His Phe He Leu Leu 
35 40 45 



<210> 339 
<211> 66 
<212> PRT 

<213> Homo sapiens 
<400> 339 

Met Ser Trp Val He Val Val He He Trp Gly Tyr Leu Leu Glu Gly 
15 10 15 

His Gly Val Pro Phe Cys Lys Ser Tyr Gly Pro Ser Pro Trp Lys Leu 

20 25 30 

His Thr His His Ala Ala Tyr Asn Ser Gly Ser Ser Gin Val Tyr Arg 
35 40 45 

He Leu Glu Thr Leu Met Ser Gly Ser Thr His Cys Ser Phe Ser Gly 
50 55 60 

Thr Phe 
65 



<210> 340 
<211> 90 
<212> PRT 

<213> Homo sapiens 
<400> 340 

Met Pro Arg Ala Pro Trp Arg He Pro Leu Cys Ala Leu Pro Thr Leu 
15 10 15 

Cys Leu Gly Ser Pro Leu Pro Ser Gin Pro Thr His Pro He Phe Tyr 

20 25 30 

Asp His Arg Ala Pro Thr- Trp Lys Met Ala His Pro Gly Gly Pro Arg 
35 40 45 

Ser Ser His Ser Pro Arg Thr Trp Arg Thr Pro Ser Ser Gin Thr Lys 
50 55 60 

Ala Ala Leu Pro Ala Gly Gly Ala Arg Asn Ser Pro Leu Gin Leu Cys 
65 70 75 80 

Thr Arg Ser Arg Phe Cys Gly Thr Pro Met 

85 90 



<210> 341 
<211> 710 
<212> PRT 

<213> Homo sapiens 
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<400> 341 

Met Pro Val Pro Trp Phe Leu Leu Ser Leu Ala Leu Gly Arg Ser Pro 
15 10 15 

Val Val Leu Ser Leu Glu Arg Leu Val Gly Pro Gin Asp Ala Thr His 

20 25 30 

Cys Ser Pro Gly Leu Ser Cys Arg Leu Trp Asp Ser Asp lie Leu Cys 
35 40 45 

Leu Pro Gly Asp lie Val Pro Ala Pro Gly Pro Val Leu Ala Pro Thr 
50 55 60 

His Leu Gin Thr Glu Leu Val Leu Arg Cys Gin Lys Glu Thr Asp Cys 
65 70 75 80 

Asp Leu Cys Leu Arg Val Ala Val His Leu Ala Val His Gly His Trp 

85 90 95 

Glu Glu Pro Glu Asp Glu Glu Lys Phe Gly Gly Ala Ala Asp Leu Gly 

100 105 110 

Val Glu Glu Pro Arg Asn Ala Ser Leu Gin Ala Gin Val Val Leu Ser 
115 120 125 

Phe Gin Ala Tyr Pro Thr Ala Arg Cys Val Leu Leu Glu Val Gin Val 
130 135 140 

Pro Ala Ala Leu Val Gin Phe Gly Gin Ser Val Gly Ser Val Val Tyr 
145 150 155 160 

Asp Cys Phe Glu Ala Ala Leu Gly Ser Glu Val Arg lie Trp Ser Tyr 

165 170 175 

Thr Gin Pro Arg Tyr Glu Lys Glu Leu Asn His Thr Gin Gin Leu Pro 

180 185 190 

Asp Cys Arg Gly Leu Glu Val Trp Asn Ser lie Pro Ser Cys Trp Ala 
195 200 205 

Leu Pro Trp Leu Asn Val Ser Ala Asp Gly Asp Asn Val His Phe Gly 
210 215 220 

Leu Ser Leu Tyr Trp Asn Gin Val Gin Gly Pro Pro Lys Pro Arg Trp 
225 230 235 240 

His Lys Asn Leu Thr Gly Pro Gin lie He Thr Leu Asn His Thr Asp 

245 250 255 

Leu Val Pro Cys Leu Cys He Gin Val Trp Pro Leu Glu Pro Asp Ser 

260 265 270 

Val Arg Thr Asn He Cys Pro Phe Arg Glu Asp Pro Arg Ala His Gin 
275 280 285 

Asn Leu Trp Gin Ala Ala Arg Leu Arg Leu Leu Thr Leu Gin Ser Trp 
290 295 300 

Leu Leu Asp Ala Pro Cys Ser Leu Pro Ala Glu Ala Ala Leu Cys Trp 
305 310 315 320 
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Arg Ala Pro Gly Gly Asp Pro Cys Gin Pro Leu Val Pro Pro Leu Ser 

325 330 335 

Trp Glu Asn Val Thr Val Asp Lys Val Leu Glu Phe Pro Leu Leu Lys 

340 345 350 

Gly His Pro Asn Leu Cys Val Gin Val Asn Ser Ser Glu Lys Leu Gin 
355 360 365 

Leu Gin Glu Cys Leu Trp Ala Asp Ser Leu Gly Pro Leu Lys Asp Asp 
370 375 380 

Val Leu Leu Leu Glu Thr Arg Gly Pro Gin Asp Asn Arg Ser Leu Cys 
385 390 395 400 

Ala Leu Glu Pro Ser Gly Cys Thr Ser Leu Pro Ser Lys Ala Ser Thr 

405 410 415 

Arg Ala Ala Arg Leu Gly Glu Tyr Leu Leu Gin Asp Leu Gin Ser Gly 

420 425 430 

Gin Cys Leu Gin Leu Trp Asp Asp Asp Leu Gly Ala Leu Trp Ala Cys 
435 440 445 

Pro Met Asp Lys Tyr lie His Lys Arg Trp Ala Leu Val Trp Leu Ala 
450 455 460 

Cys Leu Leu Phe Ala Ala Ala Leu Ser Leu lie Leu Leu Leu Lys Lys 
465 470 475 480 

Asp His Ala Lys Gly Trp Leu Arg Leu Leu Lys Gin Asp Val Arg Ser 

485 490 495 

* 

Gly Ala Ala Ala Arg Gly Arg Ala Ala Leu Leu Leu Tyr Ser Ala Asp 

500 505 510 

Asp Ser Gly Phe Glu Arg Leu Val Gly Ala Leu Ala Ser Ala Leu Cys 
515 520 525- 

Gin Leu Pro Leu Arg Val Ala Val Asp Leu Trp Ser Arg Arg Glu Leu 
530 535 540 

Ser Ala Gin Gly Pro Val Ala Trp Phe His Ala Gin Arg Arg Gin Thr 
545 550 555 560 

Leu Gin Glu Gly Gly Val Val Val Leu Leu Phe Ser Pro Gly Ala Val 

565 570 575 

Ala Leu Cys Ser Glu Trp Leu Gin Asp Gly Val Ser Gly Pro Gly Ala 

580 585 590 

His Gly Pro His Asp Ala Phe Arg Ala Ser Leu Ser Cys Val Leu Pro 
595 600 605 

Asp Phe Leu Gin Gly Arg Ala Pro Gly Ser Tyr Val Gly Ala Cys Phe 
610 615 620 

Asp Arg Leu Leu His Pro Asp Ala Val Pro Ala Leu Phe Arg Thr Val 
625 630 635 640 
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Pro Val Phe Thr Leu Pro Ser Gin Leu Pro Asp Phe Leu Gly Ala Leu 

645 650 655 

Gin Gin Pro Arg Ala Pro Arg Ser Gly Arg Leu Gin Glu Arg Ala Glu 

660 665 670 

Gin Val Ser Arg Ala Leu Gin Pro Ala Leu Asp Ser Tyr Phe His Pro 
675 680 685 

Pro Gly Thr Pro Ala Pro Gly Arg Gly Val Gly Pro Gly Ala Gly Pro 
690 695 700 

Gly Ala Gly Asp Gly Thr 
705 710 



<210> 342 
<211> 48 
<212> PRT 

<213> Homo sapiens 
<400> 342 

Met Phe Ala Pro Cys Phe Val Asn Leu Ala Leu Phe Tyr Leu Tyr lie 
15 10 15 

Asn Ser Cys Asn Leu Leu Asn Leu Thr Ser lie Asp Pro Phe Glri Gin 

20 25 30 

Lys Gly Lys Phe Lys Met Gin Thr Leu Leu Phe Ala Lys Glu Asp Ser 
35 40 45 



<210> 343 
<211> 467 
<212> PRT 

<213> Homo sapiens 
<400> 343 

Met Leu Leu Leu Leu Leu Leu Pro Leu Leu Trp Gly Arg Glu Arg Val 
15 10 15 

Glu Gly Gin Lys Ser Asn Arg Lys Asp Tyr Ser Leu Thr Met Gin Ser 

20 25 30 

Ser Val Thr Val Gin Glu Gly Met Cys Val His Val Arg Cys Ser Phe 
35 40 45 

Ser Tyr Pro Val Asp Ser Gin Thr Asp Ser Asp Pro Val His Gly Tyr 
50 55 60 

Trp Phe Arg Ala Gly Asn Asp lie Ser Trp Lys Ala Pro Val Ala Thr 
65 70 75 80 

Asn Asn Pro Ala Trp Ala Val Gin Glu Glu Thr Arg Asp Arg Phe His 

85 90 95 
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Leu Leu Gly Asp Pro Gin Thr Lys Asn Cys Thr Leu Ser lie Arg Asp 

100 105 110 

Ala Arg Met Ser Asp Ala Gly Arg Tyr Phe Phe Arg Met Glu Lys Gly 
115 120 125 

Asn lie Lys Trp Asn Tyr Lys Tyr Asp Gin Leu Ser Val Asn Val Thr 
130 135 140 

Ala Leu Thr His Arg Pro Asn lie Leu lie Pro Gly Thr Leu Glu Ser 
145 150 155 160 

Gly Cys Phe Gin Asn Leu Thr Cys Ser Val Pro Trp Ala Cys Glu Gin 

165 170 175 

Gly Thr Pro Pro Met lie Ser Trp Met Gly Thr Ser Val Ser Pro Leu 

180 185 190 

His Pro Ser Thr Thr Arg Ser Ser Val Leu Thr Leu lie Pro Gin Pro 
195 200 205 

Gin His His Gly Thr Ser Leu Thr Cys Gin Val Thr Leu Pro Gly Ala 
210 215 220 

Gly Val Thr Thr Asn Arg Thr lie Gin Leu Asn Val Ser Tyr Pro Pro 
225 230 235 240 

Gin Asn Leu Thr Val Thr Val Phe Gin Gly Glu Gly Thr Ala Ser Thr 

245 250 255 

Ala Leu Gly Asn Ser Ser Ser Leu Ser Val Leu Glu Gly Gin Ser Leu 

260 265 270 

Arg Leu Val Cys Ala Val Asp Ser Asn Pro Pro Ala Arg Leu Ser Trp 
275 280 285 

Thr Trp Arg Ser Leu Thr Leu Tyr Pro Ser Gin Pro Ser Asn Pro Leu 
290 295 300 

Val Leu Glu Leu Gin Val His Leu Gly Asp Glu Gly Glu Phe Thr Cys 
305 310 315 320 

Arg Ala Gin Asn Ser Leu Gly Ser Gin His Val Ser Leu Asn Leu Ser 

325 330 335 

Leu Gin Gin Glu Tyr Thr Gly Lys Met Arg Pro Val Ser Gly Val Leu 

340 345 350 

Leu Gly Ala Val Gly Gly Ala Gly Ala. Thr Ala Leu Val Phe Leu Ser 
355 360 365 

Phe Cys Val lie Phe lie Val Val Arg Ser Cys Arg Lys Lys Ser Ala 
370 375 380 

Arg Pro Ala Ala Asp Val Gly Asp lie Gly Met Lys Asp Ala Asn Thr 
385 390 395 400 

He Arg Gly Ser Ala Ser Gin Gly Asn Leu Thr Glu Ser Trp Ala Asp 

405 410 415 

Asp Asn Pro Arg His His Gly Leu Ala Ala His Ser Ser Gly Glu Glu 
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420 

Arg Glu lie Gin Tyr Ala 
435 

Asp Leu Ser Gly Gin Glu 
450 

lie Pro Lys 
465 



425 

Pro Leu Ser Phe His Lys 
440 

Ala Thr Asn Asn Glu Tyr 
455 460 



PCTAJS02/08276 

430 

Gly Glu Pro Gin 
445 

Ser Glu lie Lys 



<210> 344 
<211> 98 
<212> PRT 

<213> Homo sapiens 
<400> 344 

Met His Cys Cys Gin Leu Pro Trp Arg Cys Ala Gin Ala Pro Gin Glu 
15 10 15 

Ala Phe Leu Leu Cys Leu Leu Phe Leu lie Leu Val Leu Val Leu Leu 

20 25 30 

Gly Cys Ser Arg Gly Leu Pro Gly His Thr Pro Trp Arg Leu His Pro 
35 40 45 

Ala Ala Ala Ala Leu Leu Ala Pro Leu Leu His Asp Ala Leu Gly Ala 
50 55 60 

Cys Gly Phe Gin Gly Pro Glu Tyr Leu Leu Pro Cys Leu Leu Pro Leu 
65 70 75 80 

Pro Lys Pro Gly Gin Leu Gin Gly Pro Trp Gly Pro Leu Trp Ala Leu 

85 90 95 

Leu Pro 



<210> 345 
<211> 365 
<212> PRT 

<213> Homo sapiens 
<400> 345 

Met Phe Val Gly Leu Met Ala Phe Leu Leu Ser Phe Tyr Leu lie Phe 
1 5 10 15 

Thr Asn Glu Gly Arg Ala Leu Lys Thr Ala Thr Ser Leu Ala Glu Gly 

20 25 30 

Leu Ser Leu Val Val Ser Pro Asp Ser lie His Ser Val Ala Pro Glu 
35 40 45 

Asn Glu Gly Arg Leu Val His lie lie Gly Ala Leu Arg Thr Ser Lys 
50 55 60 

Leu Leu Ser Asp Pro Asn Tyr Gly Val His Leu Pro Ala Val Lys Leu 
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65 70 75 80 

Arg Arg His Val Glu Met Tyr Gin Trp Val Glu Thr Glu Glu Ser Arg 

85 90 95 

Glu Tyr Thr Glu Asp Gly Gin Val Lys Lys Glu Thr Arg Tyr Ser Tyr 

100 105 110 

Asn Thr Glu Trp Arg Ser Glu lie lie Asn Ser Lys Asn Phe Asp Arg 
115 120 125 

Glu lie Gly His Lys Asn Pro Ser Ala Met Ala Val Glu Ser Phe Met 
130 135 140 

Ala Thr Ala Pro Phe Val Gin lie Gly Arg Phe Phe Leu Ser Ser Gly 
145 150 155 160 

Leu lie Asp Lys Val Asp Asn Phe Lys Ser Leu Ser Leu Ser Lys Leu 

165 170 175 

Glu Asp Pro His Val Asp lie lie Arg Arg Gly Asp Phe Phe Tyr His 

180 185 190 

Ser Glu Asn Pro Lys Tyr Pro Glu Val Gly Asp Leu Arg Val Ser Phe 
195 200 205 

Ser Tyr Ala Gly Leu Ser Gly Asp Asp Pro Asp Leu Gly Pro Ala His 
210 215 220 

Val Val Thr Val lie Ala Arg Gin Arg Gly Asp Gin Leu Val Pro Phe 
225 230 235 240 

Ser Thr Lys Ser Gly Asp Thr Leu Leu Leu Leu His His Gly Asp Phe 

245 250 255 

Ser Ala Glu Glu Val Phe His Arg Glu Leu Arg Ser Asn Ser Met Lys 

260 265 270 

Thr Trp Gly Leu Arg Ala Ala Gly Trp Met Ala Met Phe Met Gly Leu 
275 280 285 

Asn Leu Met Thr Arg lie Leu Tyr Thr Leu Val Asp Trp Phe Pro Val 
290 295 300 

Phe Arg Asp Leu Val Asn lie Gly Leu Lys Ala Phe Ala Phe Cys Val 
305 310 315 320 

Ala Thr Ser Leu Thr Leu Leu Thr Val Ala Ala Gly Trp Leu Phe Tyr 

325 330 335 

Arg Pro Leu Trp Ala Leu Leu lie Ala Gly Leu Ala Leu Val Pro lie 

340 345 350 

Leu Val Ala Arg Thr Arg Val Pro Ala Lys Lys Leu Glu 
355 360 365 



<210> 346 
<211> 608 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> SITE 
<222> (265) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (597) 

<223> Xaa equals any amino acid 
<400> 346 

Met Val Gly Thr Lys Leu Arg Gin Thr Lys Asp Ala Leu Phe Thr lie 
15 10 15 

Leu His Asp Leu Arg Pro Gin Asp Arg Phe Ser lie He Gly Phe Ser 

20 25 30 

Asn Arg He Lys Val Trp Lys Asp His Leu He Ser Val Thr Pro Asp 
35 40 45 

Ser He Arg Asp Gly Lys Val Tyr He His His Met Ser Pro Thr Gly 
50 55 60 

Gly Thr Asp He Asn Gly Val Leu Gin Arg Ala He Arg Leu Leu Asn 
65 70 75 80 

Lys Tyr Val Ala His Ser Gly He Gly Asp Arg Ser Val Ser Leu He 

85 90 95 

Val Phe Leu Thr Asp Gly Lys Pro Thr Val Gly Glu Thr His Thr Leu 

100 105 110 

Lys He Leu Asn Asn Thr Arg Glu Ala Ala Arg Gly Gin Val Cys He 
115 120 125 

Phe Thr He Gly He Gly Asn Asp Val Asp Phe Arg Leu Leu Glu Lys 
130 135 140 

Leu Ser Leu Glu Asn Cys Gly Leu Thr Arg Arg Val His Glu Glu Glu 
145 150 155 160 

Asp Ala Gly Ser Gin Leu He Gly Phe Tyr Asp Glu He Arg Thr Pro 

165 170 175 

Leu Leu Ser Asp He Arg He Asp Tyr Pro Pro Ser Ser Val Val Gin 

180 185 190 

Ala Thr Lys Thr Leu Phe Pro Asn Tyr Phe Asn Gly Ser Glu He He 
195 200 205 

He Ala Gly Lys Leu Val Asp Arg Lys Leu Asp His Leu His Val Glu 
210 215 220 

Val Thr Ala Ser Asn Ser Lys Lys Phe He He Leu Lys Thr Asp Val 
225 230 235 240 

Pro Val Arg Pro Gin Lys Ala Gly Lys Asp Val Thr Gly Ser Pro Arg 

245 250 255 
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Pro Gly Gly Asp Gly Glu Gly Asp Xaa Asn His lie Glu Arg Leu Trp 

260 265 270 

Ser Tyr Leu Thr Thr Lys Glu Leu Leu Ser Ser Trp Leu Gin Ser Asp 
275 280 285 

Asp Glu Pro Glu Lys Glu Arg Leu Arg Gin Arg Ala Gin Ala Leu Ala 
290 295 300 

Val Ser Tyr Arg Phe Leu Thr Pro Phe Thr Ser Met Lys Leu Arg Gly 
305 310 315 320 

Pro Val Pro Arg Met Asp Gly Leu Glu Glu Ala His Gly Met Ser Ala 

325 330 335 

4 

Ala Met Gly Pro Glu Pro Val Val Gin Ser Val Arg Gly Ala Gly Thr 

•340 345 350 

Gin Pro Gly Pro Leu Leu Lys Lys Pro Tyr Gin Pro Arg lie Lys lie 
355 360 365 

Ser Lys Thr Ser Val Asp Gly Asp Pro His Phe Val Val Asp Phe Pro 
370 375 380 

Leu Ser Arg Leu Thr Val Cys Phe Asn lie Asp Gly Gin Pro Gly Asp 
385 390 395 400 

He Leu Arg Leu Val Ser Asp His Arg Asp Ser Gly Val Thr Val Asn 

405 410 415 

Gly Glu Leu He Gly Ala Pro Ala Pro Pro Asn Gly His Lys Lys Gin 

420 425 430 

Arg Thr Tyr Leu Arg Thr He Thr He Leu lie Asn Lys Pro Glu Arg 
435 440 445 

Ser Tyr Leu Glu He Thr Pro Ser Arg Val He Leu Asp Gly Gly Asp 
450 455 460 

Arg Leu Val Leu Pro Cys Asn Gin Ser Val Val Val Gly Ser Trp Gly 
465 470 475 480 

Leu Glu Val Ser Val Ser Ala Asn Ala Asn Val Thr Val Thr lie Gin 

485 490 495 

Gly Ser He Ala Phe Val He Leu He His Leu Tyr Lys Lys Pro Ala 

500 505 510 

Pro Phe Gin Arg His His Leu Gly Phe Tyr lie Ala Asn Ser Glu Gly 
515 520 . 525 

Leu Ser Ser Asn Cys His Gly Leu Leu Gly Gin Phe Leu Asn Gin Asp 
530 535 540 

Ala Arg Leu Thr Glu Asp Pro Ala Gly Pro Ser Gin Asn Leu Thr His 
545 550 555 560 

Pro Leu Leu Leu Gin Val Gly Glu Gly Pro Glu Ala Val Leu Thr Val 

565 570 575 

Lys Gly His Gin Val Pro Val Val Trp Lys Gin Arg Lys He Tyr Asn 
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580 585 590 

Gly Glu Glu Gin Xaa Asp Cys Trp Phe Ala Arg Asn Met Pro Pro Asn 
595 600 605 



<210> 347 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 347 

Met Phe Tyr Lys Leu Thr Leu lie Leu Cys Glu Leu Ser Val Ala Gly 
15 10 15 

Val Thr Gin Ala Ala Ser Gin Arg Pro Leu Gin Arg Leu Pro Arg His 

20 25 30 

lie Cys Ser Gin Arg Ser Ser Ser Trp Glu Met Pro Pro Gin Gly Pro 
35 40 45 

Ala Pro Asp His Val Gly Arg Ala 
50 55 



<210> 348 
<211> 540 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (137) 

<223> Xaa equals any amino acid 
<400> 348 

Met Val Arg Thr Asp Gly His Thr Leu Ser Glu Lys Arg Asn Tyr Gin 
15 10 15 

Val Thr Asn Ser Met Phe Gly Ala Ser Arg Lys Lys Phe Val Glu Gly 

20 25 30 

Val Asp Ser Asp Tyr His Asp Glu Asn Met Tyr Tyr Ser Gin Ser Ser 
35 40 45 

Met Phe Pro His Arg Ser Glu Lys Asp Met Leu Ala Ser Pro Ser Thr 
50 55 60 

Ser Gly Gin Leu Ser Gin Phe Gly Ala Ser Leu Tyr Gly Gin Gin Ser 
65 70 75 80 

Ala Leu Gly Leu Pro Met Arg Gly Met Ser Asn Asn Thr Pro Gin Leu 

85 90 95 

Asn Arg Ser Leu Ser Gin Gly Thr Gin Leu Pro Ser His Val Thr Pro 

100 105 110 
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Thr Thr Gly Val Pro Thr Met Ser Leu His Thr Pro Pro Ser Pro Ser 
115 120 125 

Arg Gly He Leu Pro Met Asn Pro Xaa Asn Met Met Asn His Ser Gin 
130 135 140 

Val Gly Gin Gly He Gly He Pro Ser Arg Thr Asn Ser Met Ser Ser 
145 150 155 160 

Ser Gly Leu Gly Ser Pro Asn Arg Ser Ser Pro Ser He He Cys Met 

165 170 175 

Pro Lys Gin Gin Pro Ser Arg Gin Pro Phe Thr Val Asn Ser Met Ser 

180 185 190 

Gly Phe Gly Met Asn Arg Asn Gin Ala Phe Gly Met Asn Asn Ser Leu 
195 200 205 

Ser Ser Asn He Phe Asn Gly Thr Asp Gly Ser Glu Asn Val Thr Gly 
210 215 220 

Leu Asp Leu Ser Asp Phe Pro Ala Leu Ala Asp Arg Asn Arg Arg Glu 
225 230 235 240 

Gly Ser Gly Asn Pro Thr Pro Leu He Asn Pro Leu Ala Gly Arg Ala 

245 250 255 

Pro Tyr Val Gly Met Val Thr Lys Pro Ala Asn Glu Gin Ser Gin Asp 

260 265 270 

Phe Ser He His Asn Glu Asp Phe Pro Ala Leu Pro Gly Ser Ser Tyr 
275 280 285 

Lys Asp Pro Thr Ser Ser Asn Asp Asp Ser Lys Ser Asn Leu Asn Thr 
290 295 300 

Ser Gly Lys Thr Thr Ser Ser Thr Asp Gly Pro Lys Phe Pro Gly Asp 
305 310 315 320 

Lys Ser Ser Thr Thr Gin Asn Asn Asn Gin Gin Lys Lys Gly He Gin 

325 330 335 

Val Leu Pro Asp Gly Arg Val Thr Asn He Pro Gin Gly Met Val Thr 

340 345 350 

Asp Gin Phe Gly Met He Gly Leu Leu Thr Phe He Arg Ala Ala Glu 
355 360 365 

Thr Asp Pro Gly Met Val His Leu Ala Leu Gly Ser Asp Leu Thr Thr 
370 375 380 

Leu Gly Leu Asn Leu Asn Ser Pro Glu Asn Leu Tyr Pro Lys Phe Ala 
385 390 395 400 

Ser Pro Trp Ala Ser Ser Pro Cys Arg Pro Gin Asp He Asp Phe His 

405 410 415 

Val Pro Ser Glu Tyr Leu Thr Asn He His He Arg Asp Lys Leu Ala 

420 425 430 
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Ala lie Lys Leu Gly Arg Tyr Gly Glu Asp Leu Leu Phe Tyr Leu Tyr 
435 440 445 

Tyr Met Asn Gly Gly Asp Val Leu Gin Leu Leu Ala Ala Val Glu Leu 
450 455 460 

Phe Asn Arg Asp Trp Arg Tyr His Lys Glu Glu Arg Val Trp lie Thr 
465 470 475 480 

Arg Ala Pro Gly Met Glu Pro Thr Met Lys Thr Asn Thr Tyr Glu Arg 

485 490 495 

Gly Thr Tyr Tyr Phe Phe Asp Cys Leu Asn Trp Arg Lys Val Ala Lys 

500 505 510 

Glu Phe His Leu Glu Tyr Asp Lys Leu Glu Glu Arg Pro His Leu Pro 
515 520 525 

Ser Thr Phe Asn Tyr Asn Pro Ala Gin Gin Ala Phe 
530 535 540 



<210> 349 
<211> 99 
<212> PRT 

<213> Homo sapiens 
<400> 349 

Met Leu Phe Phe Leu Ser Leu Phe Leu Ser Leu Leu Leu Thr Leu Ser 
15 10 15 

Leu Pro Ser Phe Leu Pro Phe Ser Phe Phe Phe Phe Ser Leu Phe Pro 

20 25 30 

His Leu Ser Ala Cys Leu Leu Pro Ser Leu Pro Ser Pro Pro Phe Pro 
35 40 45 

Leu Pro Pro Ser Leu Pro Ser Phe Leu Pro Ser Phe Leu Pro Ser Phe 
50 55 60 

Leu Pro Ser Leu Leu Ser Pro Ser Phe Pro Ala Phe Phe Pro Ser Phe 
65 70 75 80 

Cys Gin Leu Ala Arg Arg Ser Pro Arg Lys Ser Thr Gin Met Leu Gin 

85 90 95 

Ser Thr Ser 



<210> 350 
<211> 66 
<212> PRT 

<213> Homo sapiens 
<400> 350 

Met Asn Tyr lie Phe Leu Leu Met Ala Leu Pro His Leu He Ala lie 
15 10 15 
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Ala Leu Thr Trp Gly Arg Tyr Ser Phe 

20 25 

Thr Glu Phe Gin Arg Cys Gin Val Thr 
35 40 

Val Leu Met Phe Asn Phe Glu Leu Arg 
50 55 

Leu His 
65 



PCT/US02/08276 



Ser Cys Leu Ala Asn Lys Glu 

30 

Cys Leu Leu His Thr Leu Gly 

45 

Ser He Trp Leu Glu Ser Ser 
60 



<210> 351 
<211> 72 
<212> PRT 

<213> Homo sapiens 
<400> 351 

Met Arg His Thr Cys He Val Asn He Ala Ala Ser Leu Leu Val Ala 
15 10 15 

Asn Thr Trp Phe He Val Val Ala Ala He Gin Asp Asn Arg Tyr He 

20 25 30 

Leu Cys Lys Thr Ala Cys Val Ala Ala Thr Phe Phe He His Phe Phe 
35 40 45 

Tyr Leu Ser Val Phe Phe Trp Met Leu Thr Leu Gly Pro His Ala Val 
50 55 60 

Leu Ser Pro Gly Phe His Ser Ala 
65 70 



<210> 352 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 352 

Met Pro Pro Lys Gin He Pro Leu Thr Ser Leu Ser Leu Leu Ala Leu 
15 10 15 

Leu Leu Phe Phe Phe Phe Lys He Phe Cys Leu Leu Phe Leu Phe Tyr 

20 25 30 

Pro Leu Pro Asp Glu Ser Glu His Phe 
35 40 



<210> 353 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<400> 353 

Met Leu He Ser Val Asp Ser Asn Val Pro Val Val Phe Leu Leu Leu 



224 



WO 02/076488 PCT/US02/08276 



15 10 15 

Phe He Leu Val He Leu Cys His Met Glu Cys Lys Gly His He Tyr 

20 25 30 

He Cys Val Cys Val Cys Val Tyr Met Tyr He Phe Lys Asn He 
35 40 45 



<210> 354 
<211> 121 
<212> PRT 

<213> Homo sapiens 
<400> 354 

Met His Arg Ser Glu Pro Phe Leu Lys Met Ser Leu Leu He Leu Leu 
15 10 15 

Phe Leu Gly Leu Ala Glu Ala Cys Thr Pro Arg Glu Val Asn Leu Leu 

20 25 30 

Lys Gly He He Gly Leu Met Ser Arg Leu Ser Pro Asp Glu He Leu 
35 40 45 

Gly Leu Leu Ser Leu Gin Val Leu His Glu Glu Thr Ser Gly Cys Lys 
50 55 60 

Glu Glu Val Lys Pro Phe Ser Gly Thr Thr Pro Ser Arg Lys Pro Leu 
65 70 75 80 

Pro Lys Arg Lys Asn Thr Trp Asn Phe Leu Lys Cys Ala Tyr Met Val 

85 90 95 

Met Thr Tyr Leu Phe Val Ser Tyr Asn Lys Gly Asp Trp Phe Thr Phe 

100 105 110 

Ser Ser Gin Val Leu Leu Pro Leu Leu 
115 120 



<210> 355 
<211> 116 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (46) 

<223> Xaa equals any amino acid 
<400> 355 

Met Pro Gly Gly Thr Arg Cys Arg Val Leu Leu Leu Ser Leu Thr Phe 
15 10 15 

Gly Thr Ser Met Ala Cys Gly Asn Val Gly Leu Arg Leu Cys Pro Trp 

20 25 30 

Thr Trp His Asn Trp Leu Leu Pro Pro His Leu Cys Ser Xaa Trp Pro 
35 40 45 
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Cys Arg Arg Cys Cys Trp Ala Ala Ala Thr Thr His Phe Ser Trp Pro 
50 55 60 

Pro Trp Val Arg Ser Ala Trp Gly Pro Pro Ala Ala Trp Leu Glu Ser 

65 70 75 80 

« 

Ser Gly His Pro Leu Pro Ala Val Ala Ser Cys Ser Gin Pro Pro Ala 

85 90 95 

Ser Ala Asp Ser Ser Arg Phe Ser Lys Val Pro Cys Cys Arg Arg Arg 

100 105 110 

Gly Trp Thr Arg 
115 



<210> 356 
<211> 86 
<212> PRT 

<213> Homo sapiens 
<400> 356 

Met Pro Trp His Val Cys Phe Phe Leu Ser Gly Leu Leu Phe Pro Ser 
15 10 15 

Pro Gin Thr Ser Leu Gin His Leu Cys Leu Leu Thr Ser Leu lie Leu 

20 25 30 

Gly Val Thr lie Ser Ala Tyr Glu His Ala He Asn Leu Pro Ser Leu 
35 40 45 

Gin Asn Ser Leu Leu Thr Ser His Pro Ser Val Ala Ala Leu Ser Leu 
50 55 60 

Leu Ser Ser Ser Leu Gin Gin Asn Ser Leu Lys Glu Leu Leu Ala Gly 
65 70 75 80 

His Ser Gly Ser Leu Leu 

85 



<210> 357 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 357 • 

Gly Leu Leu Tyr He Met Tyr Cys Asn He 
15 10 



<210> 358 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<400> 358 
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Met Val Lys Trp lie lie 
1 5 

Thr Leu Asn Ser Ser Thr 

20 

Asn Arg Asn Leu Ser Trp 
35 



Leu Ser Cys Leu lie Leu 

10 

Phe Tyr Ala Ala Asn Lys 
25 

Gin Ala Leu Pro Phe Thr 
40 



PCTAJS02/08276 

Lys Gly Lys Arg 
15 

Ser Ser Thr lie 
30 

His 
45 



<210> 359 
<211> 38 
<212> PRT 

<213> Homo sapiens 
<400> 359 

Met Leu Lys Leu Ala Thr lie Leu Leu Thr Leu Leu Leu Lys Asn Leu 
15 10 15 

Asp Ala Gly Leu Thr Asp Lys Leu Ser Arg Ser Asn Phe lie Thr Asp 

20 25 30 

Phe lie Leu Thr Lys Tyr 
35 



<210> 360 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<400> 360 

Met Pro Cys His Gly Leu Leu Ala Gin Gly Leu Ser Leu Ala Pro Leu 
15 10 15 

Pro Pro Trp Ala Leu Cys Cys Val Gly Val Ser Arg Ala Leu Gin Asp 

20 25 30 

lie Gin Gin His Pro Arg Pro Pro Ala Pro Cys Gin 
35 40 



<210> 361 
<211> 34 
<212> PRT 

<213> Homo sapiens 
<400> 361 

Met Gin Ala Arg Trp Phe His lie Leu Gly Met Met Met Phe lie Trp 
15 10 15 

Ser Ser Ala His Gin Tyr Lys Cys Pro Cys Tyr Ser Arg Gin Ser Gin 

20 25 30 

Glu Lys 
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<210> 362 
<211> 68 
<212> PRT 

<213> Homo sapiens 
<400> 362 

Met Val His Asn Cys Leu Leu Leu Leu Lys Phe Leu Leu Leu Phe Cys 
1*5 10 15 

Phe Pro Leu lie Ser Tyr Gin Leu Met Asn Gly Ser Leu Gin Ser Leu 

20 25 30 

Gin Arg Leu Arg Met lie Gin Asn Val Gin Cys He Val Leu Asn Lys 
35 40 45 

Gin Glu Ala Glu Phe Leu Met Gly He Ser Phe Gin He Tyr Asp Trp 
50 55 60 

Ser Leu Gly Phe 
65 



<210> 363 
<211> 162 
<212> PRT 

<213> Homo sapiens 
<400> 363 

Met Thr Ser Asn Phe Pro Phe Cys Thr Leu He Leu Gly He Ala Gin 
15 10 15 

Ala Gin Ala Cys Pro Gly Cys Pro Gly Asp Trp Pro Gly Leu Gly Ser 

20 25 30 

Gly Val Gly Glu Gly Leu His His He Arg Thr Cys Arg Thr Pro lie 
35 40 45 

Pro Cys Ser Pro Pro Ala Pro Ala Ala Ala Cys Leu Gly Ser Gly His 
50 55 60 

Ala Arg Leu Pro Cys Val Leu Arg Leu Trp Pro Val Pro Ala Asn Leu 
65 70 75 80 

■ 

Ser Ser Pro Phe Arg Leu Glu Ala Leu His Cys Ser Phe Trp Ser Ser 

85 90 95 

Pro Leu Leu Pro Ala Pro His Leu Ala Phe Phe Gly Phe Arg Asp Leu 

100 105 110 

Leu Thr Asp Phe Leu Leu Ala Ala Cys Leu Leu Thr Phe Gin Lys Thr 
115 120 125 

Pro Leu Glu Leu Pro Met Ala Val Val His Leu Leu Val Ala Thr Pro 
130 135 140 

Cys Tyr Gin Met Leu Asp Asn Leu Pro Leu Pro Ser Ala Ala Ala Asn 
145 150 155 160 

Trp Cys 



228 



WO 02/076488 PCT/US02/08276 



<210> 364 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<400> 364 

Met Leu Leu Phe Ser Ser Arg Phe lie Met Phe Leu Trp Pro Pro Val 
15 10 15 

Ser Gly Val Cys Leu Ser Phe He Arg Asp Arg Ser Phe Leu Pro Met 

20 25 30 

* 

Cys His Phe He Tyr Val Leu He Leu Cys Asn Ser He Ala Leu 
35 40 45 



<210> 365 
<211> 79 
<212> PRT 

<213> Homo sapiens 
<400> 365 

Met Thr Leu Met Cys Leu Cys Leu Ser Val Thr Val Leu His Pro Leu 
15 10 15 

Arg Ser Lys Glu Arg Leu Ser Gly Thr Phe Cys Gly Tyr Ser Ser Ser 

20 25 30 

Trp Cys Ser Pro Ala Ser Glu Ser Ser Ser Pro Gly Ser Leu Leu Thr 
35 40 45 

Cys Ala Ala Ser Gly Ser His Pro Asp Cys Pro Leu Ser Gin Arg Leu 
50 55 60 

Leu Gly Val Gin Leu Ala Ala Leu Gly Arg Pro Gin Gly Leu Phe 
65 70 75 



<210> 366 
<211> 292 
<212> PRT 

<213> Homo sapiens 
<400> 366 

Met Leu Arg Val Leu Cys Leu Leu Arg Pro Trp Arg Pro Leu Arg Ala 
15 10 15 

Arg Gly Cys Ala Ser Asp Gly Ala Ala Gly Gly Ser Glu He Gin Val 

20 25 30 

Arg Ala Leu Ala Gly Pro Asp Gin Gly He Thr Glu He Leu Met Asn 
35 40 45 

Arg Pro Ser Ala Arg Asn Ala Leu Gly Asn Val Phe Val Ser Glu Leu 
50 55 60 
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Leu Glu Thr Leu Ala Gin Leu Arg Glu Asp Arg Gin Val Arg Val Leu 
65 70 75 80 

Leu Phe Arg Ser Gly Val Lys Gly Val Phe Cys Ala Gly Ala Asp Leu 

85 90 95 

Lys Glu Arg Glu Gin Met Ser Glu Ala Glu Val Gly Val Phe Val Gin 

100 105 110 

Arg Leu Arg Gly Leu Met Asn Asp lie Ala Ala Phe Pro Ala Pro Thr 
115 120 125 

lie Ala Ala Met Asp Gly Phe Ala Leu Gly Gly Gly Leu Glu Leu Ala 
130 135 140 

Leu Ala Cys Asp Leu Arg Val Ala Ala Ser Ser Ala Val Met Gly Leu 
145 150 155 160 

lie Glu Thr Thr Arg Gly Leu Leu Pro Gly Ala Gly Gly Thr Gin Arg 

165 170 175 

Leu Pro Arg Cys Leu Gly Val Ala Leu Ala Lys Glu Leu lie Phe Thr 

180 185 190 

Gly Arg Arg Leu Ser Gly Thr Glu Ala His Val Leu Gly Leu Val Asn 
195 200 205 

His Ala Val Ala Gin Asn Glu Glu Gly Asp Ala Ala Tyr Gin Arg Ala 
210 215 220 

Arg Ala Leu Ala Gin Glu lie Leu Pro Gin Ala Pro lie Ala Val Arg 
225 230 235 240 

Leu Gly Lys Val Ala He Asp Arg Gly Thr Glu Val Asp He Ala Ser 

245 250 255 

Gly Met Ala He Glu Gly Met Cys Tyr Ala Gin Asn He Pro Thr Arg 

260 265 270 

Asp Arg Leu Glu Gly Met Ala Ala Phe Arg Glu Lys Arg Thr Pro Lys 
275 280 285 

Phe Val Gly Lys 
290 



<210> 367 
<211> 121 
<212> PRT 

<213> Homo sapiens 
<400> 367 

Met He Met Ala Gin Lys He Gly Gly Leu Thr Trp Trp Ala He Met 
15 10 15 

Phe He He Leu Phe Glu lie Thr Gly Thr Ser Ser Ser Phe Leu Arg 

20 25 30 

He Asn Ala Leu Pro His Phe Ser Met Asn Arg Cys Gly Glu Ala Tyr 
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35 40 45 

Phe Pro Phe Ser Tyr Leu Tyr Thr Ser Leu Gin Lys Gin Phe Leu Met 
50 55 60 

Lys Val Ser Gly lie Val Lys Asn Leu Arg Gly Asn Asp Asp Trp Arg 
65 70 75 80 

Cys Phe Gly Val Phe Phe Cys He His Phe Leu Met Arg Lys Val Leu 

85 90 95 

Asn Val Val Gin Val Arg Pro Asn Tyr Tyr Leu Thr He He Gly Arg 

100 105 110 

Phe Tyr Val Ser Val Lys Val Phe Lys 
115 120 



<210> 368 

<211> 50 

<212> PRT 

<213> Homo sapiens 

<400> 368 

Met Tyr He Tyr Leu He His Leu Cys Met Cys Val Tyr He Tyr He 
15 10 15 

Tyr He Leu Leu He He Tyr Thr Leu Asp Pro Glu Pro Pro Ser Trp 

20 25 30 

Ser Pro Lys Leu Asp Ser His Leu Ser Leu Arg Gin Pro Ser Asn Asp 
35 40 45 

Arg Phe 
50 



<210> 369 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (11) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (34) 

<223> Xaa equals any amino acid 
<400> 369 

Met Val Leu His Cys He Ala Trp Leu Gin Xaa Gly He Ser Phe Leu 
15 10 15 

Phe Leu Phe Leu Cys Val lie Ala He Gly Ala Thr Asn Phe Ala Ser 

20 25 30 
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Pro Xaa Phe Tyr Lys Leu Val Ser Ser Gly Val Ala 
35 40 



<210> 370 
<211> 89 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE ) 
<222> (12) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (13) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (72) 

<223> Xaa equals any amino acid 
<400> 370 

Met Ser Gly Gly Leu Ser Phe Leu Leu Leu Val Xaa Xaa Gly Thr Gin 
15 10 15 

Ser Pro Leu His Leu Ala Gly Ser Cys Pro Gly Gin Thr His Leu Ser 

20 25 30 

Phe Pro Leu Gly Gin Asp Arg Gly Gin Gin Leu Gin Gin Lys Gin Gin 
35 40 45 

Asp Leu Glu Gin Glu Gly Leu Glu Ala Thr Gin Gly Leu Leu Ala Gly 
50 55 60 

Glu Trp Ala Pro Pro Leu Trp Xaa Leu Gly Ser Leu Phe Gin Ala Phe 
65 70 75 80 

Val Lys Arg Glu Ser Gin Ala Tyr Ala 

85 



<210> 371 
<211> 508 
<212> PRT 

<213> Homo sapiens 
<400> 371 

Met Asp Pro Lys Leu Gly Arg Met Ala Ala Ser Leu Leu Ala Val Leu 
15 10 15 

Leu Leu Leu Leu Leu Glu Arg Gly Met Phe Ser Ser Pro Ser Pro Pro 

20 25 30 

Pro Ala Leu Leu Glu Lys Val Phe Gin Tyr lie Asp Leu His Gin Asp 
35 40 45 



232 



WO 02/076488 PCT/US02/08276 



Glu Phe Val Gin Thr Leu Lys Glu Trp Val Ala lie Glu Ser Asp Ser 
50 55 60 

Val Gin Pro Val Pro Arg Phe Arg Gin Glu Leu Phe Arg Met Met Ala 
65 70 75 80 

Val Ala Ala Asp Thr Leu Gin Arg Leu Gly Ala Arg Val Ala Ser Val 

85 90 95 

Asp Met Gly Pro Gin Gin Leu Pro Asp Gly Gin Ser Leu Pro lie Pro 

100 105 110 

Pro Val He Leu Ala Glu Leu Gly Ser Asp Pro Thr Lys Gly Thr Val 
115 120 125 

Cys Phe Tyr Gly His Leu Asp Val Gin Pro Ala Asp Arg Gly Asp Gly 
130 135 140 

Trp Leu Thr Asp Pro Tyr Val Leu Thr Glu Val Asp Gly Lys Leu Tyr 
145 150 155 160 

Gly Arg Gly Ala Thr Asp Asn Lys Gly Pro Val Leu Ala Trp lie Asn 

165 170 175 

Ala Val Ser Ala Phe Arg Ala Leu Glu Gin Asp Leu Pro Val Asn He 

180 185 190 

Lys Phe He He Glu Gly Met Glu Glu Ala Gly Ser Val Ala Leu Glu 
195 200 205 

Glu Leu Val Glu Lys Glu Lys Asp Arg Phe Phe Ser Gly Val Asp Tyr 
210 215 220 

He Val He Ser Asp Asn Leu Trp He Ser Gin Arg Lys Pro Ala He 
225 230 235 240 

Thr Tyr Gly Thr Arg Gly Asn Ser Tyr Phe Met Val Glu Val Lys Cys 

245 250 255 

Arg Asp Gin Asp Phe His Ser Gly Thr Phe Gly Gly He Leu His Glu 

260 265 270 

Pro Met Ala Asp Leu Val Ala Leu Leu Gly Ser Leu Val Asp Ser Ser 
275 280 285 

Gly His He Leu Val Pro Gly He Tyr Asp Glu Val Val Pro Leu Thr 
290 295 300 

Glu Glu Glu lie Asn Thr Tyr Lys Ala He His Leu Asp Leu Glu Glu 
305 310 315 320 

Tyr Arg Asn Ser Ser Arg Val Glu Lys Phe Leu Phe Asp Thr Lys Glu 

325 330 335 

Glu He Leu Met His Leu Trp Arg Tyr Pro Ser Leu Ser He His Gly 

340 345 350 

He Glu Gly Ala Phe Asp Glu Pro Gly Thr Lys Thr Val He Pro Gly 
355 360 365 

Arg Val He Gly Lys Phe' Ser He Arg Leu Val Pro His Ket Asn Val 
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370 375 380 

Ser Ala Val Glu Lys Gin Val Thr Arg His Leu Glu Asp Val Phe Ser 
385 390 395 . 400 

Lys Arg Asn Ser Ser Asn Lys Met Val Val Ser Met Thr Leu Gly Leu 

405 410 415 

His Pro Trp lie Ala Asn lie Asp Asp Thr Gin Tyr Leu Ala Ala Lys 

420 425 430 

Arg Ala He Arg Thr Val Phe Gly Thr Glu Pro Asp Met He Arg Asp 
435 440 445 

Gly Ser Thr He Pro He Ala Lys Met Phe Gin Glu He Val His Lys 
450 455 460 

Ser Val Val Leu He Pro Leu Gly Ala Val Asp Asp Gly Glu His Ser 
465 470 475 480 

Gin Asn Glu Lys He Asn Arg Trp Asn Tyr He Glu Gly Thr Lys Leu 

485 490 495 

Phe Ala Ala Phe Phe Leu Glu Met Ala Gin Leu His 

500 505 



<210> 372 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (69) 

<223> Xaa equals any amino acid 
<400> 372 

Met Thr Gly Gin He Pro Arg Leu Ser Lys Val Asn Leu Phe Thr Leu 
15 10 15 

Leu Ser Leu Trp Met Glu Leu Phe Pro .Ala Glu Ala Gin Arg Gin Lys 

20 25 30 

Ser Gin Lys Asn Glu Glu Gly Lys His Gly Pro Leu Gly Asp Asn Glu 
35 40 45 

Glu Arg Thr Arg Val Ser Thr Asp Lys Arg Gin Asp Tyr Trp Glu Gin 
50 55 60 

Leu Arg Cys Leu Xaa Glu Arg Phe Thr He Thr Ala Gly 
65 70 75 



<210> 373 
<211> 44 
<212> PRT 

<213> Homo sapiens 
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<400> 373 

Met Arg Leu Arg Asn Gly Thr Val Ala Thr Ala Leu Ala Phe He Thr 
15 10 15 

Ser Phe Leu Thr Leu Ser Trp Tyr Thr Thr Trp Gin Asn Gly Lys Gly 

20 25 30 

Lys Glu Asn Asp Ser Glu Asn Val His Glu Met Tyr 
35 40 



<210> 374 
<211> 327 
<212> PRT 
<213> Homo sapiens 

<400> 374 

Met Ala Cys Arg Lys Leu Ala Val Ala His Pro Leu Leu Leu Leu Arg 
15 10 15 

His Leu Pro Met He Ala Ala Leu Leu His Gly Arg Thr His Leu Asn 

20 25 30 

9 

Phe Gin Glu Phe Arg Gin Gin Asn His Leu Ser Cys Phe Leu His Val 
35 40 45 

Leu Gly Leu Leu Glu Leu Leu Gin Pro His Val Phe Arg Ser Glu His 
50 55 60 

Gin Gly Ala Leu Trp Asp Cys Leu Leu Ser Phe He Arg Leu Leu Leu 
65 70 75 80 

Asn Tyr Arg Lys Ser Ser Arg His Leu Ala Ala Phe He Asn Lys Phe 

85 90 95 

Val Gin Phe He His Lys Tyr lie Thr Tyr Asn Ala Pro Ala Ala He 

100 105 110 

Ser Phe Leu Gin Lys His Ala Asp Pro Leu His Asp Leu Ser Phe Asp 
115 120 125 

Asn Ser Asp Leu Val Met Leu Lys Ser Leu Leu Ala Gly Leu Ser Leu 
130 135 140 

Pro Ser Arg Asp Asp Arg Thr Asp Arg Gly Leu Asp Glu Glu Gly Glu 
145 150 155 160 

Glu Glu Ser Ser Ala Gly Ser Leu Pro Leu Val Ser Val Ser Leu Phe 

165 170 175 

Thr Pro Leu Thr Ala Ala Glu Met Ala Pro Tyr Met Lys Arg Leu Ser 

180 185 ■ 190 

Arg Gly Gin Thr Val Glu Asp Leu Leu Glu Val Leu Ser Asp He Asp 
195 200 205 

Glu Met Ser Arg Arg Arg Pro Glu He Leu Ser Phe Phe Ser Thr Asn 
210 215 220 

Leu Gin Arg Leu Met Ser Ser Ala Glu Glu Cys Cys Arg Asn Leu Ala 
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225 230 235 240 

Phe Ser Leu Ala Leu Arg Ser Met Gin Asn Ser Pro Ser He Ala Ala 

245 250 255 

Ala Phe Leu Pro Thr Phe Met Tyr Cys Leu Gly Ser Gin Asp Phe Glu 

260 265 270 

Val Val Gin Thr Ala Leu Arg Asn Leu Pro Glu Tyr Ala Leu Leu Cys 
275 280 285 

Gin Glu His Ala Ala Val Leu Leu His Arg Ala Phe Leu Val Gly Met 
290 295 300 

Tyr Gly Gin Met Asp Pro Ser Ala Gin He Ser Glu Ala Leu Arg He 
305 310 315 320 

Leu His Met Glu Ala Val Met 

325 



<210> 375 

<211> 91 

<212> PRT 

<213> Homo sapiens 

<400> 375 

Met Gly Asp Lys Leu Gly Met Ala Arg Ala Pro Ser Val Ala Leu Ala 
15 10 15 

Gin Leu Trp Leu He Cys Leu Cys Pro Glu Ser Leu Ala Ser Phe Val 

20 25 30 

Gin Ala Val Pro Trp Lys Val Leu Gin Pro Ser Ser Asn Arg Ser Thr 
35 40 45 

Asp Cys Ser Pro His Met Arg Pro Thr Cys Glu Thr Leu Gly Ser Arg 
50 55 60 

Lys Ala Gin Asp Leu Val Leu Asp Thr Met Cys Leu Ser Thr Asp Asp 
65 70 75 80 

Cys Gin Gly Leu He Cys Arg Gly His Arg Ser 

85 90 



<210> 376 
<211> 243 
<212> PRT 
<213> Homo sapiens 

<400> 376 

Met Gly Thr Leu Pro Trp Leu Leu Ala Phe Phe He Leu Gly Leu Gin 
15 10 15 

Ala Trp Asp Thr Pro Thr He Val Ser Arg Lys Glu Trp Gly Ala Arg 

20 25 30 

Pro Leu Ala Cys Arg Ala Leu Leu Thr Leu Pro Val Ala Tyr lie He 
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35 40 45 

Thr Asp Gin Leu Pro Gly Met Gin Cys Gin Gin Gin Ser Val Cys Ser 
50 55 60 

Gin Met Leu Arg Gly Leu Gin Ser His Ser Val Tyr Thr lie Gly Trp 
65 70 75 80 

Cys Asp Val Ala Tyr Asn Phe Leu Val Gly Asp Asp Gly Arg Val Tyr 

85 90 95 

Glu Gly Val Gly Trp Asn He Gin Gly Leu His Thr Gin Gly Tyr Asn 

100 105 110 

Asn He Ser Leu Gly He Ala Phe Phe Gly Asn Lys He Ser Ser Ser 
115 120 125 

Pro Ser Pro Ala Ala Leu Ser Ala Ala Glu Gly Leu He Ser Tyr Ala 
130 135 140 

He Gin Lys Gly His Leu Ser Pro Arg Tyr He Gin Pro Leu Leu Leu 
145 150 155 160 

Lys Glu Glu Thr Cys Leu Asp Pro Gin His Pro Val Met Pro Arg Lys 

165 170 175 

Val Cys Pro Asn He He Lys Arg Ser Ala Trp Glu Ala Arg Glu Thr 

180 185 190 

His Cys Pro Lys Met Asn Leu Pro Ala Lys Tyr Val lie lie He His 
195 200 205 

Thr Ala Gly Thr Ser Cys Thr Val Ser Thr Asp Cys Gin Thr Val Val 
210 215 220 

Arg Asn He Gin Ser Phe His Met Asp Thr Arg Asn Phe Cys Asp He 
225 230 235 240 

Gly Tyr Gin 



<210> 377 
<211> 80 
<212> PRT 

<213> Homo sapiens 
<400> 377 

Met Lys Leu Ser Gly Met Phe Leu Leu Leu Ser Leu Ala Leu Phe Cys 
15 10 15 

Phe Leu Thr Gly Val Phe Ser Gin Gly Gly Gin Val Asp Cys Gly Glu 

20 25 30 

Phe Gin Asp Thr Lys Val Tyr Cys Thr Arg Glu Ser Asn Pro His Cys 
35 40 45 

Gly Ser Asp Gly Gin Thr Tyr Gly Asn Lys Cys Ala Phe Cys Lys Ala 
50 55 60 
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lie Val Lys Ser Gly Gly Lys lie Ser Leu Lys His Pro Gly Lys Cys 
65 ** 70 75 80 



<210> 378 
<211> 301 
<212> PRT 

<213> Homo sapiens 
<400> 378 

Met Ala Arg His Gly Leu Pro Leu Leu Pro Leu Leu Ser Leu Leu Val 
15 10 15 

Gly Ala Trp Leu Lys Leu Gly Asn Gly Gin Ala Thr Ser Met Val Gin 

20 25 30 

Leu Gin Gly Gly Arg Phe Leu Met Gly Thr Asn Ser Pro Asp Ser Arg 
35 40 45 

Asp Gly Glu Gly Pro Val Arg Glu Ala Thr Val Lys Pro Phe Ala He 
50 55 60 

Asp He Phe Pro Val Thr Asn Lys Asp Phe Arg Asp Phe Val Arg Glu 
65 70 75 80 

Lys Lys Tyr Arg Thr Glu Ala Glu Met Phe Gly Trp Ser Phe Val Phe 

85 90 95 

Glu Asp Phe Val Ser Asp Glu Leu Arg Asn Lys Ala Thr Gin Pro Met 

100 105 110 

Lys Ser Val Leu Trp Trp Leu Pro Val Glu Lys Ala Phe Trp Arg Gin 
115 120 125 

Pro Ala Gly Pro Gly Ser Gly He Arg Glu Arg Leu Glu His Pro Val 
130 135 140 

Leu His Val Ser Trp Asn Asp Ala Arg Ala Tyr Cys Ala Trp Arg Gly 
145 150 155 160 

Lys Arg Leu Pro Thr Glu Glu Glu Trp Glu Phe Ala Ala Arg Gly Gly 

165 170 175 

Leu Lys Gly Gin Val Tyr Pro Trp Gly Asn Trp Phe Gin Pro Asn Arg 

180 . 185 190 

Thr Asn Leu Trp Gin Gly Lys Phe Pro Lys Gly Asp Lys Ala Glu Asp 
195 200 205 

Gly Phe His Gly Val Ser Pro Val Asn Ala Phe Pro Ala Gin Asn Asn 
210 215 220 

Tyr Gly Leu Tyr Asp Leu Leu Gly Asn Val Trp Glu Trp Thr Ala Ser 
225 230 235 240 

Pro Tyr Gin Ala Ala Glu Gin Asp Met Arg Val Leu Arg Gly Ala Ser 

245 250 255 
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Trp lie Asp Thr Ala Asp Gly Ser Ala Asn His Arg Ala Arg Val Thr 

260 265 270 

Thr Arg Met Gly Asn Thr Pro Asp Ser Ala Ser Asp Asn Leu Gly Phe 
275 280 285 

Arg Cys Ala Ala Asp Ala Gly Arg Pro Pro Gly Glu Leu 
290 295 300 



<210> 379 
<211> 438 
<212> PRT 
<213> Homo sapiens 

<400> 379 

Met Pro Cys Thr Cys Thr Trp Arg Asn Trp Arg Gin Trp lie Arg Pro 
15 10 15 

Leu Val Ala Val He Tyr Leu Val Ser He Val Val Ala Val Pro Leu 

20 25 30 

Cys Val Trp Glu Leu Gin Lys Leu Glu Val Gly He His Thr Lys Ala 
35 40 45 

Trp Phe He Ala Gly He Phe Leu Leu Leu Thr He Pro He Ser Leu 
50 55 60 

Trp Val He Leu Gin His Leu Val His Tyr Thr Gin Pro Glu Leu Gin 
65 70 75 80 

Lys Pro He He Arg He Leu Trp Met Val Pro He Tyr Ser Leu Asp 

85 90 95 

Ser Trp He Ala Leu Lys Tyr Pro Gly He Ala He Tyr Val Asp Thr 

100 105 110 

Cys Arg Glu Cys Tyr Glu Ala Tyr Val He Tyr Asn Phe Met Gly Phe 
115 120 125 

Leu Thr Asn Tyr Leu Thr Asn Arg Tyr Pro Asn Leu Val Leu lie Leu 
130 135 140 

Glu Ala Lys Asp Gin Gin Lys His Phe Pro Pro Leu Cys Cys Cys Pro 
145 150 155 160 

Pro Trp Ala Met Gly Glu Val Leu Leu Phe Arg Cys Lys Leu Gly Val 

165 170 175 

Leu Gin Tyr Thr Val Val Arg Pro Phe Thr Thr He Val Ala Leu He 

180 185 190 

Cys Glu Leu Leu Gly He Tyr Asp Glu Gly Asn Phe Ser Phe Ser Asn 
195 200 205 

Ala Trp Thr Tyr Leu Val He He Asn Asn Met Ser Gin Leu Phe Ala 

210 215 220 

i 

Met Tyr Cys Leu Leu Leu Phe Tyr Lys Val Leu Lys Glu Glu Leu Ser 
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225 230 235 240 

Pro lie Gin Pro Val Gly Lys Phe Leu Cys Val Lys Leu Val Val Phe 

245 250 255 

Val Ser Phe Trp Gin Ala Val Val lie Ala Leu Leu Val Lys Val Gly 

260 265 270 

Val He Ser Glu Lys His Thr Trp Glu Trp Gin Thr Val Glu Ala Val 
275 280 285 

Ala Thr Gly Leu Gin Asp Phe He He Cys He Glu Met Phe Leu Ala 
290 295 300 

Ala He Ala His His Tyr Thr Phe Ser Tyr Lys Pro Tyr Val Gin Glu 
305 310 315 320 

Ala Glu Glu Gly Ser Cys Phe Asp Ser Phe Leu Ala Met Trp Asp Val 

325 330 335 

Ser Asp He Arg Asp Asp He Ser Glu Gin Val Arg His Val Gly Arg 

'340 345 350 

Thr Val Arg Gly His Pro Arg Lys Lys Leu Phe Pro Glu Asp Gin Asp 
355 360 365 

Gin Asn Glu His Thr Ser Leu Leu Ser Ser Ser Ser Gin Asp Ala He 
370 375 380 

Ser He" Ala Ser Ser Met Pro Pro Ser Pro Met Gly His Tyr Gin Gly 
385 390 395 400 

Phe Gly His Thr Val Thr Pro Gin Thr Thr Pro Thr Thr Ala Lys lie 

405 410 415 

Ser Asp Glu He Leu Ser Asp Thr He Gly Glu Lys Lys Glu Pro Ser 

420 425 430 

Asp Lys Ser Val Asp Ser 
435 



<210> 380 
<211> 107 
<212> PRT 

<213> Homo sapiens 
<400> 380 

Met Val Arg Tyr Thr Tyr Ser Met Leu Ser Val He Gly He Ser Tyr 
15 10 15 

Ala Val Leu Thr Trp Leu Ser Gin Thr Leu Trp Met Pro He Tyr Pro 

20 25 30 

Leu Cys Val Leu Ala Glu Ala Phe Ala He Tyr Gin Ser Leu Pro Tyr 
35 40 45 

Phe Glu Ser Phe Gly Thr Tyr Ser Thr Lys Leu Pro Phe Asp Leu Ser 
50 55 60 
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lie Tyr Phe Pro Tyr Val Leu Lys lie Tyr Leu Met Met Leu Phe lie 
65 70 75 80 

Gly Met Tyr Phe Thr Tyr Ser His Leu Tyr Ser Glu Arg Arg Asp lie 

85 90 95 

Leu Gly lie Phe Pro lie Lys Lys Lys Lys Met 

100 105 



<210> 381 
<211> 234 
<212> PRT 

<213> Homo sapiens 
<400> 381 

Met Arg lie Arg Phe Thr Ser Pro His Pro Lys Asp Phe Pro Asp Glu 
15 10 15 

Val Leu Gin Leu lie His Glu Arg Asp Asn He Cys Lys Gin He His 

20 25 30 

Leu Pro Ala Gin Ser Gly Ser Ser Arg Val Leu Glu Ala Met Arg Arg 
35 40 45 

Gly Tyr Ser Arg Glu Ala Tyr Val Glu Leu Val His His He Arg Glu 
50 55 60 

Ser He Pro Gly Val Ser Leu Ser Ser Asp Phe He Ala Gly Phe Cys 
65 .70 75 80 

Gly Glu Thr Glu Glu Asp His Val Gin Thr Val Ser Leu Leu Arg Glu 

85 90 95 

Val Gin Tyr Asn Met Gly Phe Leu Phe Ala Tyr Ser Met Arg Gin Lys 

100 105 110 

Thr Arg Ala Tyr His Arg Leu Lys Asp Asp Val Pro Glu Glu Val Lys 
115 120 125 

Leu Arg Arg Leu Glu Glu Leu He Thr He Phe Arg Glu Glu Ala Thr 
130 135 140 

Lys Ala Asn Gin Thr Ser Val Gly Cys Thr Gin Leu Val Leu Val Glu 
145 150 155 160 

Gly Leu Ser Lys Arg Ser Ala Thr Asp Leu Cys Gly Arg Asn Asp Gly 

165 170 175 

Asn Leu Lys Val He Phe Pro Asp Ala Glu Met Glu Asp Val Asn Asn 

180 185 190 

Pro Gly Leu Arg Val Arg Ala Gin Pro Gly Asp Tyr Val Leu Val Lys 
195 200 205 

He Thr Ser Ala Ser Ser Gin Thr Leu Arg Gly His Val Leu Cys Arg 
210 215 220 

Thr Thr Leu Arg Asp Ser Ser Ala Tyr Cys 
225 230 
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<210> 382 
<211> 470 
<212> PRT 

<213> Homo sapiens 
<400> 382 

Met Trp Phe Thr Tyr Leu Leu Leu Tyr Leu His Ser Val Arg Ala Tyr 
1 5 10 . 15 

Ser Ser Arg Gly Ala Gly Leu Leu Leu Leu Leu Gly Gin Val Ala Asp 

20 25 30 

Gly Leu Cys Thr Pro Leu Val Gly Tyr Glu Ala Asp Arg Ala Ala Ser 
35 40 * 45 

Cys Cys Ala Arg Tyr Gly Pro Arg Lys Ala Trp His Leu Val Gly Thr 
50 55 60 

Val Cys Val Leu Leu Ser Phe Pro Phe lie Phe Ser Pro Cys Leu Gly 
65 70 75 80 

Cys Gly Ala Ala Thr Pro Glu Trp Ala Ala Leu Leu Tyr Tyr Gly Pro 

85 90 95 

Phe He Val He Phe Gin Phe Gly Trp Ala Ser Thr Gin He Ser His 

100 105 110 

Leu Ser Leu He Pro Glu Leu Val Thr Asn Asp His Glu Lys Val Glu 
115 120 125 

Leu Thr Ala Leu Arg Tyr Ala Phe Thr Val Val Ala Asn He Thr Val 
130 135 140 

Tyr Gly Ala Ala Trp Leu Leu Leu His Leu Gin Gly Ser Ser Arg Val 
145 150 155 160 

Glu Pro Thr Gin Asp He Ser He Ser Asp Gin Leu Gly Gly Gin Asp 

165 170 175 

Val Pro Val Phe Arg Asn Leu Ser Leu Leu Val Val Gly Val Gly Ala 

180 185 190 

Val Phe Ser Leu Leu Phe His Leu Gly Thr Arg Glu Arg Arg Arg Pro 
195 200 205 

His Ala Glu Glu Pro Gly Glu His Thr Pro Leu Leu Ala Pro Ala Thr 
210 215 220 

Ala Gin Pro Leu Leu Leu Trp Lys His Trp Leu Arg Glu Pro Ala Phe 
225 230 235 240 

Tyr Gin Val Gly He Leu Tyr Met Thr Thr Arg Leu He Val Asn Leu 

245 250 255 

Ser Gin Thr Tyr Met Ala Met Tyr Leu Thr Tyr Ser Leu His Leu Pro 

260 265 270 

Lys Lys Phe He Ala Thr He Pro Leu Val Met Tyr Leu Ser Gly Phe 
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Leu Ser Ser Phe Leu Met Lys Pro lie Asn Lys Cys lie Gly Arg Asn 
290 295 300 

Met Thr Tyr Phe Ser Gly Leu Leu Val lie Leu Ala Phe Ala Ala Trp 
305 310 315 320 

Val Ala Leu Ala Glu Gly Leu Gly Val Ala Val Tyr Ala Ala Ala Val 

325 330 335 

Leu Leu Gly Ala Gly Cys Ala Thr lie Leu Val Thr Ser Leu Ala Met 

340 345 350 

Thr Ala Asp Leu lie Gly Pro His Thr Asn Ser Gly Ala Phe Val Tyr 
355 360 365 

Gly Ser Met Ser Phe Leu Asp Lys Val Ala Asn Gly Leu Ala Val Met 
370 375 380 

Ala lie Gin Ser Leu His Pro Cys Pro Ser Glu Leu Cys Cys Arg Ala 
385 390 395 400 

Cys Val Ser Phe Tyr His Trp Ala Met Val Ala Val Thr Gly Gly Val 

405 410 415 

Gly Val Ala Ala Ala Leu Cys Leu Cys Ser Leu Leu Leu Trp Pro Thr 

420 425 430 

Arg Leu Arg Arg Ser Arg Gly Gly Glu His Arg Thr Pro Ser Glu Gly 
435 440 445 

Glu Gly lie Ser Thr Ala Pro Pro Pro Cys Trp Asn Glu Thr Gin Pro 
450 455 460 

Gin Gly Gly Ala Lys Leu 
465 470 



<210> 383 
<211> 260 
<212> PRT 

<213> Homo sapiens 
<400> 383 

Met Ala Gly Ser Pro Leu Leu Trp Gly Pro Arg Ala Gly Gly Val Gly 
15 10 15 

Leu Leu Val Leu Leu Leu Leu Gly Leu Phe Arg Pro Pro Pro Ala Leu 

20 25 30 

Cys Ala Arg Pro Val Lys Glu Pro Arg Gly Leu Ser Ala Ala Ser Pro 
35 40 45 

Pro Leu Ala Glu Thr Gly Ala Pro Arg Arg Phe Arg Arg Ser Val Pro 
50 55 60 

Arg Gly Glu Ala Ala Gly Ala Val Gin Asp Leu Ala Arg Ala Leu Ala 
65 70 75 80 
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His Leu Leu Glu Ala Glu Arg Gin Glu Arg Ala Arg Ala Glu Ala Gin 

85 90 95 

Glu Ala Glu Asp Gin Gin Ala Arg Val Leu Ala Gin Leu Leu Arg Val 

100 105 110 

Trp Gly Ala Pro Arg Asn Ser Asp Pro Ala Leu Gly Leu Asp Asp Asp 
115 120 125 

Pro Asp Ala Pro Ala Ala Gin Leu Ala Arg Ala Leu Leu Arg Ala Arg 
130 135 140 

Leu Asp Pro Ala Ala Leu Ala Ala Gin Leu Val Pro Ala Pro Val Pro 
145 150 155 160 

Ala Ala Ala Leu Arg Pro Arg Pro Pro Val Tyr Asp Asp Gly Pro Ala 

165 170 175 

Gly Pro Asp Ala Glu Glu Ala Gly Asp Glu Thr Pro Asp Val Asp Pro 

180 185 190 

Glu Leu Leu Arg Tyr Leu Leu Gly Arg lie Leu Ala Gly Ser Ala Asp 
195 200 205 

Ser Glu Gly Val Ala Ala Pro Arg Arg Leu Arg Arg Ala Ala Asp His 
210 215 220 

Asp Val Gly Ser Glu Leu Pro Pro Glu Gly Val Leu Gly Ala Leu Leu 
225 230 235 240 

Arg Val Lys Arg Leu Glu Thr Pro Ala Pro Gin Val Pro Ala Arg Arg 

245 250 255 

Leu Leu Pro Pro 

260 



<210> 384 

<211> 95 

<212> PRT 

<213> Homo sapiens 

<400> 384 

Met His Leu Cys He Cys Ala Val Trp Val Leu Val Ala Leu Leu Arg 
15 10 15 

Met His Gly Ala Ser Pro Ala Gin Thr Ser Gly Thr Arg Ser Gly Asn 

20 25 30 

Gly Gly Cys Arg Arg His Gly Ala Gly Gin Gly Arg Gly Ala Ala Thr 
35 40 45 

Gin Pro Leu Arg Pro Pro Arg Gly Thr Ala Ser Gly Gin Leu Met Ala 
50 55 60 

Leu Leu Ser Ala Leu Leu Pro Arg Leu Ser Gly Ser Ser Thr Pro Met 
65 70 75 80 

Met Ala His Gly Arg Pro Ala Pro Pro Gin Trp Ser Arg Val Ser 

85 90 95 
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<210> 385 
<211> 130 
<212> PRT 

<213> Homo sapiens 
<400> 385 

Met Glu Thr Leu Gly Ala Leu Leu Val Leu Glu Phe Leu Leu Leu Ser 
15 10 15 

Pro Val Glu Ala Gin Gin Ala Thr Glu His Arg Leu Lys Pro Trp Leu 

20 25 30 

Val Gly Leu Ala Ala Val Val Gly Phe Leu Phe lie Val Tyr Leu Val 
35 40 45 

Leu Leu Ala Asn Arg Leu Trp Cys Ser Lys Ala Arg Ala Glu Asp Glu 
50 55 60 

Glu Glu Thr Thr Phe Arg Met Glu Ser Asn Leu Tyr Gin Asp Gin Ser 
65 70 75 80 

Glu Asp Lys Arg Glu Lys Lys Glu Ala Lys Glu Lys Glu Glu Lys Arg 

85 90 95 

Lys Lys Glu Lys Lys Thr Ala Lys Glu Gly Glu Ser Asn Leu Gly Leu 

100 105 110 

Asp Leu Glu Glu Lys Glu Pro Gly Asp His Glu Arg Ala Lys Ser Thr 
115 120 125 

Val Met 
130 



<210> 386 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 386 

Met Asn Leu Ser Phe Leu Ser Phe Phe Leu Phe Phe Tyr Leu Leu Trp 
15 10 15 

Ser Pro Ala Glu Ser Val Tyr Lys Lys Gly Met Val Lys Lys Asn Leu 

20 25 30 

Ser His Ser lie Val Glu Lys lie Lys 
35 40 



<210> 387 
<211> 113 
<212> PRT 

<213> Homo sapiens 
<220> 
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<221> SITE 
<222> (38) 

<223> Xaa equals any amino acid 
<400> 387 

Met Arg Pro Leu Leu Leu Gly Gly Tyr Trp Val Leu Cys Leu Ser Val 
15 10 15 

Leu Gly His Ala Ala Leu Tyr His Phe Trp Leu Arg Glu Glu Gly Lys 

20 25 30 

Gly Pro Pro Gin Val Xaa Ser Val Leu Ala Leu Ala Leu Pro Ala Gly 
35 40 45 

Ser Cys Ala Pro Gly Leu Pro Phe Pro Gly Pro Leu lie Pro Thr Gin 
50 55 60 

Leu Leu Phe Ala Leu Glu Trp Gly Thr Pro Thr Pro Leu Arg Asp His 
65 70 75 80 

Pro Pro His Ser Met His Ser Ala Pro Gin Asn Pro Pro Val Phe Leu 

85 90 95 

Gly Thr His Thr Cys Pro Pro Ser Trp Tyr Phe Arg Leu lie Pro Gin 

100 105 110 

Ala 



<210> 388 
<211> 161 
<212> PRT 
<213> Homo sapiens 

<400> 388 

Met Ala Leu Ser Leu Thr Leu Cys Phe Val Met Phe Trp Thr Pro Asn 
15 10 15 

Val Ser Glu Lys He Leu He Asp He He Gly Val Asp Phe Ala Phe 

20 25 30 

Ala Glu Leu Cys Val Val Pro Leu Arg He Phe Ser Phe Phe Pro Val 
35 * 40 45 

Pro Val Thr Val Arg Ala His Leu Thr Gly Trp Leu Met Thr Leu Lys 
50 55 60 

Lys Thr Phe Val Leu Ala Pro Ser Ser Val Leu Arg He He Val Leu 
65 70 75 80 

He Ala Ser Leu Val Val Leu Pro Tyr Leu Gly Val His Gly Ala Thr 

85 90 95 

Leu Gly Val Gly Ser Leu Leu Ala Gly Phe Val Gly Glu Ser Thr Met 

100 105 110 

Val Ala He Ala Ala Cys Tyr Val Tyr Arg Lys Gin Lys Lys Lys Met 
115 120 125 
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Glu Asn Glu Ser Ala Thr Glu Gly Glu Asp Ser Ala Met Thr Asp Met 
130 135 140 

Pro Pro Thr Glu Glu Val Thr Asp lie Val Glu Met Arg Glu Glu Asn 
145 150 155 160 

Glu 



<210> 389 
<211> 348 
<212> PRT 
<213> Homo sapiens 

<400> 389 

Met Asn Met Thr Gin Ala Arg Val Leu Val Ala Ala Val Val Gly Leu 
1 5 10 15 

Val Ala Val Leu Leu Tyr Ala Ser lie His Lys He Glu Glu Gly His 

20 25 30 

Leu Ala Val Tyr Tyr Arg Gly Gly Ala Leu Leu Thr Ser Pro Ser Gly 
35 40 45 

Pro Gly Tyr His He Met Leu Pro Phe He Thr Thr Phe Arg Ser Val 
50 55 60 

Gin Thr Thr Leu Gin Thr Asp Glu Val Lys Asn Val Pro Cys Gly Thr 
65 70 75 80 

Ser Gly Gly Val Met He Tyr He Asp Arg He Glu Val Val Asn Met 

85 90 95 

Leu Ala Pro Tyr Ala Val Phe Asp He Val Arg Asn Tyr Thr Ala Asp 

100 105 110 

Tyr Asp Lys Thr Leu lie Phe Asn Lys He His His Glu Leu Asn Gin 
115 120 125 

Phe Cys Ser Ala His Thr Leu Gin Glu Val Tyr He Glu Leu Phe Asp 
130 135 140 

Gin He Asp Glu Asn Leu Lys Gin Ala Leu Gin Lys Asp Leu Asn Leu 
145 150 155 160 

Met Ala Pro Gly Leu Thr He Gin Ala Val Arg Val Thr Lys Pro Lys 

165 170 175 

He Pro Glu Ala He Arg Arg Asn Phe Glu Leu Met Glu Ala Glu Lys 

180 185 190 

Thr Lys Leu Leu He Ala Ala Gin Lys Gin Lys Val Val Glu Lys Glu 
195 200 205 

Ala Glu Thr Glu Arg Lys Lys Ala Val He Glu Ala Glu Lys He Ala 
210 215 220 

Gin Val Ala Lys He Arg Phe Gin Gin Lys Val Met Glu Lys Glu Thr 
225 230 235 240 
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Glu Lys Arg lie Ser Glu He Glu 

245 

Lys Ala Lys Ala Asp Ala Glu Tyr 

260 

Ser Asn Lys His Lys Leu Thr Pro 
275 280 

Gin Ala He Ala Ser Asn Ser Lys 
290 295 

Asn Met Phe Val Asp Ser Ser Cys 
305 310 

Thr Gly Arg Glu Ser Ser Leu Pro 

325 

Gly Glu Asn Val He Gin Asn Lys 

340 



Asp Ala Ala Phe Leu Ala Arg Glu 
250 255 

Tyr Ala Ala His Lys Tyr Ala Thr 
265 270 

Glu Tyr Leu Glu Leu Lys Lys Tyr 

285 

He Tyr Phe Gly Ser Asn He Pro 

300 

Ala Leu Lys Tyr Ser Asp He Arg 
315 320 

Ser Lys Glu Ala Leu Glu Pro Ser 
330 335 

Glu Ser Thr Gly 
345 



<210> 390 . 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<400> 390 

Met Pro Leu Cys Gly Leu Tyr Cys Leu Arg He Leu Met Phe Pro Leu 
15 10 15 

Arg Ser Ala Asn Ser Val Pro Leu Gin Cys Leu Pro Pro Ser Ser Leu 

20 25 30 

Ala Asn Lys Asp Ser His Phe Arg Ala Pro Arg Lys 
35 40 



<210> 391 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<400> 391 

Met Pro Gly He Leu Ala Gly He Pro Val Lys Asp Leu Cys Leu Ser 
15 10 15 

Leu Leu Gin Gly Phe Arg Leu Leu Leu Leu Cys Val Cys Pro Gly Trp 

20 25 30 

Leu Ser Gly Trp Met Gly Gly Gin Lys Gly Ser Pro Arg He Val Asp 
35 40 45 

He Gly 
50 
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<210> 392 
<211> 206 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (143) 

<223> Xaa equals any amino acid 
<400> 392 

Met Ala Ser His Gly Leu Cys Pro Cys Leu Leu Met Gly Thr Gly Trp 
15 10 15 

i 

i 

Gly Leu Trp Thr Leu Leu Pro Asp Leu Glu Val Met Ala Gly Lys Gly 

20 25 30 

Arg Met Pro Phe Ala Gly lie Ser Val Thr Ser Gly Phe Leu Arg Ser 
35 40 45 

Leu Lys Arg Ala Pro Leu Pro His Thr Gly Ser Pro Asp Pro Arg Pro 
50 55 60 

Ser Gly lie Trp Ser Gly Val Arg Thr Thr Ser Glu Glu Ala Gly Ala 
65 70 75 80 

Thr Ser Thr Gin lie Ser Thr Ala Ala Pro Arg Phe His Ser Arg Arg 

85 90 95 

Lys Gly Pro Lys Arg Asn Leu Ala Pro Gin Leu Arg Val Leu Val His 

100 105 110 

Arg Thr Val Pro Pro Gly Gin Leu Val Tyr Ala Pro Gin Thr Val Asp 
115 120 125 

Ser Leu Arg Gly Thr Leu Leu Arg Pro Pro Ala Trp Leu Leu Xaa Gin 
130 135 140 

Val Pro Cys Phe Tyr Ser Gly Gin Pro Leu Leu Val Ser Ala Ser Val 
145 150 155 160 

Leu Cys Arg Asp Leu Met Gin Phe Leu Phe Leu Leu Lys Ser Tyr Leu 

165 170 175 

Leu Pro Phe Leu Glu Val Cys Arg lie Gly Trp Glu Gin lie Gin Arg 

180 185 190 

lie Leu Gly Ala Gly Leu Trp Arg Gin Lys Glu Gly Asn Gly 
195 200 205 



<210> 393 
<211> 75 
<212> PRT 

<213> Homo sapiens 
<400> 393 

Met Ser Arg Phe* lie Leu Asn His Leu Val Leu Ala lie Pro Leu Arg 
15 10 15 
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Val Leu Val Val Leu Trp Ala Phe Val Leu Gly Leu Ser Arg Val Met 

20 25 30 

Leu Gly Arg His Asn Val Thr Asp Val Ala Phe Gly Phe Phe Leu Gly 
35 40 45 

Tyr Met Gin Tyr Ser lie Val Asp Tyr Cys Trp Leu Ser Pro His Asn 
50 55 60 

Ala Pro Val Leu Phe Leu Leu Trp Ser Gin Arg 
65 70 75 



<210> 394 
<211> 97 
<212> PRT 

<213> Homo sapiens 
<400> 394 

Met Cys Lys Gly Leu Lys Asn Pro Glu Gly Leu Leu Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Phe Thr Asp Thr Ser Asn Ser His Cys Leu Pro Pro Tyr 

20 25 30 

Leu Ser Cys Phe Leu His Glu Arg Gin Pro Glu Leu Gin Ser Val Cys 
35 40 45 

lie Ser Ala Ala Tyr Val Leu Ala Thr Pro Pro Glu Pro Ser Phe lie 
50 55 60 

Leu Val Gly Phe Ser Glu Ala Gly Phe Ala Gin Val Ala Cys Phe Leu 
65 70 75 80 

Lys Tyr Leu Phe Cys Arg Pro Phe Thr Arg His Gly Tyr Phe Tyr Ser 

85 90 95 

Gly 



<210> 395 
<211> 187 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (167) 

<223> Xaa equals any amino acid 
<400> 395 

Met Gly Phe Phe Leu Val Leu Val Met Glu Gin He Thr Leu Ala Tyr 
15 10 15 

Lys Glu Gin Ser Gly Pro Ser Pro Leu Glu Glu Thr Arg Ala Leu Leu 

20 25 30 

Gly Thr Val Asn Gly Gly Pro Gin His Trp His Asp Gly Pro Gly Val 
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35 40 45 

Pro Gin Ala Ser Gly Ala Pro Ala Thr Pro Ser Ala Leu Arg Ala Cys 
50 55 60 

Val Leu Val Phe Ser Leu Ala Leu His Ser Val Phe Glu Gly Leu Ala 
65 70 75 80 

Val Gly Leu Gin Arg Asp Arg Ala Arg Ala Met Glu Leu Cys Leu Ala 

85 90 95 

Leu Leu Leu His Lys Gly lie Leu Ala Val Ser Leu Ser Leu Arg Leu 

100 105 110 

Leu Gin Ser His Leu Arg Ala Gin Val Val Ala Gly Cys Gly lie Leu 
115 120 125 

Phe Ser Cys Met Thr Pro Leu Gly He Gly Leu Gly Ala Ala Leu Ala 
130 135 140 

Glu Ser Ala Gly Pro Leu His Gin Leu Ala Gin Ser Val Leu Glu Gly 
145 150 155 160 

Met Ala Ala Gly Thr Phe Xaa Tyr He Thr Phe Leu Glu He Leu Leu 

165 170 175 

Phe His Pro Lys Phe Lys Gly Val Ser Arg Arg 

180 185 



<210> 396 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<400> 396 

Met Thr Leu Ser Leu Gin Leu Ala Glu Leu Val His Phe Val Cys Ala 
15 10 15 

Phe Gin Ser Gin Trp Thr Gly Val Tyr Pro Met Met Pro Pro Leu Lys 

20 25 30 

Pro Thr Glu Pro Leu Cys Phe Ala Cys Val Pro Cys Arg Val 
35 -40 45 



<210> 397 
<211> 152 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (66) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (77) 
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* <223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (81) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (84) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (86) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (87) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (93) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (103) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (110) 

<223> Xaa equals any amino acid 
<400> 397 

Met Asp His Ser Pro Thr Thr Gly Val Val Thr Val He Val He Leu 
15 10 15 

He Ala He Ala Ala Leu Gly Ala Phe Asp Pro Gly Leu Leu Val Leu 

20 25 30 

Pro Ala Ala Ala Ala His Gin Pro Val Arg Gly Arg Gly Glu His Arg 
35 40 45 

Gly Gly Trp Gly Asp Gin Gly Thr Leu Pro Ala Gly Ala Val Phe Gly 
50 55 60 

Gin Xaa Thr Val Arg Gly Glu Lys Gly Gin Ala Asp Xaa Ser Gin Thr 
65 70 75 80 

Xaa Arg Lys Xaa Thr Xaa Xaa Pro Gly Cys Lys Gly Xaa Leu Val Pro 

85 90 95 

Val Cys Lys Pro Ala Lys Xaa Gly Leu Gly Gly Ala Lys Xaa He Arg 

100 105 110 

Met Arg Cys Cys Leu Arg Gly Arg Ala Asp Thr Cys Trp His Gly Leu 
115 120 125 
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Cys Gly Phe Arg Pro Ser His Ala Leu Met Pro Gly Asp Leu Ala Val 
130 135 140 

Leu Gly Phe Pro Ser Ala Ser Arg 
145 150 



<210> 398 
<211> 340 
<212> PRT 

<213> Homo sapiens 
<400> 398 

Met Ala Leu Arg Leu Leu Arg Arg Ala Ala Arg Gly Ala Ala Ala Ala 
15 10 15 

Ala Leu Leu Arg Leu Lys Ala Ser Leu Ala Ala Asp lie Pro Arg Leu 

20 25 30 

Gly Tyr Ser Ser Ser Ser His His Lys Tyr lie Pro Arg Arg Ala Val 
35 40 45 

Leu Tyr Val Pro Gly Asn Asp Glu Lys Lys lie Lys Lys lie Pro Ser 
50 55 60 

Leu Asn Val Asp Cys Ala Val Leu Asp Cys Glu Asp Gly Val Ala Ala 
65 70 75 80 

Asn Lys Lys Asn Glu Ala Arg Leu Arg lie Val Lys Thr Leu Glu Asp 

85 90 95 

lie Asp Leu Gly Pro Thr Glu Lys Cys Val Arg Val Asn Ser Val Ser 

100 105 110 

Ser Gly Leu Ala Glu Glu Asp Leu Glu Thr Leu Leu Gin Ser Arg Val 
115 120 125 

Leu Pro Ser Ser Leu Met Leu Pro Lys Val Glu Ser Pro Glu Glu lie 
130 135 140 

Gin Trp Phe Ala Asp Lys Phe Ser Phe His Leu Lys Gly Arg Lys Leu 
145 150 155 160 

Glu Gin Pro Met Asn Leu lie Pro Phe Val Glu Thr Ala Met Gly Leu 

165 170 175 

Leu Asn Phe Lys Ala Val Cys Glu Glu Thr Leu Lys Val Gly Pro Gin 

180 185 190 

Val Gly Leu Phe Leu Asp Ala Val Val Phe Gly Gly Glu Asp Phe Arg 
195 200 205 

Ala Ser lie Gly Ala Thr Ser Ser Lys Glu Thr Leu Asp lie Leu Tyr 
210 215 220 

Ala Arg Gin Lys lie Val Val He Ala Lys Ala Phe Gly Leu Gin Ala 
225 230 235 240 

Val Asp Leu Val Tyr He Asp Phe Arg Asp Gly Ala Gly Leu Leu Arg 
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245 250 255 

Gin Ser Arg Glu Gly Ala Ala Met Gly Phe Thr Gly Lys Gin Val He 

260 265 270 

His Pro Asn Gin He Ala Val Val Gin Glu Gin Phe Ser Pro Ser Pro 
275 280 285 

Glu Lys He Lys Trp Ala Glu Glu Leu He Ala Ala Phe Lys Glu His 
290 295 300 

Gin Gin Leu Gly Lys Gly Ala Phe Thr Phe Gin Gly Ser Met He Asp 
305 310 315 320 

Met Pro Leu Leu Lys Gin Ala Gin Asn Thr Val Thr Leu Ala Thr Ser 

325 330 335 

He Lys Glu Lys 

340 



<210> 399 
<211> 64 
<212> PRT 

<213> Homo sapiens 
<400> 399 

Met Val Arg His He Arg Glu Arg Arg Arg Gin Pro Leu Ala Phe Gin 
15 10 15 

Arg Val Leu Leu Ser Leu Cys Leu Leu Glu Gly He Trp His Ser Pro 

20 ' 25 30 

Ala Ala Ala Ala Gly Gly Gly Ser His Cys Ser Ser Trp Pro Ser Leu 
35 40 45 

Tyr Thr Thr Phe Gin Arg Val Ser Leu Leu Glu Leu Asp Leu Gly Leu 
50 55 60 



<210> 400 
<211> 44 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (16) 

<223> Xaa equals any amino acid 
<400> 400 

Met Cys Leu Pro Leu Leu His Cys Thr Gly Ala Leu Trp Gly Lys Xaa 
15 10 15 

Val Leu Leu Phe Leu Tyr Cys Leu Ala Gin Ser Phe Ala Tyr Ser Arg 

20 25 30 
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His Gin Thr Val Gly Leu Val Val His Asp Tyr Trp 
35 40 



<210> 401 
<211> 221 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (184) 

<223> Xaa equals any amino acid 
<400> 401 

Met Ala Gly Gly Val Arg Pro Leu Arg Gly Leu Arg Ala Leu Cys Arg 
15 10 15 

Val Leu Leu Phe Leu Ser Gin Phe Cys lie Leu Ser Gly Gly Glu Ser 

20 25 30 

Thr Glu lie Pro Pro Tyr Val Met Lys Cys Pro Ser Asn Gly Leu Cys 
35 40 45 

Ser Arg Leu Pro Ala Asp Cys lie Asp Cys Thr Thr Asn Phe Ser Cys 
50 55 60 

Thr Tyr Gly Lys Pro Val Thr Phe Asp Cys Ala Val Lys Pro Ser Val 
65 70 75 80 

Thr Cys Val Asp Gin Asp Phe Lys Ser Gin Lys Asn Phe lie lie Asn 

85 90 95 

Met Thr Cys Arg Phe Cys Trp Gin Leu Pro Glu Thr Asp Tyr Glu Cys 

100 105 110 

Thr Asn Ser Thr Ser Cys Met Thr Val Ser Cys Pro Arg Gin Arg Tyr 
115 120 125 

Pro Ala Asn Cys Thr Val Arg Asp His Val His Cys Leu Gly Asn Arg 
130 135 140 

Thr Phe Pro Lys Met Leu Tyr Cys Asn Trp Thr Gly Gly Tyr Lys Trp 
145 150 155 160 

Ser Thr Ala Leu Ala Leu Ser lie Thr Leu Gly Gly Phe Gly Ala Asp 

165 170 175 

Arg Phe Tyr Leu Gly Gin Trp Xaa Glu Gly Leu Gly Lys Leu Phe Ser 

180 185 190 

Phe Gly Gly Leu Gly lie Trp Thr Leu lie Asp Val Leu Leu lie Gly 
195 200 205 

Val Gly Tyr Val Gly Pro Ala Asp Gly Ser Leu Tyr lie 
210 215 220 
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<210> 402 
<211> 39 
<212> PRT 

<213> Homo sapiens 
<400> 402 

Met Trp Leu Thr Gin Pro Glu Ser Leu Ser Leu Cys Val Ser Val Ser 
15 10 15 

Gin Asp Trp Ala His lie Leu Ala Leu Ser lie Thr Met Leu Trp Asp 

20 25 30 

Phe Arg Glu Phe Pro His Leu 
35 



<210> 403 
<211> 62 
<212> PRT 

<213> Homo sapiens 
<400> 403 

Met Glu Asn Val Cys Gin Ala Gly Phe Pro Ser Leu Leu His Leu Asn 
15 10 15 

He Thr Leu Thr Leu Leu Gly Leu Ala Gin Cys Tyr Leu Ala Asn Phe 

20 25 30 

Ser Ser Cys Arg Glu Gly Ser Glu His Tyr Leu Phe Phe Phe Phe Phe 
35 40 45 

Leu Leu Glu Pro Gly Leu His Lys Ala Met Ala Lys Phe Ser 
50 55 60 



<210> 404 
<211> 64 
<212> PRT 

<213> Homo sapiens 
<400> 404 

Met Val Ser Pro Leu He Ser Ala Leu Phe His Val Pro Phe Leu Trp 
15 10 15 

Leu Gly Met Phe Phe Pro His Ser Leu Ser Gly Pro Phe Pro Ser His 

20 25 30 

Leu Arg Arg Ala Ser Ser Ser Arg Lys Pro Leu Val Lys Pro Pro Arg 
35 40 45 

Ala Arg Gin Tyr Pro Pro Leu Ala Ser Ser Gly Tyr Arg Gly Arg He 
50 55 60 



<210> 405 
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<211> 62 
<212> PRT 

<213> Homo sapiens 
<400> 405 

Met Lys Asn Ser Thr Ser Leu Leu Tyr Lys Leu Phe Ser Ser Leu Ser 
15 10 15 

Val Phe He Phe Lys Phe Leu Leu Leu Phe Tyr Thr Leu His He Ala 

20 25 30 

Leu Gly Val Lys He Gin Tyr Lys Pro Leu Ala His Phe He Asp His 
35 40 45 

Ser Cys He Gin Gin Val Ser Gin Val Gin Trp Ser He Pro 
50 55 60 



<210> 406 
<211> 139 
<212> PRT 

<213> Homo sapiens 
<400> 406 

Met Ala Leu Gly He Gin Lys Arg Phe Ser Pro Glu Val Leu Gly Leu 
15 10 15 

Cys Ala Ser Thr Ala Leu Val Trp Val Val Met Glu Val Leu Ala Leu 

20 25 30 

Leu Leu Gly Leu Tyr Leu Ala Thr Val Arg Ser Asp Leu Ser Thr Phe 
35 40 45 

His Leu Leu Ala Tyr Ser Gly Tyr Lys Tyr Val Gly Met He Leu Ser 
50 55 60 

Val Leu Thr Gly Leu Leu Phe Gly Ser Asp Gly Tyr Tyr Val Ala Leu 
65 70 75 80 

Ala Trp Thr Ser Ser Ala Leu Met Tyr Phe lie Val Arg Ser Leu Arg 

85 90 95 

Thr Ala Ala Leu Gly Pro Asp Ser Met Gly Gly Pro Val Pro Arg Gin 

100 105 110 

Arg Leu Gin Leu Tyr Leu Thr Leu Gly Ala Ala Ala Phe Gin Pro Leu 
115 120 125 

He He Tyr Trp Leu Thr Phe His Leu Val Arg 
130 135 



<210> 407 
<211> 42 
<212> PRT 

<213> Homo sapiens 
<400> 407 

Met Arg Lys Glu Glu Gly He Ala His Leu Ser He Ala Phe Phe Val 
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15 10 15 

Gin Val Leu Cys Leu Tyr Gin Leu Leu Pro Val lie Leu Pro Gin Phe 

20 25 30 

Asn Leu Gly Ser Gly Lys Asn Met Asn Arg 
35 40 



<210> 408 
<211> 121 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (30) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (32) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (87) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (101) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (115) 

<223> Xaa equals any amino acid 
<400> 408 

Met Cys Ser His Ser Thr Leu lie His Leu Tyr Leu Val Leu Pro Phe 
15 10 15 

Phe Phe Leu Phe Leu Pro Ser Ser Phe Pro Phe Pro Ser Xaa Ser Xaa 

20 25 30 

Ser Ser lie Leu Pro Ser Leu Arg Leu Pro Pro Phe Phe Pro Pro Ser 
35 40 45 

Leu Phe Leu His Ser Ser Leu Pro Pro Ser Leu Ser His Pro Leu Gly 
50 55 60 

Leu Ser lie Thr Ser Ser Arg Gin Ser Phe Leu Asp Tyr His His Leu 
65 70 75 80 

Cys Thr Lys His Leu Ser Xaa Thr Leu Cys Gly Leu lie Tyr His Cys 

85 90 95 

Leu Asn lie Phe Xaa Thr Arg Ala Val Met Trp His Met Gin Val Ser 

100 105 110 
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Phe Leu Xaa lie His Trp Leu Leu Pro 
115 120 



<210> 409 
<211> 71 
<212> PRT 

<213> Homo sapiens 
<400> 409 

Met Arg lie His Phe Lys lie Leu Val Leu Val lie Tyr Phe lie Leu 
15 10 15 

Leu Gly Ser Phe Ser Asp Arg Cys Ser Leu Leu Asp Cys Lys Ser Arg 

20 25 30 

lie Gin Arg He Phe He Cys Asn He Leu Asn Leu Ser Leu Val Ser 
35 40 45 

■ 

Cys His Leu Cys Arg Tyr Ser Phe Asp Cys Leu Thr Arg Gly Lys Cys 
50 55 60 

Phe Pro Leu Ser Phe Pro Ala 
65 70 



<210> 410 
<211> 68 
<212> PRT 

<213> Homo sapiens 
<400> 410 

Met Leu Met Leu Leu Thr Leu Leu Val Leu Gly Met Val Trp Val Ala 
1 5 . 10 15 

Ser Ala He Val Asp Lys Asn Lys Ala Asn Arg Glu Ser Leu Tyr Asp 

20 25 30 

Phe Trp Glu Tyr Tyr Leu Pro Tyr Leu Tyr Ser Cys He Ser Phe Leu 
35 40 45 

Gly Val Leu Leu Leu Leu Ala Ala Gly Arg Pro Gly Gly Ala Ala Val 
50 55 60 

Leu Leu Ser Leu 
65 



<210> 411 
<211> 233 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (173) 

<223> Xaa equals any amino acid 
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<400> 411 

Met His Arg Gly Lys Leu Asp Cys Ala Gly Gly Ala Leu Leu Ser Ser 
15 10 15 

Tyr Leu lie Val Leu Met lie Leu Leu Ala Val Val lie Cys Thr Val 

20 25 30 

Ser Ala lie Met Cys Val Ser Met Arg Gly Thr lie Cys Asn Pro Gly 
35 40 45 

Pro Arg Lys Ser Met Ser Lys Leu Leu Tyr lie Arg Leu Ala Leu Phe 
50 55 60 

Phe Pro Glu Met Val Trp Ala Ser Leu Gly Ala Ala Trp Val Ala Asp 
65 70 75 80 

Gly Val Gin Cys Asp Arg Thr Val Val Asn Gly lie He Ala Thr Val 

85 90 95 

Val Val Ser Trp He He He Ala Ala Thr Val Val Ser He He He 

100 105 110 

Val Phe Asp Pro Leu Gly Gly Lys Met Ala Pro Tyr Ser Ser Ala Gly 
115 120 125 

Pro Ser His Leu Asp Ser His Asp Ser Ser Gin Leu Leu Asn Gly Leu 
130 135 140 

Lys Thr Ala Ala Thr Ser Val Trp Glu Thr Arg He Lys Leu Leu Cys 
145 150 155 160 

Cys Cys He Gly Lys Asp Asp His Thr Arg Val Ala Xaa Ser Ser Thr 

165 170 175 

Ala Glu Leu Phe Ser Thr Tyr Phe Ser Asp Thr Asp Leu Val Pro Ser 

180 185 190 

Asp He Ala Ala Gly Leu Ala Leu Leu His Gin Gin Gin Asp Asn He 
195 200 205 

Arg Asn Asn Gin Asp Leu Pro Arg Trp Ser Ala Met Pro Gin Gly Ala 
210 215 220 

Pro Arg Lys Leu He Trp Met Gin Asn 
225 230 



<210> 412 
<211> 66 
<212> PRT 

<213> Homo sapiens 
<400> 412 

Met Phe Val Glu Arg Trp Leu Pro Cys Phe Leu Val Val Ala Val Val 
15 10 15 

Val Trp Val Phe Ala Cys Gly Pro Val Glu Asp Lys Glu Asp Ser Phe 

20 25 30 
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Gly Trp Ser Ser Tyr Phe Leu Ala Ser Gly Leu Pro Pro Leu Leu Phe 
35 40 45 

Glu Ala Ser Gin Thr Arg Thr Val Arg Ala Gly Arg Leu Gly Val Phe 
50 55 60 

Val Cys 
65 



<210> 413 

<211> 90 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (29) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (30) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (65) 

<223> Xaa equals any amino acid 
<400> 413 

Met Leu Arg Cys Ser Phe Ser Ser Phe Leu Leu Cys His Thr lie Leu 
1 5 10 15 

Leu Phe Leu Gly Ser Ser Ala His Leu Leu Val Glu Xaa Xaa Val Trp 

20 25 30 

Gly Leu Tyr Glu Tyr Arg He Gly Asp Met Val Asp Gin Lys Ala Thr 
35 40 45 

Phe Cys Val Gin Lys Gin Glu Cys Leu Phe Pro Leu Gly Ser Trp Val 
50 55 60 

Xaa Arg Val Glu Gly Gly Ala Phe Ala Arg Glu Pro Pro Ser Ser Thr 
65 70 75 80 

Gin Tyr Phe Pro Val Ser Cys Leu Tyr Gin 

85 90 



<210> 414 
<211> 36 
<212> PRT 

<213> Homo sapiens 
<400> 414 

Met Gly Cys Thr Ala Leu Leu Leu Leu Phe His Leu Cys Val Pro Cys 
1.5 10 15 
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Glu Pro Tyr Gly Thr His Glu Lys Glu Leu Val Pro Gly Leu Tyr Phe 

20 25 30 

Leu Val Tyr Arg 
35 



<210> 415 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<400> 415 

Met Cys He Pro Glu Ala Leu Gly Lys Asn Ser Leu Phe Leu Ser Ser 
15 10 15 

Thr Phe Leu Trp Leu Leu Ala Phe Phe Gly Leu Trp Ser His His Ser 

20 25 30 

Tyr Leu Glu Gly Gin His Leu Gin He Cys Phe Phe Phe Thr 
35 40 45 



<210> 416 
<211> 82 
<212> PRT 

<213> Homo sapiens 
<400> 416 

Met Ala He Ser Cys Trp Ala Ser Leu Thr Val Lys Ser Leu Tyr Cys 
15 10 15 

Leu Leu Gly Phe Trp Trp Glu Ala Val He Ser Ser Asn Glu Leu Pro 

20 25 30 

Leu Pro Trp He Cys Gin Glu Ala Asp Gly Asn Leu Ala Asn Ser Gly 
35 40 45 

Arg Tyr Gin Ala Pro Ser Ser Ala Pro Val Thr Leu Phe Tyr Thr Cys 
50 55 60 

Gly Ser Thr Thr Val Cys Ser Glu Gly Gin Ser Leu Pro Leu Leu Cys 
65 70 75 80 

Phe Ser 



<210> 417 
<211> 57 
<212> PRT 

<213> Homo sapiens 



<400> 417 

Met Pro Pro His Arg Gin Thr Asp Gly Gin Met Gly Leu Pro Ala Pro 
15 10 15 

Ala Leu Trp Val Trp Gly Leu Leu Leu Ser Ser Ser Phe Gin Thr Leu 
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20 25 30 

Leu Pro Ala Phe Pro Lys Pro Pro Ala Leu Asn Leu Gly Cys Ser Thr 
35 40 45 

Arg Pro lie Pro Ser Phe Leu Lys lie 
50 55 



<210> 418 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals any amino acid 
<400> 418 

Met Arg Met Arg Val Ala Val Ala Pro Arg Pro His Gin His Leu Val 
15 10 15 

Val Ser Val Ser Trp lie Leu Ala lie Leu lie Ser Val Ser Gly Tyr 

20 25 30 

His Cys Phe His Leu Gin Phe Ser Tyr Met Val Xaa Asn lie Phe Pro 
35 40 45 

His Val Tyr Leu Ser Ser Ala Tyr Leu Leu Arg Pro Val lie Cys Ser 
50 55 60 

Asp Leu Leu Pro Val Phe Val Cys Leu His Val Cys Leu Cys Leu lie 
65 70 75 80 

Phe 



<210> 419 
<211> 80 
<212> PRT 

<213> Homo sapiens 
<400> 419 

Met Cys Val Val Cys Val Cys Val Trp Cys Met Cys Val Cys Gly Val 
15 10 15 

Cys Val Cys Leu Cys Val Cys Gly Val Cys Met Cys lie Ser Leu Asn 

20 25 30 

Glu Lys Leu Ala Pro Met lie Met Glu Leu Thr Thr Pro Lys Val Cys 
35 40 45 

Arg Gin Gin Ala Gly Gly Pro Gly Gly Pro Val Val Trp Leu Gin Pro 
50 55 60 

Val Ser Glu Gly Leu Arg Thr Arg Arg Ala Gly Gly Ala Ala Ala Val 
65 70 75 80 
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<210> 420 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 420 

Met Ser Thr Phe Val Cys Val Cys Val Phe Cys Phe Val Leu Arg Ser 
15 10 15 

Glu Ala Arg Ala Lys Arg Lys Gin Asp Gin Arg Asn Thr Lys Arg Cys 

20 25 30 

Leu Leu Thr Lys Gly Gin Arg Asp Leu Ser Val Asn Gin Ser Lys lie 
35 40 45 

Asn Arg Thr Ala Asn 
50 



<210> 421 
<211> 80 
<212> PRT 

<213> Homo sapiens 
<400> 421 

Met Ala Leu Trp Val Thr Cys lie Leu Ser Leu Cys Thr Trp Phe Ser 
15 10 15 

Cys Leu Tyr Gly Ala Asp Ser Leu Ala Asn Lys Cys Leu Ser Ala Gly 

20 25 30 

Ala Thr Arg Lys Ala Phe Pro Phe Cys Val Leu Phe Arg Asp Leu Glu 
35 40 45 

Val Gly Leu Gly Phe Glu Gly Phe Val Thr His Leu Ala Cys Lys Leu 
50 55 60 

Phe Cys Tyr Cys Glu Leu Ser Asp Ser Ala Leu Ser Leu Gly His Glu 
65 70 75 80 



<210> 422 
<211> 320 
<212> PRT 

<213> Homo sapiens 
<400> 422 

Met Arg Gly Ser Val Glu Cys Thr Trp Gly Trp Gly His Cys Ala Pro 
15 10 15 
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Ser Pro Leu Leu Leu Trp Thr Leu Leu Leu Phe Ala Ala Pro Phe Gly 

20 25 30 

Leu Leu Gly Glu Lys Thr Arg Gin Leu Leu Glu Phe Asp Ser Thr Asn 
35 40 45 

Val Ser Asp Thr Ala Ala Lys Pro Leu Gly Arg Pro Tyr Pro Pro Tyr 
50 55 60 

Ser Leu Ala Asp Phe Ser Trp Asn Asn lie Thr Asp Ser Leu Asp Pro 
65 70 75 80 

Ala Thr Leu Ser Ala Thr Phe Gin Gly His Pro Met Asn Asp Pro Thr 

85 90 95 

Arg Thr Phe Ala Asn Gly Ser Leu Ala Phe Arg Val Gin Ala Phe Ser 

100 105 110 

Arg Ser Ser Arg Pro Ala Gin Pro Pro Arg Leu Leu His Thr Ala Asp 
115 120 125 

Thr Cys Gin Leu Glu Val Ala Leu lie Gly Ala Ser Pro Arg Gly Asn 
130 135 140 

Arg Ser Leu Phe Gly Leu Glu Val Ala Thr Leu Gly Gin Gly Pro Asp 
145 150 155 160 

Cys Pro Ser Met Gin Glu Gin His Ser lie Asp Asp Glu Tyr Ala Pro 

165 170 175 

Ala Val Phe Gin Leu Asp Gin Leu Leu Trp Gly Ser Leu Pro Ser Gly 

180 185 190 

« 

Phe Ala Gin Trp Arg Pro Val Ala Tyr Ser Gin Lys Pro Gly Gly Arg 
195 200 205 

Glu Ser Ala Leu Pro Cys Gin Ala Ser Pro Leu His Pro Ala Leu Ala 
210 215 220 

Tyr Ser Leu Pro Gin Ser Pro lie Val Arg Ala Phe Phe Gly Ser Gin 
225 230 235 240 

Asn Asn Phe Cys Ala Phe Asn Leu Thr Phe Gly Ala Ser Thr Gly Pro 

245 250 255 

Gly Tyr Trp Asp Gin His Tyr Leu Ser Trp Ser Met Leu Leu Gly Val 

260 265 270 

Gly Phe Pro Pro Val Asp Gly Leu Ser Pro Leu Val Leu Gly lie Met 
* 275 280 285 

Ala Val Ala Leu Gly Ala Pro Gly Leu Met Leu Leu Gly Gly Gly Leu 
290 295 300 

Val Leu Leu Leu His His Lys Lys Tyr Ser Glu Tyr Gin Ser He Asn 
305 310 315 320 
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<210> 423 
<211> 115 
<212> PRT 

<213> Homo sapiens 
<400> 423 

Met Leu Ala Leu Ser Ser Ser Phe Leu Val Leu Ser Tyr Leu Leu Thr 
15 10 15 

Arg Trp Cys Gly Ser Val Gly Phe He Leu Ala Asn Cys Phe Asn Met 

20 25 30 

Gly Tie Arg He Thr Gin Ser Leu Cys Phe He His Arg Tyr Tyr Arg 
35 40 45 

Arg Ala Pro Thr Gly Pro Trp Leu Ala Cys Thr Tyr Arg Gin Ser Cys 
50 55 60 

Ser Gly His Leu Pro Ser Val Val Gly Leu Leu Leu Phe Arg Arg Tyr 
65 70 75 80 

Ser Ser Ala Val Ser Arg Ala Gly Gin Pro Asp Trp His Thr Leu Leu 

85 90 95 

Trp Gly Pro Ser Val Trp Glu Gin Leu Ser Gly Gin His Ser Ser Gin 

100 105 110 

Arg Pro Ser 
115 



<210> 424 
<211> 402 
<212> PRT 
<213> Homo sapiens 

<400> 424 

Met Tyr Ser Gly Asn Arg Ser Gly Gly His Gly Tyr Trp Asp Gly Gly 
15 10 15 

Gly Ala Ala Gly Ala Glu Gly Pro Ala Pro Ala Gly Thr Leu Ser Pro 

20 25 30 

Ala Pro Leu Phe Ser Pro Gly Thr Tyr Glu Arg Leu Ala Leu Leu Leu 
35 40 45 

Gly Ser He Gly Leu Leu Gly Val Gly Asn Asn Leu Leu Val Leu Val 
50 55 60 

Leu Tyr Tyr Lys Phe Gin Arg Leu Arg Thr Pro Thr His Leu Leu Leu 
65 70 75 80 

Val Asn He Ser Leu Ser Asp Leu Leu Val Ser Leu Phe Gly Val Thr 

85 90 95 

Phe Thr Phe Val Ser Cys Leu Arg Asn Gly Trp Val Trp Asp Thr Val 

100 105 110 

Gly Cys Val Trp Asp Gly Phe Ser Gly Ser Leu Phe Gly He Val Ser 
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115 120 125 

He Ala Thr Leu Thr Val Leu Ala Tyr Glu Arg Tyr He Arg Val Val 
130 135 140 

His Ala Arg Val He Asn Phe Ser Trp Ala Trp Arg Ala He Thr Tyr 
145 150 155 160 

He Trp Leu Tyr Ser Leu Ala Trp Ala Gly Ala Pro Leu Leu Gly Trp 

165 170 175 

Asn Arg Tyr He Leu Asp Val His Gly Leu Gly Cys Thr Val Asp Trp 

180 185 190 

Lys Ser Lys Asp Ala Asn Asp Ser Ser Phe Val Leu Phe Leu Phe Leu 
195 200 205 

Gly Cys Leu Val Val Pro Leu Gly Val He Ala His Cys Tyr Gly His 
210 215 220 

He Leu Tyr Ser He Arg Met Leu Arg Cys Val Glu Asp Leu Gin Thr 
225 230 235 240 

He Gin Val He Lys He Leu Lys Tyr Glu Lys Lys Leu Ala Lys Met 

245 250 255 

Cys Phe Leu Met He Phe Thr Phe Leu Val Cys Trp Met Pro Tyr He 

260 265 270 

Val He Cys Phe Leu Val Val Asn Gly His Gly His Leu Val Thr Pro 
275 280 285 

Thr He Ser He Val Ser Tyr Leu Phe Ala Lys Ser Asn Thr Val Tyr 
290 295 300 

Asn Pro Val He Tyr Val Phe Met He Arg Lys Phe Arg Arg Ser Leu 
305 310 315 320 

Leu Gin Leu Leu Cys Leu Arg Leu Leu Arg Cys Gin Arg Pro Ala Lys 

325 330 335 

Asp Leu Pro Ala Ala Gly Ser Glu Met Gin He Arg Pro He Val Met 

340 345 350 

Ser Gin Lys Asp Gly Asp Arg Pro Lys Lys Lys Val Thr Phe Asn Ser 
355 360 365 

Ser Ser He He Phe He He Thr Ser Asp Glu Ser Leu Ser Val Asp 
370 375 380 

Asp Ser Asp Lys Thr Asn Gly Ser Lys Val Asp Val He Gin Val Arg 
385 390 395 400 

Pro Leu 



<210> 425 
<211> 76 
<212> PRT 
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<213> Homo sapiens 
<400> 425 

Met Gly Ala His Ser Phe Gly Phe Gin Leu Phe Met Ser Val Ser Val 
15 10 15 

Leu Trp Gly Arg Leu Cys Leu Tyr Gly Arg Phe Ser Val lie Thr Phe 

20 25 30 

Ala Ser Pro Pro Thr Thr Phe Met Asp He Gin Cys Cys Phe Ala Leu 
35 40 45 

Gin Leu Glu Arg Arg Asp Gly Gin Leu Val Thr Leu Ser His He Ala 
50 55 60 

Thr Phe He Cys Ser Gly Lys Lys Leu Asp Arg Trp 
65 70 75 



<210> 426 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 426 

Met Ala Val Pro Leu Phe Leu Tyr 
1 5 

Pro Phe Leu Leu Ser Leu Cys Phe 

20 

Ser Ser Ser Glu Ser Lys Ser Ser 
35 40 



He Phe Thr Leu Leu Pro Leu Leu 
10 15 

Ser Pro Leu Thr Val Lys Arg Ser 
25 30 

Leu 



<210> 427 
<211> 35 
<212> PRT 

<213> Homo sapiens 
<400> 427 

He Tyr Ser Ser Gly Tyr Phe Gin He Tyr Asn Met Leu Leu Leu Thr 
15 10 15 

He Leu He Leu Leu Cys Asn Arg Thr Pro Glu Leu He Pro Gly Phe 

20 25 30 

Tyr He Arg 
35 



<210> 428 
<211> 484 
<212> PRT 

<213> Homo sapiens 
<400> 428 

Met Pro Arg His Leu Ser Gly Leu Leu Leu Leu Leu Trp Pro Leu Leu 
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1 



5 



10 



15 



Leu Leu Leu Pro Pro Thr Pro Ala Ala Pro Gly Pro Leu Ala Arg Pro 

20 25 30 

Gly Leu Arg Arg Leu Gly Thr Arg Gly Pro Gly Gly Ser Pro Gly Arg 
35 40 45 

Arg Pro Val Ser Ala Val Pro Thr Arg Ala Pro Tyr Ser Gly Ala Gly 
50 55 60 

Gin Pro Gly Gly Ala Arg Gly Ala Gly Val Cys Arg Ser Arg Pro Leu 
65 70 75 80 

Asp Leu Val Phe lie lie Asp Ser Ser Arg Ser Val Arg Pro Leu Glu 



Phe Thr Lys Val Lys Thr Phe Val Ser Gin lie lie Asp Thr Leu Asp 

100 105 110 

lie Gly Ala Ala Asp Thr Arg Val Ala Val Val Asn Tyr Ala Ser Thr 
115 120 125 

Val Lys lie Glu Phe His Leu Gin Thr His Ser Asp Lys Gin Ser Leu 
130 135 140 

Lys Gin Ala Val Ala Arg lie Thr Pro Leu Ser Thr Gly Thr Met Ser 
145 150 155 160 

Gly Leu Ala lie Gin Thr Ala Met Asp Glu Ala Phe Thr Val Glu Ala 

165 170 175 

Gly Ala Arg Gly Pro Thr Ser Asn He Pro Lys Val Ala He He Val 

180 185 190 

Thr Asp Gly Arg Pro Gin Asp Gin Val Asn Glu .Val Ala Ala Arg Ala 
195 200 205 

Arg Ala Ser Gly He Glu Leu Tyr Ala Val Gly Val Asp Arg Ala Asp 
210 215 220 

Met Glu Ser Leu Lys Met Met Ala Ser Glu Pro Leu Asp Glu His Val 
225 230 235 240 

Phe Tyr Val Glu Thr Tyr Gly Val He Glu Lys Leu Ser Ser Arg Phe 

245 250 255 

Gin Glu Thr Phe Cys Ala Leu Asp Pro Cys Val Leu Gly Thr His Arg 

260 265 270 

Cys Gin His Val Cys Val Ser Asp Gly Glu Gly Lys His His Cys Glu 
275 280 285 

Cys Ser Gin Gly Tyr Ser Leu Asn Ala Asp Gin Lys Thr Cys Ser Ala 
290 295 300 

He Asp Lys Cys Ala Leu Asn Thr His Gly Cys Glu His He Cys Val 
305 310 315 320 

Asn Asp Arg Thr Gly Ser Tyr His Cys Glu Cys Tyr Glu Gly Tyr Thr 



85 



90 



95 



325 



330 



335 
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Leu Asn Gin Asp Arg Lys Thr Cys Ser Ala Gin Asp Gin Cys Ala Phe 

340 345 350 

Gly Thr His Gly Cys Gin His He Cys Val Asn Asp Arg Asp Gly Ser 
355 360 365 

His His Cys Glu Cys Tyr Glu Gly Tyr Thr Leu Asn Ala Asp Asn Lys 
370 375 380 

Thr Cys Ser Val Arg Ser Glu Cys Ala Gly Gly Ser His Gly Cys Gin 
385 390 395 400 

His Leu Cys Val Asp Asp Gly Pro Ala Ala Tyr His Cys Asp Cys Phe 

405 410 415 

Pro Gly Tyr Thr Leu Thr Glu Asp Arg Arg Thr Cys Ala Ala He Glu 

420 425 430 

Glu Ala Arg Arg Leu Val Ser Thr Glu Asp Ala Cys Gly Cys Glu Ala 
435 440 445 

Thr Leu Ala Phe Gin Glu Arg Ala Ser Ser Tyr Leu Gin Arg Leu Asn 
450 455 460 

Ala Lys Leu Asp Asp He Leu Gly Lys Leu Gin Ala Asp Ala Tyr Gly 
465 470 475 480 

Gin He His Arg 



<210> 429 
<211> 129 
<212> PRT 

<213> Homo sapiens 
<400> 429 

Met Ala Pro Ser Gly Pro Leu Leu Leu Val Leu Leu Val Pro Leu Ala 
15 10 15 

Ala Ala Arg Ala Gly Pro Tyr Phe Arg Pro Gly Arg Gly Cys Arg Leu 

20 25 30 

Pro Leu Arg Gly Asp Gin Leu Ser Gly Leu Gly Arg Arg Thr Tyr Pro 
35 40 45 

Arg Pro His Glu Tyr Leu Ser Pro Ser Asp Leu Pro Lys Ser Trp Asp 
50 55 60 

Trp Arg Asn Val Asn Gly Val Asn Tyr Ala Ser Ala Thr Arg Asn Gin 
65 70 75 80 

His He Pro Gin Tyr Cys Gly Ser Cys Trp Ala His Gly Ser Thr Ser 

85 90 95 

Ala Met Ala Gly Pro Asp Gin His Gin Glu Lys Gly Gly Val Ala Leu 

100 105 110 

His Pro Ala Val Arg Ala Ala Arg Pro Arg Leu Arg Gin Arg Gly Leu 



270 



WO 02/076488 PCTYUS02/08276 

* 

115 120 125 

Leu 



<210> 430 
<211> 164 
<212> PRT 

<213> Homo sapiens 
<400> 430 

Met Thr Thr Trp Ser Cys Leu Val Ala Met lie Val Ser Gly Val He 
15 10 15 

Thr Ala Val Trp Ala Val Arg Ala Ala Pro He Trp Arg Ser Gin Val 

20 25 30 

Lys Gin Lys Met Arg He Gly Lys Gin Gly Asn Cys Arg Pro Pro Arg 
35 40 45 

Cys He Cys Ser Ala Leu Gly Leu Leu Ala Pro Trp Met Ala Val Val 
50 55 60 

Leu Ser Gin Leu Ser Val Arg Cys Val Val Ser Trp Val Gin Gly Lys 
65 70 75 80 

Pro Ser Ser Pro Arg Pro Arg Gly Ser Ala Ala Ser Pro Ala Pro Gly 

85 90 95 

Ala Thr Pro Pro Thr Pro Arg Lys Pro Val Ser Trp Leu Gly Tyr Arg 

100 105 110 

Glu Asn His Arg Pro Lys Lys Pro Lys Ser Cys Thr Arg Leu Pro Gly 
115 120 125 

Leu Pro Lys Leu Glu Pro Ser Ser Thr Leu Lys Gly Gin Asp Ser Trp 
130 135 140 

Gin Met Gly His Gin Gin Asp Lys Thr Leu Trp Ser Trp Ala Ser Thr 
145 150 155 160 

Gly Gly Ser Ser 



<210> 431 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<400> 431 

Met Pro Leu Glu Glu Ser Phe Glu He Val Leu Lys Leu Val Pro Leu 
15 10 15 

Leu Gly Leu Glu Leu Phe Phe Phe Leu Phe He He Asn Gly Tyr He 

20 25 30 

Asn Val Tyr Cys Pro Ser Gin Tyr Phe He Tyr Ala Lys Asp Ser Leu 
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35 



40 



45 



Ala Gly Leu Ala Leu He Pro Gin 
50 55 



<210> 432 
<211> 40 
<212> PRT 

<213> Homo sapiens 
<400> 432 

Met Val Ala Met Val Phe Leu Lys He Ser Val Leu Pro Leu Met Cys 
15 10 15 

Arg Gly Gin Thr Lys His Lys Val Leu Arg Asp His Ala Tyr Pro Arg 

20 25 30 

■ 

Val Ser Gin Lys Arg Gly His He 
35 40 



<210> 433 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 433 

Met Cys Val Cys Leu He Cys Ser He Cys Gin Phe Leu Trp Cys Lys 
15 10 15 

Tyr Ser His Tyr Ser Cys Phe Gin Ala Asn He Val He Pro Gin Lys 

20 25 30 

Met Glu Leu Gly Arg His Asn Gin Asp 
35 40 



<210> 434 
<211> 211 
<212> PRT 

<213> Homo sapiens 
<400> 434 

Met Val Phe Leu Lys Phe Phe Cys Met Ser Phe Phe Cys His Leu Cys 
15 10 15 

Gin Gly Tyr Phe Asp Gly Pro Leu Tyr Pro Glu Met Ser Asn Gly Thr 

20 25 30 

Leu His His Tyr Phe Val Pro Asp Gly Asp Tyr Glu Glu Asn Asp Asp 
35 40 45 

Pro Glu Lys Cys Gin Leu Leu Phe Arg Val Ser Asp His Arg Arg Cys 
50 55 . 60 

Ser Gin Gly Glu Gly Ser Gin Val Gly Ser Leu Leu Ser Leu Thr Leu 
65 70 75 80 
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Arg Glu Glu Phe Thr Val Leu Gly His Gin Val Glu Asp Ala Gly Arg 

85 90 95 

Val Leu Glu Gly lie Ser Lys Ser He Ser Tyr Asp Leu Asp Gly Glu 

100 105 110 

Glu Ser Tyr Gly Lys Tyr Leu Arg Arg Glu Ser His Gin He Gly Asp 
115 120 125 

Ala Tyr Ser Asn Ser Asp Lys Ser Leu Thr Glu Leu Glu Ser Lys Phe 
130 135 140 

Lys Gin Gly Gin Glu Gin Asp Ser Arg Gin Glu Ser Arg Leu Asn Glu 
145 150 155 160 

Asp Phe Leu Gly Met Leu Val His Thr Arg Ser Leu Leu Lys Glu Thr 

165 170 175 

Leu Asp He Ser Val Gly Leu Arg Asp Lys Tyr Glu Leu Leu Ala Leu 

180 185 190 

Thr He Arg Ser His Gly Thr Arg Leu Gly Arg Leu Lys Asn Asp Tyr 
195 200 205 

Leu Lys Val 
210 



<210> 435 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (49) 

<223> Xaa equals any amino acid 
<400> 435 

Met Ser His His Ala Gly Leu Gly Gly Gly He Leu Phe Ser Leu Lys 
15 10 15 

He Ser Phe Phe He Ala Leu Ala Val Val Gly Gly Ser Arg Gly Val 

20 25 30 

Asn Asp Cys Gin Leu Gly Gly Cys Arg Val Gly Ser Cys Pro Arg Val 
35 40 45 

Xaa Val Arg Val Ala 
50 



<210> 436 
<211> 48 
<212> PRT 

<213> Homo sapiens 
<400> 436 
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Met Met Leu Tyr Gin Asn Met Leu Leu Tyr Phe Arg He He Gly Val 

1 5 10 15 

Leu Ala Leu Asn Phe Ser He Ser Pro He Phe Phe His Gly Ser Leu 

20 25 30 

Gly Lys Leu Tyr Val Tyr Ser Ala Ala Lys Tyr Ser Leu Glu Leu Lys 

35 40 45 



<210> 437 
<211> 201 
<212> PRT 
<213> Homo sapiens 

<400> 437 

Met Lys Leu Leu He Leu Phe Leu Ser His Leu Leu Ser Leu Ala Phe 
15 10 15 

Gly He Leu Cys Leu Ser Val Thr Val He Leu Ser Leu Leu Leu Ser 

20 25 30 

Phe Ser Lys Arg Gly Phe Ser Val Arg Ser Phe Gly Thr Gly Thr His 
35 40 45 

Val Lys Leu Pro Gly Pro Ala Pro Asp Lys Pro Asn Val Tyr Asp Phe 
50 55 60 

Lys Thr Thr Tyr Asp Gin Met Tyr Asn Asp Leu Leu Arg Lys Asp Lys 
65 70 75 80 

Glu Leu Tyr Thr Gin Asn Gly He Leu His Met Leu Asp Arg Asn Lys 

85 90 95 

Arg lie Lys Pro Arg Pro Glii Arg Phe Gin Asn Cys Lys Asp Leu Phe 

100 105 110 

Asp Leu lie Leu Thr Cys Glu Glu Arg Val Tyr Asp Gin Val Val Glu 
115 120 125 

Asp Leu Asn Ser Arg Glu Gin Glu Thr Cys Gin Pro Val His Val Val 
130 135 140 

Asn Val Asp He Gin Asp Asn His Glu Glu Ala Thr Leu Gly Ala Phe 
145 150 155 160 

Leu He Cys Glu Leu Cys Gin Cys He Gin His Thr Glu Asp Met Glu 

165 170 175 

Asn Glu He Asp Glu Leu Leu Gin Glu Phe Glu Glu Lys Ser Gly Arg 

180 185 190 

Thr Phe Leu His Thr Val Cys Phe Tyr 
195 200 
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<210> 438 
<211> 420 
<212> PRT 

<213> Homo sapiens 
<400> 438 

Met Ala Pro Trp Pro Pro Lys Gly Leu Val Pro Ala Val Leu Trp Gly 
15 10 15 

Leu Ser Leu Phe Leu Asn Leu Pro Gly Pro He Trp Leu Gin Pro Ser 

20 25 30 

Pro Pro Pro Gin Ser Ser Pro Pro Pro Gin Pro His Pro Cys His Thr 
35 40 45 

Cys Arg Gly Leu Val Asp Ser Phe Asn Lys Gly Leu Glu Arg Thr He 
50 55 60 

Arg Asp Asn Phe Gly Gly Gly Asn Thr Ala Trp Glu Glu Glu Asn Leu 
65 70 75 80 

Ser Lys Tyr Lys Asp Ser Glu Thr Arg Leu Val Glu Val Leu Glu Gly 

85 90 95 

Val Cys Ser Lys Ser Asp Phe Glu Cys His Arg Leu Leu Glu Leu Ser 

100 105 110 

Glu Glu Leu Val Glu Ser Trp Trp Phe His Lys Gin Gin Glu Ala Pro 
115 120 125 

Asp Leu Phe Gin Trp Leu Cys Ser Asp Ser Leu Lys Leu Cys Cys Pro 
130 135 140 

Ala Gly Thr Phe Gly Pro Ser Cys Leu Pro Cys Pro Gly Gly Thr Glu 
145 150 155 160 

Arg Pro Cys Gly Gly Tyr Gly Gin Cys Glu Gly Glu Gly Thr Arg Gly 

165 170 175 

Gly Ser Gly His Cys Asp Cys Gin Ala Gly Tyr Gly Gly Glu Ala Cys 

180 185 190 

Gly Gin Cys Gly Leu Gly Tyr Phe Glu Ala Glu Arg Asn Ala Ser His 
195 200 205 

Leu Val Cys Ser Ala Cys Phe Gly Pro Cys Ala Arg Cys Ser Gly Pro 
210 215 220 

Glu Glu Ser Asn Cys Leu Gin Cys Lys Lys Gly Trp Ala Leu His His 
225 230 235 240 

Leu Lys Cys Val Asp He Asp Glu Cys Gly Thr Glu Gly Ala Asn Cys 

245 250 255 

Gly Ala Asp Gin Phe Cys Val Asn Thr Glu Gly Ser Tyr Glu Cys Arg 

260 265 270 

Asp Cys Ala Lys Ala Cys Leu Gly Cys Met Gly Ala Gly Pro Gly Arg 
275 280 285 



Cys Lys Lys Cys Ser Pro Gly Tyr Gin Gin Val Gly Ser Lys Cys Leu 
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290 295 300 

Asp Val Asp Glu Cys Glu Thr Glu Val Cys Pro Gly Glu Asn Lys Gin 
305 310 315 320 

Cys Glu Asn Thr Glu Gly Gly Tyr Arg Cys lie Cys Ala Glu Gly Tyr 

325 330 335 

Lys Gin Met Glu Gly He Cys Val Lys Glu Gin He Pro Glu Ser Ala 

340 345 350 

Gly Phe Phe Ser Glu Met Thr Glu Asp Glu Leu Val Val Leu Gin Gin 
355 360 365 

Met Phe Phe Gly He He lie Cys Ala Leu Ala Thr Leu Ala Ala Lys 
370 375 380 

Gly Asp Leu Val Phe Thr Ala lie Phe He Gly Ala Val Ala Ala Met 
385 390 395 400 

Thr Gly Tyr Trp Leu Ser Glu Arg Ser Asp Arg Val Leu Glu Gly Phe 

405 410 415 

He Lys Gly Arg 

420 



<210> 439 
<211> 102 
<212> PRT 

i 

<213> Homo sapiens 
<400> 439 

Met Thr Val Arg Arg Leu Ser Leu Leu Cys Arg Asp Leu Trp Ala Leu 
15 10 15 

Trp Leu Leu Leu Lys Ala Gly Ala Val Arg Gly Ala Arg Ala Gly Pro 

20 25 30 

Arg Leu Pro Gly Arg Cys Cys Gly Ala Thr Cys Gly Asp Ala Gly Arg 
35 40 45 

Gly Trp Thr Phe Trp Ala Gin Pro Cys Pro Gin Arg Leu Leu Gly Gin 
50 55 60 

Lys Pro Gly Ala Gly Gly Cys Arg Gly Trp Val Leu Gly Trp Val Pro 
65 70 75 80 

Pro Arg Pro Glu Glu Pro Cys Ser Leu Ala Gly Lys Val Cys Thr Gly 

85 90 95 

Leu Ala Arg Trp Met Val 

100 



<210> 440 
<211> 53 
<212> PRT 

<213> Homo sapiens 



276 



WO 02/076488 PCT/US02/08276 



<220> 

<221> SITE 
<222> (ID 

<223> Xaa equals any amino acid 
<400> 440 

Met Cys Lys Ala Val Cys Lys His Arg Leu Xaa Leu Phe Ala Val Ser 
1,5 10 15 

Ser Phe Ser Leu Gly Leu Gly Trp Val Cys Val Leu Val Leu Met Leu 

20 25 30 

Trp Pro Val Arg Leu Ser Leu Ala Pro Arg Pro Val Gin Leu Gin Gin 
35 40 45 

Arg Arg Ser His Cys 
50 



<210> 441 
<211> 472 
<212> PRT 

<213> Homo sapiens 
<400> 441 

Met Lys Phe Leu lie Phe Ala Phe Phe Gly Gly Val His Leu Leu Ser 
15 10 15 

Leu Cys Ser Gly Lys Ala lie Cys Lys Asn Gly He Ser Lys Arg Thr 

20 25 30 

Phe Glu Glu lie Lys Glu Glu He Ala Ser Cys Gly Asp Val Ala Lys 
35 40 45 

Ala He He Asn Leu Ala Val Tyr Gly Lys Ala Gin Asn Arg Ser Tyr 
50 55 60 

Glu Arg Leu Ala Leu Leu Val Asp Thr Val Gly Pro Arg Leu Ser Gly 
65 70 75 80 

Ser Lys Asn Leu Glu Lys Ala He Gin He Met Tyr Gin Asn Leu Gin 

85 90 95 

Gin Asp Gly Leu Glu Lys Val His Leu Glu Pro Val Arg He Pro His 

100 105 110 

Trp Glu Arg Gly Glu Glu Ser Ala Val Met Leu Glu Pro Arg lie His 
115 120 125 

Lys He Ala He Leu Gly Leu Gly Ser Ser He Gly Thr Pro Pro Glu 
130 135 140 

Gly He Thr Ala Glu Val Leu Val Val Thr Ser Phe Asp Glu Leu Gin 
145 150 155 160 

Arg Arg Ala Ser Glu Ala Arg Gly Lys He Val Val Tyr Asn Gin Pro 

165 170 175 

Tyr He Asn Tyr Ser Arg Thr Val Gin Tyr Arg Thr Gin Gly Ala Val 
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180 185 190 

Glu Ala Ala Lys Val Gly Ala Leu Ala Ser Leu lie Arg Ser Val Ala 
195 200 205 

Ser Phe Ser lie Tyr Ser Pro His Thr Gly He Gin Glu Tyr Gin Asp 
210 215 220 

Gly Val Pro Lys He Pro Thr Ala Cys He Thr Val Glu Asp Ala Glu 
225 230 235 240 

Met Met Ser Arg Met Ala Ser His Gly He Lys He Val He Gin Leu 

245 250 255 

Lys Met Gly Ala Lys Thr Tyr Pro Asp Thr Asp Ser Phe Asn Thr Val 

260 265 270 

Ala Glu He Thr Gly Ser Lys Tyr Pro Glu Gin Val Val Leu Val Ser 
275 280 285 

Gly His Leu Asp Ser Trp Asp Val Gly Gin Gly Ala Met Asp Asp Gly 
290 295 300 

Gly Gly Ala Phe He Ser Trp Glu Ala Leu Ser Leu He Lys Asp Leu 
305 310 315 320 

Gly Leu Arg Pro Lys Arg Thr Leu Arg Leu Val Leu Trp Thr Ala Glu 

325 330 335 

Glu Gin Gly Gly Val Gly Ala Phe Gin Tyr Tyr Gin Leu His Lys Val 

340 345 350 

Asn He Ser Asn Tyr Ser Leu Val Met Glu Ser Asp Ala Gly Thr Phe 
355 360 365 

Leu Pro Thr Gly Leu Gin Phe Thr Gly Ser Glu Lys Ala Arg Ala He 
370 375 380 

* 

Met Glu Glu Val Met Ser Leu Leu Gin Pro Leu Asn He Thr Gin Val 
385 390 395 400 

Leu Ser His Gly Glu Gly Thr Asp He Asn Phe Trp He Gin Ala Gly 

405 410 415 

Val Pro Gly Ala Ser Leu Leu Asp Asp Leu Tyr Lys Tyr Phe Phe Phe 

420 425 430 

His His Ser His Gly Asp Thr Met Thr Val Met Asp Pro Lys Gin Met 
435 440 445 

Asn Val Ala Ala Ala Val Trp Ala Val Val Ser Tyr Val Val Ala Asp 
450 455 460 

Met Glu Glu Met Leu Pro Arg Ser 
465 470 



<210> 442 
<211> 359 
<212> PRT 
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<213> Homo sapiens 
<400> 442 

Met Lys Leu Gly Cys Val Leu Met Ala Trp Ala Leu Tyr Leu Ser Leu 
15 10 15 

Gly Val Leu Trp Val Ala Gin Met Leu Leu Ala Ala Ser Phe Glu Thr 

20 25 30 

Leu Gin Cys Glu Gly Pro Val Cys Thr Glu Glu Ser Ser Cys His Thr 
35 40 45 

Glu Asp Asp Leu Thr Asp Ala Arg Glu Ala Gly Phe Gin Val Lys Ala 
50 55 60 

Tyr Thr Phe Ser Glu Pro Phe His Leu lie Val Ser Tyr Asp Trp Leu 
65 70 75 80 

lie Leu Gin Gly Pro Ala Lys Pro Val Phe Glu Gly Asp Leu Leu Val 

85 90 95 

Leu Arg Cys Gin Ala Trp Gin Asp Trp Pro Leu Thr Gin Val Thr Phe 

100 105 110 

Tyr Arg Asp Gly Ser Ala Leu Gly Pro Pro Gly Pro Asn Arg Glu Phe 
115 120 125 

Ser lie Thr Val Val Gin Lys Ala Asp Ser Gly His Tyr His Cys Ser 
130 135 140 

Gly lie Phe Gin Ser Pro Gly Pro Gly lie Pro Glu Thr Ala Ser Val 
145 150 155 160 

Val Ala lie Thr Val Gin Glu Leu Phe Pro Ala Pro lie Leu Arg Ala 

165 170 175 

Val Pro Ser Ala Glu Pro Gin Ala Gly Gly Pro Met Thr Leu Ser Cys 

180 185 190 

Gin Thr Lys Leu Pro Leu Gin Arg Ser Ala Ala Arg Leu Leu Phe Ser 
195 200 205 

Phe Tyr Lys Asp Gly Arg lie Val Gin Ser Arg Gly Leu Ser Ser Glu 
210 215 220 

Phe Gin He Pro Thr Ala Ser Glu Asp His Ser Gly Ser Tyr Trp Cys 
225 230 235 240 

Glu Ala Ala Thr Glu Asp Asn Gin Val Trp Lys Gin Ser Pro Gin Leu 

245 250 255 

Glu He Arg Val Gin Gly Ala Ser Ser Ser Ala Ala Pro Pro Thr Leu 

260 265 270 

Asn Pro Ala Pro Gin Lys Ser Ala Ala Pro Gly Thr Ala Pro Glu Glu 
275 280 285 

Ala Pro Gly Pro Leu Pro Pro Pro Pro Thr Pro Ser Ser Glu Asp Pro 
290 295 300 

Gly Phe Ser Ser Pro Leu Gly Met Pro Asp Pro His Leu Tyr His Gin 
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305 310 315 320 

Met Gly Leu Leu Leu Lys His Met Gin Asp Val Arg Val Leu Leu Gly 

325 330 335 

His Leu Leu Met Glu Leu Arg Glu Leu Ser Gly His Arg Lys Pro Gly 

340 345 350 

Thr Thr Lys Ala Thr Ala Glu 
355 



<210> 443 
<211> 379 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (283) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (303) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (307) 

<223> Xaa equals any amino acid 
<400> 443 

Met Gly Tyr lie Asp Asp Pro Asp Lys Tyr His Gin Gly Phe Glu Leu 
15 10 15 

Leu Leu Ser Ala Leu Gly Asp Pro Ser Glu Arg Val Val Ser Ala Thr 

20 25 30 

His Gin Val Phe Leu Pro Ala Tyr Ala Ala Trp Thr Thr Glu Leu Gly 
35 40 45 

Asn Leu Gin Ser His Leu lie Leu Thr Leu Leu Asn Lys lie Glu Lys 
50 55 60 

Leu Leu Arg Glu Gly Glu His Gly Leu Asp Glu His Lys Leu His Met 
65 70 75 80 

Tyr Leu Ser Ala Leu Gin Ser Leu lie Pro Ser Leu Phe Ala Leu Val 

85 90 95 

Leu Gin Asn Ala Pro Phe Ser Ser Lys Ala Lys Leu His Gly Glu Val 

100 105 110 

Pro Gin lie Glu Val Thr Arg Phe Pro Arg Pro Met Ser Pro Leu Gin 
115 120 125 

Asp Val Ser Thr lie lie Gly Ser Arg Glu Gin Leu Ala Val Leu Leu 
130 135 140 
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Gin Leu Tyr Asp Tyr Gin Leu Glu Gin Glu Gly Thr Thr Gly Trp Glu 
145 150 155 160 

Ser Leu Leu Trp Val Val Asn Gin Leu Leu Pro Gin Leu lie Glu lie 

165 170 175 

Val Gly Lys lie Asn Val Thr Ser Thr Ala Cys Val His Glu Phe Ser 

180 185 190 

Arg Phe Phe Trp Arg Leu Cys Arg Thr Phe Gly Lys lie Phe Thr Asn 
195 200 205 

Thr Lys Val Lys Pro Gin Phe Gin Glu lie Leu Arg Leu Ser Glu Glu 
210 215 220 

Asn lie Asp Ser Ser Ala Gly Asn Gly Val Leu Thr Lys Ala Thr Val 
225 230 235 240 

Pro lie Tyr Ala Thr Gly Val Leu Thr Cys Tyr lie Gin Glu Glu Asp 

245 250 255 

Arg Lys Leu Leu Val Gly Phe Leu Glu Asp Val Met Thr Leu Leu Ser 

260 265 270 

Leu Ser His Ala Pro Leu Asp Ser Leu Lys Xaa Ser Phe Val Glu Leu 
275 280 285 

Gly Ala Asn Gin Ala Tyr His Glu Leu Leu Leu Thr Val Leu Xaa Tyr 
290 295 300 

Gly Val Xaa His Thr Ser Ala Leu Val Arg Cys Thr Ala Ala Arg Met 
305 310 315 320 

Phe Glu Leu Leu Val Lys Gly Val Asn Glu Thr Leu Val Ala Gin Arg 

325 330 335 

Val Val Pro Ala Leu lie Thr Leu Ser Ser Asp Pro Glu lie Ser Val 

340 345 350 

Arg He Ala Thr He Pro Ala Phe Gly Thr He Met Glu Thr Val He 
355 360 365 

Gin Arg Glu Leu Leu Glu Arg Val Lys Met Gin 
370 375 



<210> 444 
<211> 48 
<212> PRT 

<213> Homo sapiens 
<400> 444 

Met Ser Thr Val Thr Trp Leu Leu Lys Leu Phe Thr Gin Phe Met Phe 
1 5 10 15 

Pro Pro Thr Val Ser Asn Ser His Thr Cys Ala Arg Tyr Tyr Val Phe 

20 25 30 

Asn Phe Cys Leu He He Ser Phe Asn Phe Asn Phe His Tyr His Trp 
35 40 45 
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<210> 445 
<211> 142 
<212> PRT 

<213> Homo sapiens 
<400> 445 

Met Gly Cys Leu Val Trp Gly Pro Ser Trp Pro Pro Leu Ser Leu Leu 
15 10 15 

Ala Ser Leu Leu His Ser Gly lie Ala Gly Arg Cys Leu Leu Cys Leu 

20 25 30 

Phe Lys Gly Leu Ala Ala Ala Ala Ser Leu Gin lie Arg Asp Leu Ala 
35 40 45 

Ser Arg Leu Thr Thr Gly Pro Arg Thr Cys Arg Val Gin Pro Pro Pro 
50 55 60 

His Pro Gin Ser Ser Pro Pro Trp Pro Gly Pro Pro Gly Ala Glu Thr 
65 70 75 80 

Cys Arg Pro Leu Ser Arg Thr Val Gly Gly Val Cys Pro Ser Asp Trp 

85 90 95 

Pro Val Ser Trp Leu Leu Leu Pro Pro Leu Pro Glu Val Val Thr Cys 

100 105 110 

Ser Cys Pro Arg lie Lys Ala Arg Pro Glu Arg Thr Pro Glu Leu Leu 
115 120 125 

Cys Ala Trp Gly Gly Arg Gly Lys His Ser Gin Leu Val Ala 
130 135 140 



<210> 446 
<211> 399 
<212> PRT 

<213> Homo sapiens 
<400> 446 

Met Gly lie Leu Leu Gly Leu Leu Leu Leu Gly His Leu Thr Val Asp 
15 10 15 

Thr Tyr Gly Arg Pro lie Leu Glu Val Pro Glu Ser Val Thr Gly Pro 

20 25 30 

Trp Lys Gly Asp Val Asn Leu Pro Cys Thr Tyr Asp Pro Leu Gin Gly 
35 40 45 

Tyr Thr Gin Val Leu Val Lys Trp Leu Val Gin Arg Gly Ser Asp Pro 
50 55 60 

Val Thr He Phe Leu Arg Asp Ser Ser Gly Asp His He Gin Gin Ala 
65 70 75 80 
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Lys Tyr Gin Gly Arg Leu His Val Ser His Lys Val Pro Gly Asp Val 

85 90 95 

Ser Leu Gin Leu Ser Thr Leu Glu Met Asp Asp Arg Ser His Tyr Thr 

100 105 110 

Cys Glu Val Thr Trp Gin Thr Pro Asp Gly Asn Gin Val Val Arg Asp 
115 120 125 

Lys lie Thr Glu Leu Arg Val Gin Lys Leu Ser Val Ser Lys Pro Thr 
130 135 140 

Val Thr Thr Gly Ser Gly Tyr Gly Phe Thr Val Pro Gin Gly Met Arg 
145 150 155 160 

lie Ser Leu Gin Cys Gin Ala Arg Gly Ser Pro Pro He Ser Tyr He 

165 170 175 

Trp Tyr Lys Gin Gin Thr Asn Asn Gin Glu Pro He Lys Val Ala Thr 

180 185 190 

Leu Ser Thr Leu Leu Phe Lys Pro Ala Val He Ala Asp Ser Gly Ser 
195 200 205 

Tyr Phe Cys Thr Ala Lys Gly Gin Val Gly Ser Glu Gin His Ser Asp 
210 215 220 

He Val Lys Phe Val Val Lys Asp Ser Ser Lys Leu Leu Lys Thr Lys 
225 230 235 240 

Thr Glu Ala Pro Thr Thr Met Thr Tyr Pro Leu Lys Ala Thr Ser Thr 

245 250 255 

Val Lys Gin Ser Trp Asp Trp Thr Thr Asp Met Asp Gly Tyr Leu Gly 

260 265 270 

Glu Thr Ser Ala Gly Pro Gly Lys Ser Leu Pro Val Phe Ala He He 
275 280 285 

Leu He He Ser Leu Cys Cys Met Val Val Phe Thr Met Ala Tyr He 
290 295 300 

Met Leu Cys Arg Lys Thr Ser Gin Gin Glu His Val Tyr Glu Ala Ala 
305 310 315 320 

Arg Ala His Ala Arg Glu Ala Asn Asp Ser Gly Glu Thr Met Arg Val 

325 330 335 

Ala He Phe Ala Ser Gly Cys Ser Ser Asp Glu Pro Thr Ser Gin Asn 

340 345 350 

Leu Gly Asn Asn Tyr Ser Asp Glu Pro Cys He Gly Gin Glu Tyr Gin 
355 360 365 

He He Ala Gin He Asn Gly Asn Tyr Ala Arg Leu Leu Asp Thr Val 
370 375 380 

Pro Leu Asp Tyr Glu Phe Leu Ala Thr Glu Gly Lys Ser Val Cys 
385 390 395 
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<210> 447 
<211> 223 
<212> PRT 

<213> Homo sapiens 
<400> 447 

Met Lys Phe Val Pro Cys Leu Leu Leu Val Thr Leu Ser Cys Leu Gly 
15 10 15 

Thr Leu Gly Gin Ala Pro Arg Gin Lys Gin Gly Ser Thr Gly Glu Glu 

20 25 30 

Phe His Phe Gin Thr Gly Gly Arg Asp Ser Cys Thr Met Arg Pro Ser 
35 40 45 

Ser Leu Gly Gin Gly Ala Gly Glu Val Trp Leu Arg Val Asp Cys Arg 
50 55 60 

Asn Thr Asp Gin Thr Tyr Trp Cys Glu Tyr Arg Gly Gin Pro Ser Met 
65 70 75 80 

Cys Gin Ala Phe Ala Ala Asp Pro Lys Ser Tyr Trp Asn Gin Ala Leu 

85 90 95 

Gin Glu Leu Arg Arg Leu His His Ala Cys Gin Gly Ala Pro Val Leu 

100 105 110 

Arg Pro Ser Val Cys Arg Glu Ala Gly Pro Gin Ala His Met Gin Gin 
115 120 125 

Val Thr Ser Ser Leu Lys Gly Ser Pro Glu Pro Asn Gin Gin Pro Glu 
130 ' 135 140 

Ala Gly Thr Pro Ser Leu Arg Pro Lys Ala Thr Val Lys Leu Thr Glu 
145 150 155 160 

Ala Thr Gin Leu Gly Lys Asp Ser Met Glu Glu Leu Gly Lys Ala Lys 

165 170 175 

Pro Thr Thr Arg Pro Thr Ala Lys Pro Thr Gin Pro Gly Pro Arg Pro 

180 185 190 

Gly Gly Asn Glu Glu Ala Lys Lys Lys Ala Trp Glu His Cys Trp Lys 
195 200 205 

Pro Phe Gin Ala Leu Cys Ala Phe Leu lie Ser Phe Phe Arg Gly 
210 215 220 



<210> 448 
<211> 135 
<212> PRT 

<213> Homo sapiens 
<400> 448 

Met Gly Leu Trp Leu Gly Met Leu Ala Cys Val Phe Leu Ala Thr Ala 
15 10 15 
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Ala Phe Val Ala Tyr Thr Ala Arg Leu Asp Trp Lys Leu Ala Ala Glu 

20 25 30 

Glu Ala Lys Lys His Ser Gly Arg Gin Gin Gin Gin Arg Ala Glu Ser 
35 40 45 

Thr Ala Thr Arg Pro Gly Pro Glu Lys Ala Val Leu Ser Ser Val Ala 
50 55 60 

Thr Gly Ser Ser Pro Gly lie Thr Leu Thr Thr Tyr Ser Arg Ser Glu 
65 70 75 80 

Cys His Val Asp Phe Phe Arg Thr Pro Glu Glu Ala His Ala Leu Ser 

85 90 95 

Ala Pro Thr Ser Arg Leu Ser Val Lys Gin Leu Val lie Arg Arg Gly 

100 105 110 

Ala Ala Leu Gly Ala Ala Ser Ala Thr Leu Met Val Gly Leu Thr Val 
115 120 125 

Arg lie Leu Ala Thr Arg His 
130 135 



<210> 449 
<211> 181 
<212> PRT 

<213> Homo sapiens 
<400> 449 

Met Thr Val lie Leu lie lie Leu lie Val Val Met Ala Arg Tyr Cys 
15 10 15 

Arg Ser Lys Asn Lys Asn Gly Tyr Glu Ala Gly Lys Lys Asp His Glu 

20 25 30 

Asp Phe Phe Thr Pro Gin Gin His Asp Lys Ser Lys Lys Pro Lys Lys 
35 40 45 

■ 

Asp Lys Lys Asn Lys Lys Ser Lys Gin Pro Leu Tyr Ser Ser lie Val 
50 55 60 

Thr Val Glu Ala Ser Lys Pro Asn Gly Gin Arg Tyr Asp Ser Val Asn 
65 70 75 80 

Glu Lys Leu Ser Asp Ser Pro Ser Met Gly Arg Tyr Arg Ser Val Asn 

85 90 95 

Gly Gly Pro Gly Ser Pro Asp Leu Ala Arg His Tyr Lys Ser Ser Ser 

100 105 110 

Pro Leu Pro Thr Val Gin Leu His Pro Gin Ser Pro Thr Ala Gly Lys 
115 120 125 

Lys His Gin Ala Val Gin Asp Leu Pro Pro Ala Asn Thr Phe Val Gly 
130 135 140 

Ala Gly Asp Asn lie Ser lie Gly Ser Asp His Cys Ser Glu Tyr Ser 
145 150 155 160 
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Cys Gin Thr Asn Asn Lys Tyr Ser Lys Gin Met Arg Leu His Pro Tyr 

165 170 175 

He Thr Val Phe Gly 

180 



<210> 450 

<211> 58 

<212> PRT 

<213> Homo sapiens 

<400> 450 

Met Arg Thr Phe Leu Thr Phe Val He Leu Lys Val He Leu He Phe 
15 10 15 

Leu Ser Ser Cys Ala Ser Phe Thr Arg Asn Leu Leu Thr Trp Pro Asn 

20 25 30 

Asp Val Ser Thr Glu Gin Phe Glu Thr Arg Pro Phe Gly Ser Glu Leu 
35 40 45 

Leu Gin Thr Val He Asn Val Ser Arg Thr 
50 55 



<210> 451 
<211> 950 
<212> PRT 

<213> Homo sapiens 
<400> 451 

Met Thr Trp Arg Met Gly Pro Arg Phe Thr Met Leu Leu Ala Met Trp 
15 10 15 

Leu Val Cys Gly Ser Glu Pro His Pro His Ala Thr He Arg Gly Ser 

20 25 30 

His Gly Gly Arg Lys Val Pro Leu Val Ser Pro Asp Ser Ser Arg Pro 
35 40 45 

Ala Arg Phe Leu Arg His Thr Gly Arg Ser Arg Gly He Glu Arg Ser 
50 55 60 

Thr Leu Glu Glu Pro Asn Leu Gin Pro Leu Gin Arg Arg Arg Ser Val 
65 70 75 80 

Pro Val Leu Arg Leu Ala Arg Pro Thr Glu Pro Pro Ala Arg Ser Asp 

85 90 95 

He Asn Gly Ala Ala Val Arg Pro Glu Gin Arg Pro Ala Ala Arg Gly 

100 105 110 

Ser Pro Arg Glu Met He Arg Asp Glu Gly Ser Ser Ala Arg Ser Arg 
115 120 125 

Met Leu Arg Phe Pro Ser Gly Ser Ser Ser Pro Asn He Leu Ala Ser 
130 135 140 
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Phe Ala Gly Lys Asn Arg Val Trp Val lie Ser Ala Pro His Ala Ser 
145 150 155 160 

Glu Gly Tyr Tyr Arg Leu Met Met Ser Leu Leu Lys Asp Asp Val Tyr 

165 170 175 

Cys Glu Leu Ala Glu Arg His lie Gin Gin lie Val Leu Phe His Gin 

180 185 190 

Ala Gly Glu Glu Gly Gly Lys Val Arg Arg He Thr Ser Glu Gly Gin 
195 200 205 

He Leu Glu Gin Pro Leu Asp Pro Ser Leu He Pro Lys Leu Met Ser 
210 215 220 

Phe Leu Lys Leu Glu Lys Gly Lys Phe Gly Met Val Leu Leu Lys Lys 
225 230 235 240 

Thr Leu Gin Val Glu Glu Arg Tyr Pro Tyr Pro Val Arg Leu Glu Ala 

245 250 255 

Met Tyr Glu Val He Asp Gin Gly Pro He Arg Arg He Glu Lys He 

260 265 270 

Arg Gin Lys Gly Phe Val Gin Lys Cys Lys Ala Ser Gly Val Glu Gly 
275 280 285 

Gin Val Val Ala Glu Gly Asn Asp Gly Gly Gly Gly Ala Gly Arg Pro 
290 295 300 

Ser Leu Gly Ser Glu Lys Lys Lys Glu Asp Pro Arg Arg Ala Gin Val 
305 310 315 320 

Pro Pro Thr Arg Glu Ser Arg Val Lys Val Leu Arg Lys Leu Ala Ala 

325 330 335 

Thr Ala Pro Ala Leu Pro Gin Pro Pro Ser Thr Pro Arg Ala Thr Thr 

340 345 350 

Leu Pro Pro Ala Pro Ala Thr Thr Val Thr Arg Ser Thr Ser Arg Ala 
355 360 365 

Val Thr Val Ala Ala Arg Pro Met Thr Thr Thr Ala Phe Pro Thr Thr 
370 375 380 

Gin Arg Pro Trp Thr Pro Ser Pro Ser His Arg Pro Pro Thr Thr Thr 
385 390 395 400 

Glu Val He Thr Ala Arg Arg Pro Ser Val Ser Glu Asn Leu Tyr Pro 

405 410 415 

Pro Ser Arg Lys Asp Gin His Arg Glu Arg Pro Gin Thr Thr Arg Arg 

420 425 430 

Pro Ser Lys Ala Thr Ser Leu Glu Ser Phe Thr Asn Ala Pro Pro Thr 
435 440 445 

Thr He Ser Glu Pro Ser Thr Arg Ala Ala Gly Pro Gly Arg Phe Arg 
450 455 460 
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Asp Asn Arg Met Asp Arg Arg Glu His Gly His Arg Asp Pro Asn Val 
465 470 475 480 

Val Pro Gly Pro Pro Lys Pro Ala Lys Glu Lys Pro Pro Lys Lys Lys 

485 490 495 

Ala Gin Asp Lys lie Leu Ser Asn Glu Tyr Glu Glu Lys Tyr Asp Leu 

500 505 510 

Ser Arg Pro Thr Ala Ser Gin Leu Glu Asp Glu Leu Gin Val Gly Asn 
515 520 525 

Val Pro Leu Lys Lys Ala Lys Glu Ser Lys Lys His Glu Lys Leu Glu 
530 535 540 

Lys Pro Glu Lys Glu Lys Lys Lys Lys Met Lys Asn Glu Asn Ala Asp 
545 550 555 560 

Lys Leu Leu Lys Ser Glu Lys Gin Met Lys Lys Ser Glu Lys Lys Ser 

565 570 575 

Lys Gin Glu Lys Glu Lys Ser Lys Lys Lys Lys Gly Gly Lys Thr Glu 

580 585 590 

Gin Asp Gly Tyr Gin Lys Pro Thr Asn Lys His Phe Thr Gin Ser Pro 
595 600 605 

Lys Lys Ser Val Ala Asp Leu Leu Gly Ser Phe Glu Gly Lys "Arg Arg 
610 615 620 

Leu Leu Leu lie Thr Ala Pro Lys Ala Glu Asn Asn Met Tyr Val Gin 
625 * 630 635 640 

Gin Arg Asp Glu Tyr Leu Glu Ser Phe Cys Lys Met Ala Thr Arg Lys 

645 650 655 

lie Ser Val lie Thr He Phe Gly Pro Val Asn Asn Ser Thr Met Lys 

660 665 670 

He Asp His Phe Gin Leu Asp Asn Glu Lys Pro Met Arg Val Val Asp 
675 680 685 

Asp Glu Asp Leu Val Asp Gin Arg Leu He Ser Glu Leu Arg Lys Glu 
690 695 700 

Tyr Gly Met Thr Tyr Asn Asp Phe Phe Met Val Leu Thr Asp Val Asp 
705 710 715 720 

Leu Arg Val Lys Gin Tyr Tyr Glu Val Pro He Thr Met Lys Ser Val 

725 730 735 

Phe Asp Leu He Asp Thr Phe Gin Ser Arg He Lys Asp Met Glu Lys 

740 745 750 

Gin Lys Lys Glu Gly He Val Cys Lys Glu Asp Lys Lys Gin Ser Leu 
755 760 765 

Glu Asn Phe Leu Ser Arg Phe Arg Trp Arg Arg Arg Leu Leu Val He 
770 775 780 

Ser Ala Pro Asn Asp Glu Asp Trp Ala Tyr Ser Gin Gin Leu Ser Ala 



288 



WO 02/076488 



PCT/US02/08276 



785 790 795 800 

Leu Ser Gly Gin Ala Cys Asn Phe Gly Leu Arg His lie Thr lie Leu 

805 810 815 

Lys Leu Leu Gly Val Gly Glu Glu Val Gly Gly Val Leu Glu Leu Phe 

820 825 830 

Pro He Asn Gly Ser Ser Val Val Glu Arg Glu Asp Val Pro Ala His 
835 840 845 

Leu Val Lys Asp He Arg Asn Tyr Phe Gin Val Ser Pro Glu Tyr Phe 
850 855 860 

Ser Met Leu Leu Val Gly Lys Asp Gly Asn Val Lys Ser Trp Tyr Pro 
865 870 875 880 

Ser Pro Met Trp Ser Met Val He Val Tyr Asp Leu He Asp Ser Met 

885 890 895 

Gin Leu Arg Arg Gin Glu Met Ala He Gin Gin Ser Leu Gly Met Arg 

900 905 910 

i 

Cys Pro Glu Asp Glu Tyr Ala Gly Tyr Gly Tyr His Ser Tyr His Gin 
915 920 925 

Gly Tyr Gin Asp Gly Tyr Gin Asp Asp Tyr Arg His His Glu Ser Tyr 
930 935 940 

His His Gly Tyr Pro Tyr 
945 950 



<210> 452 
<211> 260 
<212> PRT 

<213> Homo sapiens 
<400> 452 

Met Leu Ala Leu Leu Gly Leu Ser Gin Ala Leu Asn He Leu Leu Gly 
15 10 15 

Leu Lys Gly Leu Ala Pro Ala Glu lie Ser Ala Val Cys Glu Lys Gly 

20 25 30 

Asn Phe Asn Val Ala His Gly Leu Ala Trp Ser Tyr Tyr He Gly Tyr 
35 40 45 

Leu Arg Leu He Leu Pro Glu Leu Gin Ala Arg He Arg Thr Tyr Asn 
50 55 60 

Gin His Tyr Asn Asn Leu Leu Arg Gly Ala Val Ser Gin Arg Leu Tyr 
65 \> 70 75 80 

lie Leu Leu Pro Leu Asp Cys Gly Val Pro Asp Asn Leu Ser Met Ala 

85 90 95 

Asp Pro Asn He Arg Phe Leu Asp Lys Leu Pro Gin Gin Thr Gly Asp 

100 105 110 . 
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Arg Ala Gly lie Lys Asp Arg Val Tyr Ser Asn Ser He Tyr Glu Leu 
115 120 125 

Leu Glu Asn Gly Gin Arg Ala Gly Thr Cys Val Leu Glu Tyr Ala Thr 
130 135 140 

Pro Leu Gin Thr Leu Phe Ala Met Ser Gin Tyr Ser Gin Ala Gly Phe 
145 150 155 160 

Ser Gly Glu Asp Arg Leu Glu Gin Ala Lys Leu Phe Cys Arg Thr Leu 

165 170 175 

Glu Asp He Leu Ala Asp Ala Pro Glu Ser Gin Asn Asn Cys Arg Leu 

180 185 190 

He Ala Tyr Gin Glu Pro Ala Asp Asp Ser Ser Phe Ser Leu Ser Gin 
195 200 205 

Glu Val Leu Arg His Leu Arg Gin Glu Glu Lys Glu Glu Val Thr Val 
210 215 220 

Gly Ser Leu Lys Thr Ser Ala Val Pro Ser Thr Ser Thr Met Ser Gin 
225 230 235 240 

Glu Pro Glu Leu Leu He Ser Gly Met Glu Lys Pro Leu Pro Leu Arg 

245 250 255 



Thr Asp Phe Ser 

260 



<210> 453 
<211> 35 
<212> PRT 

<213> Homo sapiens 
<400> 453 

Met Pro Leu Pro Ser Ser Phe Pro Leu Pro Val Phe Leu Ser Ser Cys 
15 10 15 

Pro Phe Leu Met Ser Val Ser He Gly Phe Leu lie Leu Val Phe Asn 

20 25 30 

Val His Pro 
35 



<210> 454 

<211> 55 

<212> PRT 

<213> Homo sapiens 

<400> 454 

Met Val Asn He Phe Gly Phe Val Ser Cys He Val Phe Arg Cys Ser 
15 10 15 

* 

Cys Ser Ala Leu Leu His Glu Ser Asn His Arg Pro Tyr Leu Asn Lys 

20 25 30 
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Trp Ser Leu Leu Ser Thr Asn Lys Thr Leu Phe Arg Asn Asn Arg Gly 
35 40 45 

Leu Asp Leu Val Leu Val Cys 
50 55 



<210> 455 
<211> 78 
<212> PRT 

<213> Homo sapiens 
<400> 455 

Met Val Cys Phe Gin Ser Asn Lys 
1 5 

Leu Ser Phe Val Phe Val Leu Phe 

20 

Phe Leu Ser Leu Pro Phe Tyr lie 
35 40 

Glu Gin Trp Ala Phe His Asn Met 
50 55 

Glu Val Arg Gly Asn Leu lie His 
65 70 



Pro Ser Thr Ser Thr Trp Arg Gin 
10 15 

Cys Leu Phe Cys Leu Gly His Ala 
25 30 

Leu Ser lie lie Ala Met Cys Leu 

45 

Asn Ser Leu Tyr His His Glu Trp 

60 

Val Asp Phe Thr Leu Pro 
75 



<210> 456 
<211> 41 
<212> PRT 

<213> Homo sapiens 
<400> 456 

Met Asn Leu Met Val Arg Leu Leu Ala Leu Gly Leu lie Ser Gly Met 
15 10 15 

Met Ser Asn lie Thr Gin Ser His Ser Ser Lys He Ser Ala Phe Gly 

20 25 30 

lie Phe He Gly Pro Glu Gin Phe Leu 
35 40 



<210> 457 
<211> 56 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (32) 

<223> Xaa equals any amino acid 
<400> 457 

Met Leu Ser Phe Phe He Cys Leu Leu He Phe Val His Leu Leu Leu 
1 5 10 15 
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Leu Ser Phe Leu He Ser Asp Trp Pro Pro Pro Thr Gly Ser Ala Xaa 

20 25 30 

His Lys He Leu Arg Leu Met Val Val Gin Arg Leu Ser Leu Leu Asp 
35 40 45 

Gin Arg Lys Arg Trp Ser Glu Ala 
50 55 



<210> 458 
<211> 90 
<212> PRT 

<213> Homo sapiens 
<400> 458 

Met Ala He Arg Leu Val Phe Leu Ala Leu Ala Gly Leu Val Asp Gly 
15 10 15 

Lys Pro Val Trp He Thr Leu Trp Met Asp Ala Lys Arg Pro Asn Leu 

20 25 30 

Ala Gly Thr Gly Ser Thr Trp Gly Ser Arg Arg Asp Ser His Cys Cys 
35 40 45 

His Gly Pro Thr Ala Trp Ser Leu Pro Cys Leu Leu Cys Leu Phe Arg 
50 55 60 

Ala Gin Gin Lys Asp Arg Glu Arg Ser Leu Leu Gly Val Pro Leu Pro 
65 70 75 80 

Thr Leu Gin Gly Gly Asn Leu Ser Asp Gly 

85 90 



<210> 459 
<211> 282 
<212> PRT 

<213> Homo sapiens 
<400> 459 

Met Leu Ala Leu Thr Leu Ala Lys Ala Asp Ser Pro Arg Thr Ala Leu 
15 10 15 

Leu Cys Ser Ala Trp Leu Leu Thr Ala Ser Phe Ser Ala Gin Gin His 

20 25 30 

Lys Gly Ser Leu Gin Val His Gin Thr Leu Ser Val Glu Met Asp Gin 
35 40 45 

Val Leu Lys Ala Leu Ser Phe Pro Lys Lys Lys Ala Ala Leu Leu Ser 
50 55 60 

Ala Ala He Leu Cys Phe Leu Arg Thr Ala Leu Arg Gin Ser Phe Ser 
65 70 75 80 

Ser Ala Leu Val Ala Leu Val Pro Ser Gly Ala Gin Pro Leu Pro Ala 

85 90 95 
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Thr Lys Asp Thr Val Leu Ala Pro Leu Arg Met Ser Gin Val Arg Ser 

100 105 110 

Leu Val lie Gly Leu Gin Asn Leu Leu Val Gin Lys Asp Pro Leu Leu 
. 115 ' 120 125 

Ser Gin Ala Cys Val Gly Cys Leu Glu Ala Leu Leu Asp Tyr Leu Asp 
130 135 140 

Ala Arg Ser Pro Asp lie Ala Leu His Val Ala Ser Gin Pro Trp Asn 
145 150 155 160 

Arg Phe Leu Leu Phe Thr Leu Leu Asp Ala Gly Glu Asn Ser Phe Leu 

165 170 175 

Arg Pro Glu lie Leu Arg Leu Met Thr Leu Phe Met Arg Tyr Arg Ser 

180 185 190 

Ser Ser Val Leu Ser His Glu Glu Val Gly Asp Val Leu Gin Gly Val 
195 200 205 

Ala Leu Ala Asp Leu Ser Thr Leu Ser Asn Thr Thr Leu Gin Ala Leu 
210 215 220 

His Gly Phe Phe Gin Gin Leu Gin Ser Met Gly His Leu Ala Asp His 
225 230 235 240 

Ser Met Ala Gin Thr Leu Gin Ala Ser Leu Glu Gly Leu Pro Pro Ser 

245 250 255 

Thr Ser Ser Gly Gin Pro Pro Leu Gin Asp Met Leu Cys Leu Gly Gly 

260 265 270 

Val Ala Val Ser Leu Ser His He Arg Asn 
275 280 



<210> 460 
<211> 178 
<212> PRT 

<213> Homo sapiens 
<400> 460 

Met Leu Pro Leu Leu He He Cys Leu Leu Pro Ala He Glu Gly Lys 
15 10 15 

Asn Cys Leu Arg Cys Trp Pro Glu Leu Ser Ala Leu He Asp Tyr Asp 

20 25 30 

Leu Gin He Leu Trp Val Thr Pro Gly Pro Pro Thr Glu Leu Ser Gin 
35 40 45 

Ser He His Ser Leu Phe Leu Glu Asp Asn Asn Phe Leu Lys Pro Trp 
50 55 60 

Tyr Leu Asp Arg Asp His Leu Glu Glu Glu Thr Ala Lys Phe Phe Thr 
65 70 75 80 

Gin Val His Gin Ala He Lys Thr Leu Arg Asp Asp Lys Thr Val Leu 
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85 90 95 

Leu Glu Glu He Tyr Thr His Lys Asn Leu Phe Thr Glu Arg Leu Asn 

100 105 110 

Lys lie Ser Asp Gly Leu Lys Glu Lys Gly Ala Pro Pro Leu Ser Met 
115 120 125 

Asn Ala Phe Pro Ala Pro Ser Pro Thr Cys Thr Pro Glu Pro Leu Gly 
130 135 140 

Ser Val Cys Leu Pro Ser Thr Ser Val Ser Leu Pro Ser His Pro Pro 
145 150 155 160 

Trp Gin Pro Ala Met Ser Pro Val Pro Gly Thr Gly Gly Pro Pro Cys 

165 170 175 

Gly Leu 



<210> 461 
<211> 298 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (42) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (58) 

<223> Xaa equals any amino acid 
<400> 461 

Met Ala Arg Arg Ser Arg His Arg Leu Leu Leu Leu Leu Leu Arg Tyr 
15 10 15 

Leu Val Val Ala Leu Gly Tyr His Lys Ala Tyr Gly Phe Ser Ala Pro 

20 25 30 

Lys Asp Gin Gin Val Val Thr Ala Val Xaa Tyr Gin Glu Ala He Leu 
35 40 45 

Ala Cys Lys Thr Pro Lys Lys Thr Val Xaa Ser Arg Leu Glu Trp Lys 
50 55 60 

Lys Leu Gly Arg Ser Val Ser Phe Val Tyr Tyr Gin Gin Thr Leu Gin 
65 70 75 80 

Gly Asp Phe Lys Asn Arg Ala Glu Met He Asp Phe Asn He Arg He 

85 90 95 

Lys Asn Val Thr Arg Ser Asp Ala Gly Lys Tyr Arg Cys Glu Val Ser 

100 105 110 

Ala Pro Ser Glu Gin Gly Gin Asn Leu Glu Glu Asp Thr Val Thr Leu 
115 120 125 
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Glu Val Leu Val Ala Pro Ala Val Pro Ser Cys Glu Val Pro Ser Ser 
130 135 140 

Ala Leu Ser Gly Thr Val Val Glu Leu Arg Cys Gin Asp Lys Glu Gly 
145 150 155 160 

Asn Pro Ala Pro Glu Tyr Thr Trp Phe Lys Asp Gly lie Arg Leu Leu 

165 170 175 

Glu Asn Pro Arg Leu Gly Ser Gin Ser Thr Asn Ser Ser Tyr Thr Met 

180 185 190 

Asn Thr Lys Thr Gly Thr Leu Gin Phe Asn Thr Val Ser Lys Leu Asp 
195 200 205 

Thr Gly Glu Tyr Ser Cys Glu Ala Arg Asn Ser Val Gly Tyr Arg Arg 
210 215 220 

Cys Pro Gly Lys Arg Met Gin Val Asp Asp Leu Asn lie Ser Gly lie 
225 230 235 240 

He Ala Ala Val Val Val Val Ala Leu Val lie Ser Val Cys Gly Leu 

245 250 255 

Gly Val Cys Tyr Ala Gin Arg Lys Gly Tyr Phe Ser Lys Glu Thr Ser 

260 265 270 

Phe Gin Lys Ser Asn Ser Ser Ser Lys Ala Thr Thr Met Ser Glu Asn 
275 280 285 

Asp Phe Lys His Thr Lys Ser Phe He He 
290 295 



<210> 462 

<211> 46 

<212> PRT 

<213> Homo sapiens 

<400> 462 • 

Met Glu Pro Val Ala Leu Leu Gin Pro Thr Trp Trp Leu Leu Asn Val 
15 10 15 

Thr Leu Pro Leu Val Ala Trp Ser Gly Pro Leu He Cys Arg Pro Leu 

20 25 30 

Leu His Gly Glu Gly Arg Gin Gly Ala Ala Cys Leu Gin Gly 
35 40 45 



<210> 463 

<211> 44 

<212> PRT 

<213> Homo sapiens 

<400> 463 

Met Gly Trp Leu Trp Leu Glu Leu Leu Gly Leu Ser He Glu Glu Thr 
1 5 10 15 
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Leu Val Trp Ala Phe Leu Asn Lys Phe Leu Asp Ser Ser Ala Ala Leu 

20 25 30 

Leu Trp Arg lie Leu Gly Lys Ser Asn Leu Ser Thr 
35 40 



<210> 464 
<211> 158 
<212> PRT 

<213> Homo sapiens 
<400> 464 

Met Ala Leu Glu Val Leu Met Leu Leu Ala Val Leu lie Trp Thr Gly 
15 10 15 

Ala Glu Asn Leu His Val Lys lie Ser Cys Ser Leu Asp Trp Leu Met 

20 25 30 

Val Ser Val lie Pro Val Ala Glu Ser Arg Asn Leu Tyr lie Phe Ala 
35 40 45 

Asp Glu Leu His Leu Gly Met Gly Cys Pro Ala Asn Arg He His Thr 
50 55 60 

Tyr Val Tyr Glu Phe He Tyr Leu Val Arg Asp Cys Gly He Arg Thr 
65 70 75 80 

Arg Val Val Ser Glu Glu Thr Leu Leu Phe Gin Thr Glu Leu Tyr Phe 

85 90 95 

Thr Pro Arg Asn He Asp His Asp Pro Gin Glu He His Leu Glu Cys 

100 105 110 

Ser Thr Ser Arg Lys Ser Val Trp Leu Thr Pro Val Ser Thr Glu Asn 
115 120 125 

Glu He Lys Leu Asp Pro Ser Pro Phe He Ala Asp Phe Gin Thr Thr 
130 135 140 

Ala Glu Glu Leu Gly Leu Leu Ser Ser Ser Pro Asn Leu Leu 
145 150 " 155 



<210> 465 
<211> 101 
<212> PRT 

<213> Homo sapiens 

<220> 
<221> SITE 
<222> (67) 

<223> Xaa equals any amino acid 
<400> 465 

Met Glu Leu Glu Arg Cys Ser Val Val Leu Cys He Leu Ala Asn Leu 
15 10 15 
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Ala Val Leu Arg Ala Leu Phe Leu Pro Cys He He Phe His Cys Val 

20 25 30 

Ser Asp Ser Arg Ser Val Asn Arg Glu Thr Lys Val Lys Phe Val His 
35 40 45 

Thr Ser Val His Gly Val Gly His Ser Phe Val Gin Ser Ala Phe Lys 
50 55 60 

Ala Phe Xaa Leu Val Pro Pro Glu Ala Val Pro Glu Gin Lys Asp Pro 
65 70 75 80 

Asp Pro Glu Phe Pro Thr Val Lys Tyr Pro Asn Pro Glu Glu Gly Lys 

85 90 95 

Gly Val Leu Val Thr 

100 



<210> 466 
<211> 71 
<212> PRT 

<213> Homo sapiens 
<400> 466 

Met Val Gin Gly Pro Leu Thr His Leu Met Leu Val Leu Leu He Ser 
15 10 15 

Leu He Phe Leu Ser Arg Gly Ser Gly Arg Ala Trp Ala Phe Ser His 

20 25 30 

Ser Cys Phe Lys Thr Ser Asp Leu Leu Pro Cys Arg Asn Arg Trp Glu 
35 40 45 

Val He Glu Phe Leu His Tyr Ser Asn Leu His Ser His He Ser Leu 
50 55 60 

Ser Val Thr Lys Thr Phe Leu 
65 70 



<210> 467 
<211> 230 
<212> PRT 

<213> Homo sapiens 
<400> 467 

Met Ala Ser Leu Gly Leu Gin Leu Val Gly Tyr He Leu Gly Leu Leu 
15 10 15 

Gly Leu Leu Gly Thr Leu Val Ala Met Leu Leu Pro Ser Trp Lys Thr 

20 25 30 

Ser Ser Tyr Val Gly Ala Ser He Val* Thr Ala Val Gly Phe Ser Lys 
35 40 45 

Gly Leu Trp Met Glu Cys Ala Thr His Ser Thr Gly He Thr Gin Cys 
50 55 60 
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Asp lie Tyr Ser Thr Leu Leu Gly Leu Pro Ala Asp lie Gin Ala Ala 
65 70 75 80 

Gin Ala Met Met Val Thr Ser Ser Ala lie Ser Ser Leu Ala Cys lie 

85 90 95 

lie Ser Val Val Gly Met Arg Cys Thr Val Phe Cys Gin Glu Ser Arg 

100 105 110 

Ala Lys Asp Arg Val Ala Val Ala Gly Gly Val Phe Phe lie Leu Gly 
0 115 120 125 

■ 

Gly Leu Leu Gly Phe lie Pro Val Ala Trp Asn Leu His Gly lie Leu 
130 135 140 

Arg Asp Phe Tyr Ser Pro Leu Val Pro Asp Ser Met Lys Phe Glu lie 
145 150 155 160 

Gly Glu Ala Leu Tyr Leu Gly lie lie Ser Ser Leu Phe Ser Leu lie 

165 170 175 

Ala Gly lie lie Leu Cys Phe Ser Cys Ser Ser Gin Arg Asn Arg Ser 

180 185 190 

Asn Tyr Tyr Asp Ala Tyr Gin Ala Gin Pro Leu Ala Thr Arg Ser Ser 
195 200 205 

Pro Arg Pro Gly Gin Pro Pro Lys Val Lys Ser Glu Phe Asn Ser Tyr 
210 215 220 

Ser Leu Thr Gly Tyr Val 
225 230 



<210> 468 
<211> 37 
<212> PRT 

<213> Homo sapiens 
<400> 468 

Met Cys Tyr lie Pro Gly Ser Thr Gly Gly Gin Cys Trp Pro Trp Cys 
15 10 15 

Trp Cys Trp Leu Cys Arg Glu Ala Leu Glu Trp Leu Cys Gly Ala Val 

20 25 30 

Ser Ala Gly Pro Ala 
35 



<210> 469 
<211> 133 
<212> PRT 

<213> Homo sapiens 
<400> 469 

Met Arg Val Pro Leu Val Leu Ser Trp Ala Phe Val Leu Val Gly Phe 
15 10 15 
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Ser Gly Val Tyr Leu Ala Ser £lu Ser Phe Trp Phe Pro Pro Ser Leu 

20 25 30 

Cys Asp Leu Thr Ser Pro Pro Gly Leu His Leu Trp Lys Phe lie Arg 
35 40 45 

Asp Leu Val Ser Met Glu Glu Leu Thr Asp Ser Ala Arg Glu Met Gly 
50 55 60 

Tyr Trp Met Met Val Phe Ser Leu Lys Ala Met Phe Pro Val Ser Ser 
65 70 75 80 

Gly Cys Phe Gin Glu Arg Gin Glu Thr Asn Lys Ser Leu Thr Leu Leu 

85 90 95 

Arg Cys Ser Gin Arg Asp Thr Ser Pro Leu Met Asp Gly Gin Thr Trp 

100 105 110 

Ala Arg Val Arg Val Thr Lys Pro Pro Thr Thr Ala Thr Ala Ala Tyr 
115 120 125 

Asn Arg His lie Arg 
130 



<210> 470 
<211> 42 
<212> PRT 

<213> Homo sapiens 
<400> 470 

Met Phe Leu Phe He Thr Phe Thr He Leu Ala He Phe He He Glu 
15 10 15 

Pro Arg Asn Leu Arg Val Asp Leu Asn Leu He Lys Phe Gin Thr Ser 

20 25 30 

Trp Pro Lys Thr Leu Val Glu Glu Gin Asn 
35 40 



<210> 471 

<211> 56 

<212> PRT 

<213> Homo sapiens 

<400> 471 

Met Phe Leu Lys Val Leu Val Phe Leu ile Phe Phe Ser Pro Phe Ser 
15 10 15 

Ser Ser Leu Phe Ser Gly Glu Ala Val Arg Gly Arg Gly Ala Gly Leu 

20 25 30 

Gly Leu Gly Ile Gly Arg Gly Trp Thr Ser Cys Leu Ser Val Leu Asn 
35 40 45 

Gly Cys Asp Gly Ala Arg Ser His 
50 55 
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<210> 472 
<211> 52 
<212> PRT 

<213> Homo sapiens 
<400> 472 

Met Gly Pro Cys Arg Ala Ser Arg Cys Leu Ser Leu Leu Val Leu Phe 
15 10 15 

Pro Pro Gly Val Ala Gly Arg Pro Ala Pro Gly Arg Leu His Pro Val 

20 25 30 

Pro Thr Gly Pro Leu Pro Arg Met Tyr Ser Ala Gly Ala Arg Gly Arg 
35 40 45 

His Gly Ala His 
50 



<210> 473 
<211> 50 
<212> PRT 

<213> Homo sapiens 
<400> 473 

Met Asp Gly Gly Pro Gly Ala Phe Ser Arg Ala Trp Val Leu Gin lie 
15 10 15 

Pro Trp Leu Leu Leu Ser Gly Gly Asn Phe Ala Leu Cys Glu Pro Arg 

20 25 30 

Pro Cys Pro Ser Ala Gly His Pro Trp Gin Glu Ala Gly Leu Pro Ser 
35 40 45 

Ser Pro 
50 



<210> 474 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<400> 474 

Met Leu Val Ser Leu lie lie Cys Leu Leu Leu Asp Leu Leu Asn Gin 
15 10 15 

Pro Ser Leu Leu Arg Asp Leu lie Leu Lys Gin His Thr Gly Asn Pro 

20 25 30 

His Leu Ser Phe Pro Leu Lys Tyr Ser His Trp Met Gly 
35 40 45 



<210> 475 
<211> 168 
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<212> PRT 

<213> Homo sapiens 
<400> 475 

Met Val Thr Phe lie Thr Ala Thr Leu Trp lie Ala Val Phe Ser Tyr 
15 10 15 

lie Met Val Trp Leu Val Thr He He Gly Tyr Thr Leu Gly He Pro 

20 25 30 

Asp Val He Met Gly He Thr Phe Leu Ala Ala Gly Gin Val Ser Arg 
35 40 45 

Leu His Gly Gin Pro Asn Cys Gly Glu Thr Arg Pro Trp Gly His Gly 
50 55 60 

Ser Leu Gin His His Arg Ser Asn Val Phe Asp He Leu Val Gly Leu 
65 70 75 80 

Gly Val Pro Trp Gly Leu Gin Thr Met Val Val Asn Tyr Gly Ser Thr 

85 90 95 

Val Lys He Asn Ser Arg Gly Leu Val Tyr Ser Val Val Leu Leu Leu 

100 105 110 

Gly Ser Val Ala Leu Thr Val Leu Gly He His Leu Asn Lys Trp Arg 
115 120 125 

Leu Asp Arg Lys Leu Gly Val Tyr Val Leu Val Leu Tyr Ala He Phe 
130 135 140 

Leu Cys Phe Ser He Met He Glu Phe Asn Val Phe Thr Phe Val Asn 
145 150 155 160 

Leu Pro Met Cys Arg Glu Asp Asp 

165 



<210> 476 
<211> 43 
<212> PRT 

<213> Homo sapiens 
<400> 476 

Met Asn Leu He Phe Arg Leu Pro Cys He Leu Leu Thr Cys He Tyr 
15 10 15 

Val Gin Gin Cys Val Cys Lys Tyr He Gly Thr Phe Leu Asn Arg Val 

20 25 30 

Cys Ala Met Cys Lys Gly Leu Leu Thr Val Lys 
35 40 



<210> 477 
<211> 52 
<212> PRT 

<213> Homo sapiens 
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<400> 477 

Met Lys Cys Phe Phe Leu Phe Val Val lie Leu lie lie Met Lys Ser 
15 10 15 

Asn Leu Ser Asp lie lie lie Ala Thr Tyr Thr Tyr Cys lie Pro Asp 

20 25 30 

Tyr Phe Phe His Thr Phe lie Phe Asn Leu Ser Val Tyr Leu Asn Ser 
35 40 45' 

Lys Phe lie Ser 
50 



<210> 478 

<211> 51 

<212> PRT 

<213> Homo sapiens 

<400> 478 

Met lie Lys His Val Ala Trp Leu He Phe Thr Asn Cys He Phe Phe 
15 10 15 

Cys Pro Val Ala Phe Phe Ser Phe Ala Pro Leu He Thr Ala He Ser 

20 25 30 

He Ser Pro Glu He Met Lys Ser Val Thr Leu He Phe Phe Pro Cys 
35 40 45 

Leu Leu Ala 
50 



<210> 479 
<211> 118 
<212> PRT 

<213> Homo sapiens 
<400> 479 

Met Cys Tyr Leu Leu Leu Leu Leu He Gin Thr Ala Glu Leu Leu lie 
1 5 10 15 

His Pro Gin Gly Leu Gin Ala Val Ser Asn Gly Glu Ser Ala Leu Lys 

20 25 30 

Gly Thr Arg Pro Thr Phe Ser Ser Pro Phe He Leu Val Thr Glu Gly 
35 40 45 

Arg Lys Glu Trp Glu Gly Val Phe Leu Ser Ser Gly Trp Lys Gly Asn 
50 55 60 

Thr Leu Ser Asn Tyr Tyr He Ser Leu Val Phe Tyr Tyr Ser Arg He 
65 70 75 80 

m 

Leu Gin Pro Tyr Phe Tyr Cys Leu Trp Gly Lys Leu Glu Met Val Thr 

85 90 95 

Leu He Arg Ser Val Trp Arg Gly He Asn Gly Gly Asp Lys He Ser 

100 105 110 
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Val Gly Phe Gly Lys Cys 
115 



<210> 480 
<211> 169 
<212> PRT 

<213> Homo sapiens 
<400> 480 

Met Trp Ala Val Leu Arg Leu Ala Leu Arg Pro Cys Ala Arg Ala Ser 
15 10 15 

Pro Ala Gly Pro Arg Ala Tyr His Gly Asp Ser Val Ala Ser Leu Gly 

20 25 30 

Thr Gin Pro Asp Leu Gly Ser Ala Leu Tyr Gin Glu Asn* Tyr Lys Gin 
35 40 45 

Met Lys Ala Leu Val Asn Gin Leu His Glu Arg Val Glu His lie Lys 
50 55 60 

Leu Gly Gly Gly Glu Lys Ala Arg Ala Leu His lie Ser Arg Gly Lys 
65 70 75 80 

Leu Leu Pro Arg Glu Arg lie Asp Asn Leu lie Asp Pro Gly Ser Pro 

85 90 95 

Phe Leu Glu Leu Ser Gin Phe Ala Gly Tyr Gin Leu Tyr Asp Asn Glu 

100 105 110 

Glu Val Pro Gly Gly Gly He He Thr Gly He Gly Arg Val Ser Gly 
115 120 125 

Val Glu Cys Met He He Ala Asn Asp Ala Thr Val Lys Gly Gly Ala 
130 135 140 

Tyr Tyr Pro Val Thr Val Lys Lys Gin Leu Arg Ala Gin Glu He Ala 
145 150 155 160 

Met Gin Thr Gly Ser Pro Ala Ser Thr 

165 



<210> 481 
<211> 47 
<212> PRT 

<213> Homo sapiens 
<400> 481 

Met Thr Ala Gly Phe Met Gly Met Ala Val Ala He He Leu Phe Gly 
15 10 15 

Trp He He Gly Val Leu Gly Cys Cys Trp Asp Arg Gly Leu Met Gin 

20 25 30 

Tyr Val Ala Gly Cys Ser Ser Ser Trp Glu Gly Lys Gin Trp Asn 
35 40 45 
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<210> 482 
<211> 203 
<212> PRT 

<213> Homo sapiens 
<400> 482 

Met Gin Leu Gly Ser Val Leu Leu Thr Arg Cys Pro Phe Trp Gly Cys 
15 10 15 

Phe Ser Gin Leu Met Leu Tyr Ala Glu Arg Ala Glu Ala Arg Arg Lys 

20 25 30 

Pro Asp lie Pro Val Pro Tyr Leu Tyr Phe Asp Met Gly Ala Ala Val 
35 40 45 

Leu Cys Ala Ser Phe Met Ser Phe Gly Val Lys Arg Arg Trp Phe Ala 
50 55 60 

Leu Gly Ala Ala Leu Gin Leu Ala lie Ser Thr Tyr Ala Ala Tyr lie 
65 70 ' 75 80 

Gly Gly Tyr Val His Tyr Gly Asp Trp Leu Lys Val Arg Met Tyr Ser 

85 90 95 

Arg Thr Val Ala He He Gly Gly Phe Leu Val Leu Ala Ser Gly Ala 

100 105 110 

Gly Glu Leu Tyr Arg Arg Lys Pro Arg Ser Arg Ser Leu Gin Ser Thr 
115 120 125 

Gly Gin Val Phe Leu Gly He Tyr Leu He Cys Val Ala Tyr Ser Leu 
130 135 140 

Gin His Ser Lys Glu Asp Arg Leu Ala Tyr Leu Asn His Leu Pro Gly 
145 150 155 160 

Gly Glu Leu Met He Gin Leu Phe Phe Val Leu Tyr Gly He Leu Ala 

165 170 175 

Pro Gly Leu Ser Val Arg Leu Leu Arg Asp Pro Arg Cys Pro Asp Pro 

180 185 190 

Gly Cys Thr Ala Ala Pro Cys His Ala Ala His 
195 200 



<210> 483 
<211> 123 
<212> PRT 

<213> Homo sapiens 
<400> 483 

Met His Asp Gly Ser Lys Pro Phe Pro Arg Tyr Gly Tyr Lys Pro Ser 
1 5 10 15 

Pro Pro Asn Gly Cys Gly Ser Pro Leu Phe Gly Val His Leu Asn He 

20 25 30 
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Gly lie Pro Ser Leu Thr Lys Cys Cys Asn Gin His Asp Arg Cys Tyr 
35 40 45 

Glu Thr Cys Gly Lys Ser Lys Asn Asp Cys Asp Glu Glu Phe Gin Tyr 
50 55 60 

Cys Leu Ser Lys lie Cys Arg Asp Val Gin Lys Thr Leu Gly Leu Thr 
65 70 75 80 

Gin His Val Gin Ala Cys Glu Thr Thr Val Glu Leu Leu Phe Asp Ser 

85 90 95 

Val lie His Leu Gly Cys Lys Pro Tyr Leu Asp Ser Gin Arg Ala Ala 

100 105 110 

Cys Arg Cys His Tyr Glu Glu Lys Thr Asp Leu 
115 120 



<210> 484 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 484 

Leu Gly Ser Leu Ser Thr Ala Pro Ser Ser Ala Leu Pro Thr Leu Gly 
15 10 15 

Ala Arg Arg Thr Arg Ser Lys 

20 



<210> 485 
<211> 60 
<212> PRT 

<213> Homo sapiens 
<400> 485 

Met Gly Asn Cys Gin Ala Gly His Asn Leu His Leu Cys Leu Ala His 
15 10 15 

His Pro Pro Leu Val Cys Ala Thr Leu lie Leu Leu Leu Leu Gly Leu 

20 25 30 

Ser Gly Leu Gly Leu Gly Ser Phe Leu Leu Thr His Arg Thr Gly Leu 
35 40 45 

Arg Thr Leu Thr Ser Pro Arg Thr Gly Ser Leu Phe 
50 55 60 



<210> 486 
<211> 173 
<212> PRT 

<213> Homo sapiens 
<400> 486 
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Met Glu Ala Pro Gly Pro Arg Ala Leu Arg Thr Ala Leu Cys Gly Gly 
15 10 15 

Cys Cys Cys Leu Leu Leu Cys Ala Gin Leu Ala Val Ala Gly Lys Gly 

20 25 30 

Ala Arg Gly Phe Gly Arg Gly Ala Leu lie Arg Leu Asn lie Trp Pro 
35 40 45 

Ala Val Gin Gly Ala Cys Lys Gin Leu Glu Val Cys Glu His Cys Val 
50 55 60 

Glu Gly Asp Arg Ala Arg Asn Leu Ser Ser Cys Met Trp Glu Gin Cys 
65 70 75 80 

Arg Pro Glu Glu Pro Gly His Cys Val Ala Gin Ser Glu Val Val Lys 

85 90 95 

Glu Gly Cys Ser lie Tyr Asn Arg Ser Glu Ala Cys Pro Ala Ala His 

100 105 110 

His His Pro Thr Tyr Glu Pro Lys Thr Val Thr Thr Gly Ser Pro Pro 
115 120 125 

Val Pro Glu Ala His Ser Pro Gly Phe Asp Gly Ala Ser Phe lie Gly 
130 135 140 

Gly Val Val Leu Val Leu Ser Leu Gin Ala Val Ala Phe Phe Val Leu 
145 150 155 160 

His Phe Leu Lys Ala Lys Asp Ser Thr Tyr Gin Thr Leu 

170 





165 


<210> 


487 


<211> 


210 


<212> 


PRT 


<213> 


Homo sapiens 


<220> 




<221> 


SITE 


<222> 


(139) 


<223> 


Xaa equals any 


<220> 




<221> 


SITE 


<222> 


(187) 


<223> 


Xaa equals any 


<400> 


487 


Met Glu Ala Pro Gly 


1 


5 



10 15 

Cys Cys Cys Leu Leu Leu Cys Ala Gin Leu Ala Val Ala Gly Lys Gly 

20 25 30 

Ala Arg Gly Phe Gly Arg Gly Ala Leu lie Arg Leu Asn lie Trp Pro 
35 40 45 

Ala Val Gin Gly Ala Cys Lys Gin Leu Glu Val Cys Glu His Cys Val 
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50 55 60 

Glu Gly Asp Arg Ala Arg Asn Leu Ser Ser Cys Met Trp Glu Gin Cys 
65 70 75 80 

Arg Pro Glu Glu Pro Gly His Cys Val Ala Gin Ser Glu Val Val Lys 

85 90 95 

Glu Gly Cys Ser lie Tyr Asn Arg Ser Glu Ala Cys Pro Ala Ala His 

100 105 110 

His His Pro Thr Tyr Glu Pro Lys Thr Val Thr Thr Gly Ser Pro Pro 
115 120 125 

Val Pro Glu Ala His Ser Pro Gly Phe Asp Xaa Ala Ser Phe lie Gly 
130 135 140 

Gly Val Val Leu Val Leu Ser Leu Gin Ala Val Ala Phe Phe Val Leu 
145 150 155 160 

Thr Ser Ser Arg Pro Arg Thr Ala Pro Thr Arg Arg Cys Glu Tyr Leu 

165 170 175 

Ala Ser Ser Lys Tyr Leu Ser Pro Ser Ser Xaa Leu Val Pro Ala His 

180 185 190 

Val Pro Phe Ser Thr Gin Gly Ala Val Phe Ser Thr Gly Lys Pro Ser 
195 200 205 

Gly Arg 
210 



<210> 488 
<211> 105 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (70) 

<223> Xaa equals any amino acid 
<400> 488 

Met lie Ser Tyr He Val Leu Leu Ser He Leu Leu Trp Pro Leu Val 
15 10 15 

Val Tyr His Glu Leu He Gin Arg Met Tyr Thr Arg Leu Glu Pro Leu 

20 25 30 

Leu Met Gin Leu Asp Tyr Ser Met Lys Ala Glu Ala Asn Ala Leu His 
35 40 45 

His Lys His Asp Lys Arg Lys Arg Gin Gly Lys Asn Ala Pro Pro Gly 
50 55 60 

Gly Asp Glu Pro Leu Xaa Glu Thr Glu Ser Glu Ser Glu Ala Glu Leu 
65 70 75 80 

Ala Gly Phe Ser Pro Val Val Asp Val Lys Lys Thr Ala Leu Ala Leu 
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85 

Ala lie Tyr Arg Leu Arg Ala Val Arg 

100 105 



<210> 489 
<211> 89 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (24) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (75) 

<223> Xaa equals any amino acid 
<400> 489 

Met Phe Lys Asp Tyr 
1 5 

Leu Leu Leu Leu Leu 

20 

Gly Thr Ala lie Gin 
35 

Gly Ala Ser Trp Asp 
50 

Glu Val Ala Arg Ser 
65 

Arg Ala Val Val Val Gin Met Ala Gin 

85 



PCT/US02/08276 
90 95 



Pro Pro Ala lie Lys Pro Ser Tyr Asp Val Leu 

10 15 

Leu Val Xaa Leu Leu Gin Ala Gly Leu Asn Thr 

25 30 

Cys Val Arg Phe Lys Val Ser Ala Arg Leu Gin 
40 45 

Thr Gin Asn Gly Pro Gin Glu Arg Leu Ala Gly 
55 60 



Pro Leu Lys Glu Phe Xaa Lys Glu Lys Ala Trp 
70 75 80 



<210> 490 
<211> 127 
<212> PRT 
<213> Homo sapiens 

<400> 490 

Met Gly Gin Val Trp Arg Val Pro Pro Leu Leu Leu Ser Val Gin Val 
15 10 15 

Phe Leu Thr Met Ala His Ala Phe His Gin Ala Pro Glu Leu Gin Trp 

20 25 30 

Leu Gly Leu Trp Phe Trp Val Arg Leu Phe Ala Gly Gly Asp Gly Gly 
35 40 45 

Leu His Leu Asn lie Ser Ser Val Thr Leu Pro Leu Leu His Gly Lys 
50 55 60 
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Gin Leu Ser Arg Glu Val 
65 70 

Arg Pro Pro Tyr Lys Glu 

85 

Ser Trp Lys Gly Arg Phe 

100 

Trp Pro Arg Gly Lys Arg 
115 



Pro Ser Cys Gin Gly Lys 

75 

Pro Gin Asp Cys Ser His 

90 

Met Gly Phe Pro Gly Thr 
105 

Trp Leu Leu Gin Glu Phe 
120 



PCT/US02/08276 



Pro Arg Leu Gly 

80 

Gly Cys His Leu 
95 

Pro Arg Leu Ser 
110 

Asp Leu Ser 
125 



<210> 491 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 491 

Leu Gly Lys Pro Trp Arg Tyr Pro Thr 
1 5 



<210> 492 
<211> 91 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (84) 

<223> Xaa equals any amino acid 
<400> 492 

Met Tyr Gly Lys Ser Ser Thr Arg Ala Val Leu Leu Leu Leu Gly lie 
15 10 15 

Gin Leu Thr Ala Leu Trp Pro lie Ala Ala Val Glu lie Tyr Thr Ser 

20 25 30 

Arg Val Leu Glu Ala Val Asn Gly Thr Asp Ala Arg Leu Lys Cys Thr 
35 40 45 

Phe Ser Ser Phe Ala Pro Val Gly Asp Ala Leu Thr Val Thr Trp Asn 
50 55 60 

Phe Arg Pro Leu Asp Gly Gly Pro Glu Gin Phe Val Phe Tyr Tyr His 
65 70 75 80 

lie Asp Pro Xaa Pro Thr His Glu Trp Ala Val 

85 90 



<210> 493 
<211> 941 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (807) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (809) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (815) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (819) 

<223> Xaa equals any amino acid 
<400> 493 

Met Val Phe Leu Pro Leu Lys Trp Ser Leu Ala Thr Met Ser Phe Leu 
15 10 15 

Leu Ser Ser Leu Leu Ala Leu Leu Thr Val Ser Thr Pro Ser Trp Cys 

20 25 30 

Gin Ser Thr Glu Ala Ser Pro Lys Arg Ser Asp Gly Thr Pro Phe Pro 
35 40 45 

Trp Asn Lys lie Arg Leu Pro Glu Tyr Val lie Pro Val His Tyr Asp 
50 55 60 

Leu Leu He His Ala Asn Leu Thr Thr Leu Thr Phe Trp Gly Thr Thr 
65 70 75 80 

Lys Val Glu He Thr Ala Ser Gin Pro Thr Ser Thr He He Leu His 

85 90 95 

Ser His His Leu Gin He Ser Arg Ala Thr Leu Arg Lys Gly Ala Gly 

100 105 110 

Glu Arg Leu Ser Glu Glu Pro Leu Gin Val Leu Glu His Pro Pro Gin 
115 120 125 

Glu Gin He Ala Leu Leu Ala Pro Glu Pro Leu Leu Val Gly Leu Pro 
130 135 140 

Tyr Thr Val Val He His Tyr Ala Gly Asn Leu Ser Glu Thr Phe His 
145 150 155 160 

Gly Phe Tyr Lys Ser Thr Tyr Arg Thr Lys Glu Gly Glu Leu Arg He 

165 170 175 

Leu Ala Ser Thr Gin Phe Glu Pro Thr Ala Ala Arg Met Ala Phe Pro 

180 185 190 

Cys Phe Asp Glu Pro Ala Phe Lys Ala Ser Phe Ser He Lys He Arg 
195 200 205 

Arg Glu Pro Arg His Leu Ala He Ser Asn Met Pro Leu Val Lys Ser 
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210 



215 



220 



Val Thr Val Ala Glu Gly Leu He Glu Asp His Phe Asp Val Thr Val 
225 230 235 240 

Lys Met Ser Thr Tyr Leu Val Ala Phe He He Ser Asp Phe Glu Ser 

245 250 255 

Val Ser Lys He Thr Lys Ser Gly Val Lys Val Ser Val Tyr Ala Val 

260 265 270 

Pro Asp Lys Met Asn Gin Ala Asp Tyr Ala Leu Asp Ala Ala Val Thr 
275 280 285 

Leu Leu Glu Phe Tyr Glu Asp Tyr Phe Ser He Pro Tyr Pro Leu Pro 
290 295 300 



Lys Gin Asp Leu Ala Ala He Pro Asp Phe Gin Ser Gly Ala Met Glu 
305 310 315 320 

Asn Trp Gly Leu Thr Thr Tyr Arg Glu Ser Ala Leu Leu Phe Asp Ala 

325 330 335 

Glu Lys Ser Ser Ala Ser Ser Lys Leu Gly He Thr Met Thr Val Ala 

340 345 350 

His Glu Leu Ala His Gin Trp Phe Gly Asn Leu Val Thr Met Glu Trp 
355 360 365 

Trp Asn Asp Leu Trp Leu Asn Glu Gly Phe Ala Lys Phe Met Glu Phe 
370 375 380 

Val Ser Val Ser Val Thr His Pro Glu Leu Lys Val Gly Asp Tyr Phe 
385 390 395 400 

Phe Gly Lys Cys Phe Asp Ala Met Glu Val Asp Ala Leu Asn Ser Ser 

405 410 415 

His Pro Val Ser Thr Pro Val Glu Asn Pro Ala Gin He Arg Glu Met 

420 425 430 

Phe Asp Asp Val Ser Tyr Asp Lys Gly Ala Cys He Leu Asn Met Leu 
435 440 445 

Arg Glu Tyr Leu Ser Ala Asp Ala Phe Lys Ser Gly He Val Gin Tyr 
450 455 460 

Leu Gin Lys His Ser Tyr Lys Asn Thr Lys Asn Glu Asp Leu Trp Asp 
465 470 475 480 

Ser Met Ala Ser He Cys Pro Thr Asp Gly Val Lys Gly Met Asp Gly 



Phe Cys Ser Arg Ser Gin His Ser Ser Ser Ser Ser His Trp His Gin 

500 505 510 

Glu Gly Val Asp Val Lys Thr Met Met Asn Thr Trp Thr Leu Gin Arg 
515 520 525 

Gly Phe Pro Leu He Thr He Thr Val Arg Gly Arg Asn Val His Met 



485 



490 



495 



530 



535 



540 
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Lys Gin Glu His Tyr Met Lys Gly Ser Asp Gly Ala Pro Asp Thr Gly 
545 550 555 560 

Tyr Leu Trp His Val Pro Leu Thr Phe He Thr Ser Lys Ser Asp Met 

565 570 575 

Val His Arg Phe Leu Leu Lys Thr Lys Thr Asp Val Leu He Leu Pro 

580 585 590 

Glu Glu Val Glu Trp He Lys Phe Asn Val Gly Met Asn Gly Tyr Tyr 
595 600 605 

He Val His Tyr Glu Asp Asp Gly Trp Asp Ser Leu Thr Gly Leu Leu 
610 615 620 

Lys Gly Thr His Thr Ala Val Ser Ser Asn Asp Arg Ala Ser Leu He 
625 630 635 640 

Asn Asn Ala Phe Gin Leu Val Ser He Gly Lys Leu Ser He Glu Lys 

645 650 655 

Ala Leu Asp Leu Ser Leu Tyr Leu Lys His Glu Thr Glu He Met Pro 

660 665 670 

Val Phe Gin Gly Leu Asn Glu Leu He Pro Met Tyr Lys Leu Met Glu 
675 680 685 

Lys Arg Asp Met Asn Glu Val Glu Thr Gin Phe Lys Ala Phe Leu He 
690 695 700 

Arg Leu Leu Arg Asp Leu He Asp Lys Gin Thr Trp Thr Asp Glu Gly 
705 710 715 720 

Ser Val Ser Glu Arg Met Leu Arg Ser Glu Leu Leu Leu Leu Ala Cys 

725 730 735 

Val His Asn Tyr Gin Pro Cys Val Gin Arg Ala Glu Gly Tyr Phe Arg 

740 745 750 

Lys Trp Lys Glu Ser Asn Gly Asn Leu Ser Leu Pro Val Asp Val Thr 
755 760 765 

Leu Ala Val Phe Ala Val Gly Ala Gin Ser Thr Glu Gly Trp Asp Phe 
770 775 780 

Leu Tyr Ser Lys Tyr Gin Phe Ser Leu Ser Ser Thr Glu Lys Ser Gin 
785 790 795 800 

He Glu Phe Ala Leu Cys Xaa Pro Xaa Asn Lys Glu Lys Leu Xaa Trp 

805 810 815 

Leu Leu Xaa Glu Ser Phe Lys Gly Asp Lys He Lys Thr Gin Glu Phe 

820 825 830 

Pro Gin He Leu Thr Leu He Gly Arg Asn Pro Val Gly Tyr Pro Leu 
835 840 845 

Ala Trp Gin Phe Leu Arg Lys Asn Trp Asn Lys Leu Val Gin Lys Phe 
850 855 860 
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Glu Leu Gly Ser Ser Ser lie Ala His Met Val Met Gly Thr Thr Asn 
865 870 875 880 

Gin Phe Ser Thr Arg Thr Arg Leu Glu Glu Val Lys Gly Phe Phe Ser* 

885 890 895 

Ser Leu Lys Glu Asn Gly Ser Gin Leu Arg Cys Val Gin Gin Thr lie 

900 905 910 

Glu Thr He Glu Glu Asn He Gly Trp Met Asp Lys Asn Phe Asp Lys 
915 920 925 

He Arg Val Trp Leu Gin Ser Glu Lys Leu Glu Arg Met 
930 935 940 



<210> 494 
<211> 157 
<212> PRT 
<213> Homo sapiens 

<400> 494 

Met Val Lys Ser Val He Phe Leu Ser Phe Trp Gin Gly Met Leu Leu 
1 5 10 15 

Ala He Leu Glu Lys Cys Gly Ala He Pro Lys He His Ser Ala Arg 

20 25 30 

Val Ser Val Gly Glu Gly Thr Val Ala Ala Gly Tyr His Asp Phe He 
35 40 45 

He Cys Val Glu Met Phe Phe Ala Ala Leu Ala Leu Arg His Pro Phe 
50 55 60 

Thr Tyr Asn Val Tyr Ala Asp Lys Arg Leu Asp Ala Gin Gly Arg Cys 
65 70 75 80 

Ala Pro Met Lys Ser He Ser Ser Ser Leu Lys Glu Thr Met Asn Pro 

85 90 95 

His Asp He Val Gin Asp Ala He His Asn Phe Ser Pro Ala Tyr Gin 

100 105 110 

Gin Tyr Thr Gin Gin Ser Thr Leu Glu Pro Gly Pro Thr Trp Arg Gly 
115 120 125 

Gly Ala His Gly Leu Ser Arg Ser His Ser Leu Ser Gly Ala Arg Asp 
130 135 140 

Asn Glu Lys Thr Leu Leu Leu Ser Ser Asp Asp Glu Phe 
145 150 155 



<210> 495 
<211> 118 
<212> PRT 
<213> Homo sapiens 

<400> 495 
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Phe Leu Ser Ser Trp Gin Arg Pro Ala Cys Gly Cys Gin Arg Pro Ala 
15 10 15 

Leu Pro Leu His Leu Gly Gly Ala Glu Gin Leu Gly Pro Ser Cys Pro 

20 25 30 

Gly Gly Trp Val Gin Thr Gin Ala Glu Asp Gin Pro Trp Pro Cys Pro 
35 40 • 45 

Ala lie Cys Phe His Gin Ala Val Ser Pro Pro Trp Leu Pro Phe Ser 
50 55 60 

Leu Gin Ala Lys Val Leu Leu lie Pro Thr Pro Leu Val Phe Ala Cys 
65 70 75 80 

Pro Ala Leu Leu Phe Ala Trp Arg Val Gly Gly Ala Gin Trp Gin Gly 

85 90 95 

He Ser Gly Pro Trp Gly Arg Gly Asp Gly Asn Met Cys Pro Thr Ala 

100 105 110 

Pro Ser Pro Pro Pro Pro 
115 



<210> 496 
<211> 59 
<212> PRT 

<213> Homo sapiens 
<400> 496 

Met Met Lys Asp Val Phe Phe Phe Leu Phe Leu Leu Ala Val Trp Val 
15 10 15 

Val Ser Phe Gly Val Ala Lys Gin Ala He Leu He His Asn Glu Arg 

20 25 30 

Arg Val Asp Trp Leu Phe Arg Gly Pro Ser Thr Thr Pro Thr Ser Pro 
35 40 45 

Ser Ser Gly Arg Ser Arg Ala Thr Ser Thr Val 
50 55 



<210> 497 
<211> 109 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (94) 

<223> Xaa equals any amino acid 
<400> 497 

Met Asn Thr Leu Val Leu Trp He Phe Gly Phe Leu He Cys Leu Gly 
15 10 15 

He He Leu Ala He Gly Asn Ser lie Trp Glu Ser Gin Thr Gly Asp 
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20 25 30 

Gin Phe Arg Thr Phe Leu Phe Trp Asn Glu Gly Glu Lys Ser Ser Val 
35 40 45 

Phe Ser Gly Phe Leu Thr Phe Trp Ser Tyr lie lie lie Leu Asn Thr 
50 55 60 

Val Val Pro He Ser Leu Tyr Val Ser Val Glu Val He Arg Leu Gly 
65 70 75 80 

His Ser Tyr Phe He Asn Trp Asp Arg Lys Met Tyr Tyr Xaa Arg Lys 

85 90 95 

Ala He Pro Ala Val Ala Arg Thr Thr Thr Leu Asn Glu 

100 105 



<210> 498 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals any amino acid 
<400> 498 

He Asn His Val Phe He Trp Gly Ser He Ala He Tyr Phe Ser He 
15 10 15 

Leu Phe Thr Met His Ser Asn Gly He Phe Gly He Phe Pro Asn Gin 

20 25 30 

Phe Pro Phe Val Gly Asn Ala Arg His Ser Leu Thr Xaa Lys 
35 40 45 



<210> 499 
<211> 6 
<212> PRT 

<213> Homo sapiens 
<400> 499 

Thr Val Ala He Tyr Asp 
1 5 



<210> 500 
<211> 11 
<212> PRT 

<213> Homo sapiens 
<400> 500 

Phe Leu Val Cys Leu Leu Leu Gly Pro Arg Ser 
15 10 
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<210> 


501 


<211> 


56 


<212> 


PRT 


<213> 


Homo sapiens 


<220> 




<221> 


SITE 


<222> 


(35) 


<223> 


Xaa equals any amino acid 


<220> 




<221> 


SITE 


<222> 


(42) 


<223> 


Xaa equals any amino acid 


<220> 




<221> 


SITE 


<222> 


(46) 


<223> 


Xaa equals any amino acid 


<400> 


501 


Lys Ser Gin Met Gin Ser Phe Thr 


1 


5 



10 15 

Leu Ser Leu Thr Cys Leu Pro Thr Leu Asn Gin Met Leu Val Phe Lys 

20 25 30 

Ser Asn Xaa Ser Leu Val Ser Pro His Xaa Leu Thr Phe Xaa Asn He 
35 40 45 

Phe Ala Arg Phe Glu Asn Phe Gin 
50 55 



<210> 502 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 502 

Asn Tyr Asn Arg Gly Gly Thr Phe Leu Tyr Gin Lys Ala Lys He Lys 
15 10 15 

His His Val Leu Met Val Phe Tyr Lys Ser Thr Ser Asn Ser Thr Glu 

20 25 30 

Ser Leu He Trp Ser Leu Leu Asn Ser Trp Ser Asp Lys Val Thr Phe 
35 40 45 

Pro Lys Arg Val Arg 
50 



<210> 503 
<211> 46 
<212> PRT 

<213> Homo sapiens 
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<400> 503 

Met Pro Trp Leu Lys Ser Leu Leu His Phe Ser Leu Phe Leu Val Val 
15 10 15 

Phe Ser Thr Leu Ala Val Lys Ser Leu Gly Val Pro Val Ala Ala Gly 

20 25 30 

Ser Pro Phe Cys He Val Asp Val Leu His Phe He Leu Leu 
35 40 45 



<210> 504 
<211> 64 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (7) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (27) 

<223> Xaa equals any amino acid 
<400> 504 

Ser Trp Val He Val Val Xaa He Trp Gly Tyr Leu Leu Glu Gly His 
15 10 15 

Gly Val Pro Phe Cys Lys Ser Tyr Gly Pro Xaa Pro Trp Lys Leu His 

20 25 30 

Thr His His Ala Ala Tyr Asn Ser Gly Ser Ser Gin Val Tyr Arg He 
35 40 45 

Leu Gly Asn Ser Pro Cys Pro Val Leu He His Cys Ser Phe Ser Gly 
50 55 60 



<210> 505 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (9) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (14) 

<223> Xaa equals any amino acid 
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<400> 505 

TrpvLys Gly Leu Leu Glu Gly Ser Xaa Glu Ala Thr Met Xaa 
1 5 ~ 10 



<210> 506 
<211> 107 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (66) 

<223> Xaa equals any amino acid 
<400> 506 

Pro Leu Gly Arg Glu Pro Leu Ala Gly Phe Leu Ser Phe Leu Ser Phe 
15 10 15 

Ser Leu Leu Trp Cys Leu Glu Ala Phe Pro Arg Leu Gin Phe Leu Thr 

20 25 30 

Thr Leu Thr Asp Phe Ala lie Val Leu Ser Pro Pro Leu Ser Phe Pro 
35 40 45 

Lys Leu Thr Leu Trp Arg Leu lie Lys Arg Lys Asn His Arg Pro Gly 
50 55 60 

Ala Xaa Leu Thr Pro Arg Arg Arg Ala Asn His Leu Arg Cys Gly Val 
65 70 75 80 

Arg Asp Gin Pro Asp Gin Asn Arg Glu Thr Pro Ser Leu Leu Asn Asn 

85 90 95 

Thr Lys Leu Ala Gly Arg Gly Gly Ala Arg Leu 

100 105 



<210> 507 
<211> 127 
<212> PRT 

<213> Homo sapiens 
<400> 507 

Met Pro Arg Ala Pro Trp Arg lie Pro Leu Cys Ala Leu Pro Thr Leu 
1 5 10 15 

Cys Leu Gly Ser Pro Leu Pro Ser Gin Pro Thr His Pro He Phe Tyr 

20 25 30 

Asp His Arg Ala Pro Thr Trp Lys Met Ala His Pro Gly Gly Pro Arg 
35 40 45 

Ser Ser His Ser Pro Arg Gly Pro Gly Gly His Pro Ala Leu Arg Gin 
50 55 60 

Arg Leu Pro Cys Arg Arg Gly Glu Pro Glu Thr Ala Leu Cys Ser Ser 
65 70 75 80 
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Ala Pro Gly Ala Gly Phe Ala Glu Pro Pro Cys Lys Ala Ser Pro Gly 

85 90 95 

Trp Gly Pro Pro Ser Arg Gly Pro Gin Gly Asp Arg Ser Gin Gly Glu 

100 105 110 

Trp Leu Pro Ala Leu Gly Thr Pro Cys Gly Gly Pro Asp Asp Ser 
115 120 125 



<210> 508 

<211> 90 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (31) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (57) 

<223> Xaa equals any amino acid 
<400> 508 

Met Pro Arg Ala Pro Trp Arg lie Pro Leu Cys Ala Leu Pro Thr Leu 
1 5 10 15 

Cys Leu Gly Ser Pro Leu Pro Ser Gin Pro Thr His Pro lie Xaa Tyr 

20 25 30 

Asp His Arg Ala Pro Thr Trp Lys Met Ala His Pro Gly Gly Pro Arg 
35 40 45 

Ser Ser His Ser Pro Arg Thr Trp Xaa Thr Pro Ser Ser Gin Thr Lys 
50 55 60 

Ala Ala Leu Pro Ala Gly Gly Ala Arg Asn Ser Pro Leu Gin Leu Cys 
65 70 75 80 

Thr Arg Ser Arg Phe Cys Gly Thr Pro Met 

85 90 



<210> 509 
<211> 308 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (87) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (185) 

<223> Xaa equals any amino acid 
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<400> 509 

Met Pro Val Pro Trp Phe Leu Leu Ser Leu Ala Leu Gly Arg Ser Pro 
15 10 15 

Val Val Leu Ser Leu Glu Arg Leu Val Gly Pro Gin Asp Ala Thr His 

20 25 30 

Cys Ser Pro Gly Leu Ser Cys Arg Leu Trp Asp Ser Asp lie Leu Cys 
35 40 45 

Leu Pro Gly Asp lie Val Pro Ala Pro Gly Pro Val Leu Ala Pro Thr 
50 55 60 

His Leu Gin Thr Glu Leu Val Leu Arg Cys Gin Lys Glu Thr Asp Cys 
65 70 75 80 

Asp Leu Cys Leu Arg Val Xaa Val His Leu Ala Val His Gly His Trp 

85 90 95 

Glu Glu Pro Glu Asp Glu Glu Lys Phe Gly Gly Ala Ala Asp Leu Gly 

100 105 110 

Val Glu Glu Pro Arg Asn Ala Ser Leu Gin Ala Gin Val Val Leu Ser 
115 120 125 

Phe Gin Ala Tyr Pro Thr Ala Arg Cys Val Leu Leu Glu Val Gin Val 
130 135 140 

Pro Ala Ala Leu Val Gin Phe Gly Gin Ser Val Gly Ser Val Val Tyr 
145 150 155 160 

Asp Cys Phe Glu Ala Ala Leu Gly Ser Glu Val Arg lie Trp Ser Tyr 

165 170 175 

Thr Gin Pro Arg Tyr Glu Lys Glu Xaa Asn His Thr Gin Gin Leu Pro 

180 185 190 

Asp Cys Arg Gly Leu Glu Val Trp Asn Ser He Pro Ser Cys Trp Ala 
195 200 205 

Leu Pro Trp Leu Asn Val Ser Ala Asp Gly Asp Asn Val His Leu Val 
210 215 220 

Leu Asn Val Ser Glu Glu Gin His Phe Gly Leu Ser Leu Tyr Trp Asn 
225 230 235 240 

Gin Val Gin Gly Pro Pro Lys Pro Arg Trp His Lys Asn Leu Thr Gly 

245 250 255 

Pro Gin He He Thr Leu Asn His Thr Asp Leu Val Pro Cys Leu Cys 

260 265 270 

He Gin Val Trp Pro Leu Glu Pro Asp Ser Val Arg Arg Thr Ser Ala 
275 280 285 

Pro Ser Gly Arg Thr Pro Ala His Thr Arg Thr Ser Gly Lys Pro Pro 
290 295 300 

Asp Cys Asp Cys 
305 
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<210> 510 
<211>' 55 
<212> PRT 

<213> Homo sapiens 
<400> 510 

Met Ser Ser Asp Phe Leu Cys Phe Phe Phe Lys Leu Cys Asn Gin Met 
1 5 10 15 

lie Leu Cys Phe Phe Phe Arg Gly Ala Glu Tyr Trp Phe Leu Leu Leu 

20 25 30 

Val Val Phe Ser Phe Leu Cys His Ser Cys Phe Phe Phe Val Phe Ser 
35 40 45 

Val Ser Asn Thr lie Cys He 
50 55 



<210> 511 
<211> 98 
<212> PRT 

<213> Homo sapiens 
<400> 511 

Met His Cys Cys Gin Leu Pro Trp Arg Cys Ala Gin Ala Pro Gin Glu 
15 10 15 

Ala Phe Leu Leu Cys Leu Leu Phe Leu He Leu Val Leu Val Leu Leu 

20 25 30 

Gly Cys Ser Arg Gly Leu Pro Gly His Thr Pro Trp Arg Leu His Pro 
35 40 45 

Ala Ala Ala Ala Leu Leu Ala Pro Leu Leu His Asp Ala Leu Gly Ala 
50 55 60 

Cys Gly Phe Gin Gly Pro Glu Tyr Leu Leu Pro Cys Leu Leu Pro Leu 
65 70 75 80 

Pro Lys Pro Gly Gin Leu Gin Gly Pro Trp Gly Pro Leu Trp Ala Leu 

85 90 95 

Leu Pro 



<210> 512 
<211> 22 
<212> PRT 

<213> Homo sapiens 
<400> 512 

Leu Pro Arg Pro Cys Ala Pro Ser Pro Val Trp Arg Gin Val Gly Arg 
15 10 15 
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Glu Glu Ala Ser Leu Leu 

20 



<210> 513 

<211> 25 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (9) 

<223> Xaa equals any amino acid 
<400> 513 

Cys Ala Val Arg Phe Arg Glu Gin Xaa Ala Pro Glu Arg Val Phe Leu 
15 10 15 

Pro Thr Arg Gly Arg Lys Ser Glu Pro 

20 25 



<210> 514 
<211> 365 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (144) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (201) 

<223> Xaa equals any amino acid 
<400> 514 

Met Phe Val Gly Leu Met Ala Phe Leu Leu Ser Phe Tyr Leu lie Phe 
15 10 15 

Thr Asn Glu Gly Arg Ala Leu Lys Thr Ala Thr Ser Leu Ala Glu Gly 

20 25 30 

Leu Ser Leu Val Val Ser Pro Asp Ser lie His Ser Val Ala Pro Glu 
35 40 45 

Asn Glu Gly Arg Leu Val His lie lie Gly Ala Leu Arg Thr Ser Lys 
50 55 60 

Leu Leu Ser Asp Pro Asn Tyr Gly Val His Leu Pro Ala Val Lys Leu 
65 70 75 80 

Arg Arg His Val Glu Met Tyr Gin Trp Val Glu Thr Glu Glu Ser Arg 

85 90 95 

Glu Tyr Thr Glu Asp Gly Gin Val Lys Lys Glu Thr Arg Tyr Ser Tyr 

100 105 110 
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Asn Thr Glu Trp Arg Ser Glu He He Asn Ser Lys Asn Phe Asp Arg 
115 120 125 

Glu He Gly His Lys Asn Pro Ser Ala Met Ala Val Glu Ser Phe Xaa 
130 135 140 

Ala Thr Ala Pro Phe Val Gin He Gly Arg Phe Phe Leu Ser Ser Gly 
145 150 155 160 

Leu He Asp Lys Val Asp Asn Phe Lys Ser Leu Ser Leu Ser Lys Leu 

165 170 175 

Glu Asp Pro His Val Asp lie lie Arg Arg Gly Asp Phe Phe Tyr His 

180 185 190 

Ser Glu Asn Pro Lys Tyr Pro Glu Xaa Gly Asp Leu Arg Val Ser Phe 
195 200 205 

Ser Tyr Ala Gly Leu Ser Gly Asp Asp Pro Asp Leu Gly Pro Ala His 
210 215 220 

Val Val Thr Val lie Ala Arg Gin Arg Gly Asp Gin Leu Val Pro Phe 
225 230 235 240 

Ser Thr Lys Ser Gly Asp Thr Leu Leu Leu Leu His His Gly Asp Phe 

245 250 255 

Ser Ala Glu Glu Val Phe His Arg Glu Leu Arg Ser Asn Ser Met Lys 

260 265 270 

Thr Trp Gly Leu Arg Ala Ala Gly Trp Met Ala Met Phe Met Gly Leu 
275 280 285 

Asn Leu Met Thr Arg He Leu Tyr Thr Leu Val Asp Trp Phe Pro Val 
290 295 300 

Phe Arg Asp Leu Val Asn He Gly Leu Lys Ala Phe Ala Phe Cys Val 
305 310 315 320 

Ala Thr Ser Leu Thr Leu Leu Thr Val Ala Ala Gly Trp Leu Phe Tyr 

325 330 335 

Arg Pro Leu Trp Ala Leu Leu lie Ala Gly Leu Ala Leu Val Pro He 

340 345 350 

Leu Val Ala Arg Thr Arg Val Pro Ala Lys Lys Leu Glu 
355 360 365 



<210> 515 
<211> 108 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals any amino acid 
<220> 
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<221> SITE 
<222> (55) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (58) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (67) 

<223> Xaa equals any amino acid 
<400> 515 

Met Phe Tyr Lys Leu Thr Leu lie Leu Cys Glu Leu Ser Val Ala Gly 
15 10 15 

Val Thr Gin Ala Ala Ser Gin Arg Pro Leu Gin Arg Leu Pro Arg His 

20 25 30 

lie Cys Ser Gin Arg Asn Pro Pro Gly Arg Cys Leu Leu Lys Ala Xaa 
35 40 45 

Leu Gin Thr Thr Trp Gly Xaa Pro Asp Xaa Gin Phe Pro Gly Cys Pro 
50 55 60 

His Pro Xaa Arg Val Thr Leu Asn Ala Arg Gin Met Gly Asn Gly Lys 
65 70 75 80 

Glu Lys Lys Ala Ala Asp Leu Lys Leu Lys Phe Pro Gin Lys Arg Phe 

85 90 95 

Tyr Leu Ser Ala Phe Ser Glu Arg lie Lys Ala Phe 

100 105 



<210> 516 

<211> 73 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (38) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (48) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (54) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (55). 
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<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (68) 

<223> Xaa equals any amino acid 
<400> 516 

Met Phe Tyr Lys Leu Thr Leu lie Leu Cys Glu Leu Ser Val Ala Gly 
15 10 15 

Val Thr Gin Ala Ala Ser Gin Arg Pro Leu Gin Arg Leu Pro Arg His 

20 25 30 

lie Cys Ser Gin Arg Xaa Pro Pro Gly Arg Cys Leu Leu Lys Ala Xaa 
35 40 45 

Leu Gin Thr Thr Trp Xaa Xaa Pro Asp Lys Pro lie Pro Arg Leu Ser 
50 55 60 

Pro Pro Leu Xaa Ser Asp Pro Lys Arg 
65 70 



<210> 517 
<211> 81 
<212> PRT 

<213> Homo sapiens 
<400> 517 

Met Ser Lys Arg Ser Ala Ser Phe lie Leu Leu Pro Leu Leu Phe Leu 
15 10 15 

Lys Gly Ser Phe Ala Lys Leu Asn Ala Arg lie Ser Asp Cys Leu Glu 

20 25 30 

Glu Arg Tyr Cys His Asn Leu Trp Met Val Phe Gin Gly Cys Val lie 
35 40 45 

Thr Glu Leu His Leu Ser Arg Met Ser Lys Thr Leu Ser Ser Leu Cys 
50 55 60 

Tyr Asp Phe Val lie Asn Val Tyr lie Phe Phe Lys Phe Leu Asp lie 
65 70 75 80 

Thr 



<210> 518 
<211> 122 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> SITE 
<222> (89) 

<223> Xaa equals any amino acid 



325 



WO 02/076488 PCT/US02/08276 



<220> 

<221> SITE 
<222> (91) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (94) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (97) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (98) 

<223> Xaa equals any amino acid 
<400> 518 

Met His Arg Ser Glu Pro Phe Leu Lys Met Ser Leu Leu lie Leu Leu 
15 10 15 

Phe Leu Gly Leu Ala Glu Ala Cys Thr Pro Arg Glu Val Asn Leu Leu 

20 25 30 

Lys Gly lie lie Gly Leu Met Ser Arg Leu Ser Pro Asp Glu lie Leu 
35 40 45 

Gly Leu Leu Ser Leu Gin Val Leu His Glu Glu Thr Ser Gly Cys Lys 
50 55 60 

Glu Glu Val Lys Pro Phe Ser Gly Thr Thr Pro Ser Arg Lys Pro Leu 
65 70 75 80 

4 

Pro Lys Arg Glu Glu His Val Glu Xaa Pro Xaa Asn Ala Xaa Thr Trp 

85 90 95 

Xaa Xaa Thr Tyr Leu Phe Val Ser Tyr Asn Lys Gly Asp Trp Phe Thr 

100 105 110 • 

Phe Ser Ser Gin Val Leu Leu Pro Leu Leu 
115 120 



<210> 519 
<211> 11 
<212> PRT 

<213> Homo sapiens 
<400> 519 

Met Ser Gly Gly Leu Ser Phe Leu Leu Leu Val 
15 10 



<210> 520 
<211> 130 
<212> PRT 
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<213> Homo sapiens 
<400> 520 

Ser Thr Cys Cys Gly Trp Gly Pro Leu Gly His Ser Arg Val Arg Gly 
15 10 15 

Cys His Cys His Leu Gly His Val Gly Arg His Gin His Phe Val Val 

20 25 30 

Thr Asn Ser Thr Val Thr Asn lie Phe Gly Gin lie Pro Phe Tyr Thr 
35 40 45 

Ser Arg Gin Leu Leu Val Cys Asn Pro Thr Gly Gin Arg Glu Gly Pro 
50 55 60 

Val Thr Trp Leu Ser His Cys Pro Ala Pro Gin Met Val Leu Gly Leu 
65 70 75 80 

Leu Phe Ser Leu Gly Pro Ala Asn Thr Thr Val Phe Thr Ser Ala His 

85 90 95 

Trp Leu Ser Ala Val Val Pro Gly Ser Gin Trp His Val Ser Pro Arg 

100 105 110 

Ser Ser Leu lie Pro Gin His Thr Pro Lys Gly Ser Val Ala Asn Thr 
115 120 125 

Leu Asn 
130 



<210> 521 
<211> 122 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (19) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (73) 

<223> Xaa equals any amino acid 
<400> 521 

Lys Ala Pro Ser Ser His Pro Gly Leu Thr Cys Val Ser Leu Ser Arg 
15 10 15 

Leu Gin Xaa Ser Leu Ser Leu Cys Phe Pro Ser Gly Pro Cys Trp Ala 

20 25 30 

Gly Leu Leu Ser Ser Leu Ala Leu Ala Gly Gly Ala Pro Gly Ala Leu 
35 40 45 

Pro Pro Trp Gin Pro Gly Gin Asp Ser Lys Met Arg Thr Ala Glu Leu 
50 55 60 

Val Gly Gly Ser His Gly Pro Ala Xaa Gly Pro Gly Glu Ala Glu Pro 
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65 70 75 80 

Glu Pro Thr Ala Val Val Leu Trp Thr Val Asp Pro Glu Gly Gly Leu 

85 90 95 

Gly Gin Val Pro Ala Glu Gly Pro Gly Gly Leu Cys Val Pro Leu Gly 

100 105 110 

Pro Gly Ala Leu Val Thr Trp Thr Pro Gly 
115 120 



<210> 522 
<211> 243 
<212> PRT 
<213> Homo sapiens 

<400> 522 

Met Gly Thr Leu Pro Trp Leu Leu Ala Phe Phe lie Leu Gly Leu Gin 
15 10 15 

Ala Trp Asp Thr Pro Thr lie Val Ser Arg Lys Glu Trp Gly Ala Arg 

20 25 30 

Pro Leu Ala Cys Arg Ala Leu Leu Thr Leu Pro Val Ala Tyr lie lie 
35 40 45 

Thr Asp Gin Leu Pro Gly Met Gin Cys Gin Gin Gin Ser Val Cys Ser 
50 55 60 

Gin Met Leu Arg Gly Leu Gin Ser His Ser Val Tyr Thr lie Gly Trp 
65 70 75 80 

Cys Asp Val Ala Tyr Asn Phe Leu Val Gly Asp Asp Gly Arg Val Tyr 

85 90 95 

Glu Gly Val Gly Trp Asn lie Gin Gly Leu His Thr Gin Gly Tyr Asn 

100 105 110 

Asn lie Ser Leu Gly lie Ala Phe Phe Gly Asn Lys He Ser Ser Ser 
115 120 125 

Pro Ser Pro Ala Ala Leu Ser Ala Ala Glu Gly Leu He Ser Tyr Ala 
130 135 140 

He Gin Lys Gly His Leu Ser Pro Arg Tyr He Gin Pro Leu Leu Leu 
145 150 155 160 

Lys Glu Glu Thr Cys Leu Asp Pro Gin His Pro Val Met Pro Arg Lys 

165 170 175 

Val Cys Pro Asn He He Lys Arg Ser Ala Trp Glu Ala Arg Glu Thr 

180 185 190 

His Cys Pro Lys Met Asn Leu Pro Ala Lys Tyr Val He He He His 
195 200 205 

Thr Ala Gly Thr Ser Cys Thr Val Ser Thr Asp Cys Gin Thr Val Val 
210 215 220 
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Arg Asn lie Gin Ser Phe His Met 'Asp Thr Arg Asn Phe Cys Asp lie 
225 230 235 240 



Gly Tyr Gin 



<210> 523 
<211> 154 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (150) 

<223> Xaa equals any amino acid 
<400> 523 

Met Ala Arg His Gly Leu Pro lieu Leu Pro Leu Leu Ser Leu Leu Val 
15 10 15 

Gly Ala Trp Leu Lys Leu Gly Asn Gly Gin Ala Thr Ser Met Val Gin 

20 25 30 

Leu Gin Gly Gly Arg Phe Leu Met Gly Thr Asn Ser Pro Asp Ser Arg 
35 40 45 

Asp Gly Glu Gly Pro Val Arg Glu Ala Thr Val Lys Pro Phe Ala He 
50 55 60 

Asp He Phe Pro Val Thr Asn Lys Asp Phe Arg Asp Phe Val Arg Glu 
65 70 75 80 

Lys Lys Tyr Arg Thr Glu Ala Glu Met Phe Gly Trp Ser Phe Val Phe 

85 90 95 

Glu Asp Phe Val Ser Asp Glu Leu Arg Asn Lys Ala Thr Gin Pro Met 

100 105 110 

Lys Ser Val Leu Trp Trp Leu Pro Val Glu Lys Ala Phe Trp Arg Gin 
115 120 125 

Pro Ala Gly Pro Gly Ser Gly He Arg Glu Arg Leu Glu His Pro Val 
130 135 140 

Leu His Val Ser Trp Xaa Asp Ala Arg Ala 
145 150 



<210> 524 
<211> 57 
<212> PRT 

<213> Homo sapiens 
<400> 524 

Met Pro Cys Thr Cys Thr Trp Arg Asn Trp Arg Gin Trp He Arg Pro 
15 10 15 

Leu Val Ala Val He Tyr Leu Val Ser He Val Val Ala Val Pro Leu 
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20 25 30 

Cys Val Trp Glu Leu Gin Lys Leu Glu Val Gly lie His Thr Lys Ala 
35 40 45 

Trp Phe lie Ala Gly lie Phe Leu Leu 
50 55 



<210> 525 
<211> 107 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (92) 

<223> Xaa equals any amino acid 
<400> 525 

Met Val Arg Tyr Thr Tyr Ser Met Leu Ser Val lie Gly lie Ser Tyr 
15 10 15 

Ala Val Leu Thr Trp Leu Ser Gin Thr Leu Trp Met Pro He Tyr Pro 

20 25 30 

Leu Cys Val Leu Ala Glu Ala Phe Ala He Tyr Gin Ser Leu Pro Tyr 
35 40 45 

Phe Glu Ser Phe Gly Thr Tyr Ser Thr Lys Leu Pro Phe Asp Leu Ser 
50 55 60 

He Tyr Phe Pro Tyr Val Leu Lys He Tyr Leu Met Met Leu Phe He 
65 70 75 80 

Gly Met Tyr Phe Thr Tyr Ser His Leu Tyr Ser Xaa Arg Arg Asp He 

85 90 95 

Leu Gly He Phe Pro He Lys Lys Lys Lys Met 

100 105 



<210> 526 
<211> 37 
<212> PRT 

<213> Homo sapiens 
<400> 526 

Met Val Arg Tyr Thr Tyr Ser Met Leu Ser Val lie Gly lie Ser Tyr 
15 10 15 

Ala Val Leu Thr Trp Ala Gin Ser Asn Thr Met Asp Ala Asn Leu Ser 

20 25 30 

Phe Val Cys Ser Cys 
35 
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<210> 527 
<211> 46 
<212> PRT 

<213> Homo sapiens 
<400> 527 

Met Lys Ser Gin Cys Tyr Ser Pro Ser Tyr Phe Ala Phe Phe Cys Leu 
15 10 15 

Val Phe Phe Gin lie Thr Ser Ala Ser Ser Gin Thr Leu Arg Gly His 

20 25 30 

Val Leu Cys Arg Thr Thr Leu Arg Asp Ser Ser Ala Tyr Cys 
35 40 45 



<210> 528 
<211> 442 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE ; 
<222> (364) 

<223> Xaa equals any amino acid 
<400> 528 

Met Trp Phe Thr Tyr Leu Leu Leu Tyr Leu His Ser Val Arg Ala Tyr 
15 10 15 

Ser Ser Arg Gly Ala Gly Cys Cys Cys Cys Trp Ala Arg Trp Arg Arg 

20 25 30 

Ala Val His Thr Ala Arg Gly Leu Arg Gly Arg Pro Arg Arg Gin Leu 
35 40 45 

Leu Arg Pro Leu Arg Pro Ala Gin Gly Leu Ala Pro Gly Arg His Arg 
50 55 60 

Leu Arg Pro Ala Val Leu Pro Leu His Leu Gin Pro Leu Pro Gly Leu 
65 70 75 80 

Trp Gly Gly His Ala Glu Trp Ala Ala Leu Leu Tyr Tyr Gly Pro Phe 

85 90 95 

He Val lie Phe Gin Phe Gly Trp Ala Ser Thr Gin He Ser His Leu 

100 105 110 

Ser Leu He Pro Glu Leu Val Thr Asn Asp His Glu Lys Val Glu Leu 
115 120 125 

Thr Ala Leu Arg Tyr Ala Phe Thr Val Val Ala Asn He Thr Val Tyr 
130 135 140 

Gly Ala Ala Trp Leu Leu Leu His Leu Gin Gly Ser Ser Arg Val Glu 
145 150 155 160 

Pro Thr Gin Asp He Ser He Ser Asp Gin Leu Gly Gly Gin Asp Val 

165 170 175 
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Pro Val Phe Arg Asn Leu Ser Leu Leu Val Val Gly Val Gly Ala Val 

180 185 190 

Phe Ser Leu Leu Phe His Leu Gly Thr Arg Glu Arg Arg Arg Pro His 
195 200 205 

Ala Glu Glu Pro Gly Glu His Thr Pro Leu Leu Ala Pro Ala Thr Ala 
210 215 220 

Gin Pro Leu Leu Leu Trp Lys His Trp Leu Arg Glu Pro Ala Phe Tyr 
225 230 235 240 

Gin Val Gly lie Leu Tyr Met Thr Thr Arg Leu lie Val Asn Leu Ser 

245 250 255 

Gin Thr Tyr Met Ala Met Tyr Leu Thr Tyr Ser Leu His Leu Pro Lys 

260 265 270 

Lys Phe lie Ala Thr He Pro Leu Val Met Tyr Leu Ser Gly Phe Leu 
275 280 285 

Ser Ser Phe Leu Met Lys Pro He Asn Lys Cys He Gly Arg Asn Met 
290 295 300 

Thr Tyr Phe Ser Gly Leu Leu Val He Leu Ala Phe Ala Ala Trp Val 
305 310 315 320 

Ala Leu Ala Glu Gly Leu Gly Val Ala Val Tyr Ala Ala Ala Val Leu 

325 330 335 

Leu Gly Ala Gly Cys Ala Thr He Leu Val Thr Ser Leu Ala Met Thr 

340 345 350 

Ala Asp Leu He Gly Pro His Thr Asn Ser Gly Xaa Phe Val Tyr Gly 
355 360 365 

Ser Met Ser Phe Leu Asp Lys Val Ala Asn Gly Leu Ala Val Met Ala 
370 375 380 

He Gin Ser Leu His Pro Cys Pro Ser Glu Leu Cys Cys Arg Ala Cys 
385 390 395 400 

Val Ser Phe Tyr His Trp Ala Met Val Ala Val Thr Gly Gly Val Gly 

405 410 415 

Val Ala Ala Ala Leu Cys Leu Cys Ser Leu Leu Leu Trp Pro Thr Arg 

420 425 430 

Leu Arg Arg Trp Asp Arg Asp Ala Arg Pro 
435 440 



<210> 529 
<211> 309 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (26) 
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<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (84) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (111) 

<223> Xaa equals any amino acid 
<400> 529 

Ala Ala Asp Asn Tyr Gly lie Pro Arg Ala Cys Arg Asn Ser Ala Arg 
15 10 15 

Ser Tyr Gly Ala Ala Trp Leu Leu Leu Xaa Pro Ala Gly Ser Ser Arg 

20 25 30 

Val Glu Pro Thr Gin Asp He Ser He Ser Asp Gin Leu Gly Gly Gin 
35 40 45 

Asp Val Pro Val Phe Arg Asn Leu Ser Leu Leu Val Val Gly Val Gly 
50 55 60 

Ala Val Phe Ser Leu Leu Phe His Leu Gly Thr Arg Glu Arg Arg Arg 
65 70 75 80 

Pro His Ala Xaa Glu Pro Gly Glu His Thr Pro Leu Leu Ala Pro Ala 

85 90 95 

Thr Ala Gin Pro Leu Leu Leu Trp Lys His Trp Leu Arg Glu Xaa Ala 

100 105 110 

Phe Tyr Gin Val Gly He Leu Tyr Met Thr Thr Arg Leu He Val Asn 
115 120 125 

Leu Ser Gin Thr Tyr Met Ala Met Tyr Leu Thr Tyr Ser Leu His Leu 
130 135 140 

Pro Lys Lys Phe He Ala Thr He Pro Leu Val Met Tyr Leu Ser Gly 
145 150 155 160 

Phe Leu Ser Ser Phe Leu Met Lys Pro He Asn Lys Cys He Gly Arg 

165 170 175 

Asn Met Thr Tyr Phe Ser Gly Leu Leu Val He Leu Ala Phe Ala Ala 

180 185 190 

Trp Val Ala Leu Ala Glu Gly Leu Gly Val Ala Val Tyr Ala Ala Ala 
195 200 205 

Val Leu Leu Gly Ala Gly Cys Ala Thr He Leu Val Thr Ser Leu Ala 
210 215 220 

Met Thr Ala Asp Leu He Gly Pro His Thr Asn Ser Gly Ala Phe Val 
225 230 235 240 

Tyr Gly Ser Met Ser Phe Leu Asp Lys Val Ala Asn Gly Leu Ala Val 

245 250 255 
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Met Ala lie Gin Ser Leu His Pro Cys Pro Ser Glu Leu Cys Cys Arg 

260 265 270 

Ala Cys Val Ser Phe Tyr His Trp Ala Met Val Ala Val Thr Gly Gly 
275 280 285 

Val Gly Val Ala Ala Ala Leu Cys Leu Cys Ser Leu Leu Leu Trp Pro 
230 295 300 

Thr Arg Leu Arg Arg 
305 



<210> 530 
<211> 243 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (26) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (84) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (111) 

<223> Xaa equals any amino acid 
<400> 530 

Ala Ala Asp Asn Tyr Gly lie Pro Arg Ala Cys Arg Asn Ser Ala Arg 
15 10 15 

Ser Tyr Gly Ala Ala Trp Leu Leu Leu Xaa Pro Ala Gly Ser Ser Arg 

20 25 30 

Val Glu Pro Thr Gin Asp lie Ser lie Ser Asp Gin Leu Gly Gly Gin 
35 40 45 

Asp Val Pro Val Phe Arg Asn Leu Ser Leu Leu Val Val Gly Val Gly 
50 55 60 

Ala Val Phe Ser Leu Leu Phe His Leu Gly Thr Arg Glu Arg Arg Arg 
65 70 75 80 

Pro His Ala Xaa Glu Pro Gly Glu His Thr Pro Leu Leu Ala Pro Ala 

85 90 95 

Thr Ala Gin Pro Leu Leu Leu Trp Lys His Trp Leu Arg Glu Xaa Ala 

100 105 110 

Phe Tyr Gin Val Gly lie Leu Tyr Met Thr Thr Arg Leu lie Val Asn 
115 120 125 

Leu Ser Gin Thr Tyr Met Ala Met Tyr Leu Thr Tyr Ser Leu His Leu 
130 135 140 
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Pro Lys Lys Phe lie Ala Thr lie Pro Leu Val Met Tyr Leu Ser Gly 
145 150 155 160 

Phe Leu Ser Ser Phe Leu Met Lys Pro lie Asn Lys Cys lie Gly Arg 

165 170 175 

Asn Met Thr Tyr Phe Ser Gly Leu Leu Val lie Leu Ala Phe Ala Ala 

180 185 190 

Trp Val Ala Leu Ala Glu Gly Leu Gly Val Ala Val Tyr Ala Ala Ala 
195 200 205 

Val Leu Leu Gly Ala Gly Cys Ala Thr lie Leu Val Thr Ser Leu Ala 
210 215 220 

Met Thr Ala Asp Leu lie Gly Pro His Thr Asn Ser Gly Leu Ser Cys 
225 230 235 240 

Thr Ala Pro 



<210> 531 
<211> 148 
<212> PRT 

<213> Homo sapiens 
<400> 531 

Met Ala Gly Ser Pro Leu Leu Trp Gly Pro Arg Ala Gly Gly Val Gly 
15 10 15 

Leu Leu Val Leu Leu Leu Leu Gly Leu Phe Arg Pro Pro Pro Ala Leu 

20 25 30 

Cys Ala Arg Pro Val Lys Glu Pro Arg Gly Leu Ser Ala Ala Ser Pro 
35 40 45 

Pro Leu Ala Arg Leu Ala Leu Leu Ala Ala Ser Gly Gly Gin Cys Pro 
50 55 60 

Glu Val Arg Arg Arg Gly Arg Cys Arg Pro Gly Ala Gly Ala Gly Ala 
65 70 75 80 

Ser Ala Gly Ala Glu Arg Gin Glu Arg Ala Arg Ala Glu Ala Gin Arg 

85 90 95 

Leu Arg lie Ser Arg Arg Ala Ser Trp Arg Ser Cys Cys Ala Ser Gly 

100 105 110 

Ala Pro Pro Ala Thr Leu lie Arg Leu Trp Ala Trp Thr Thr Thr Pro 
115 120 125 

Thr Arg Leu Gin Arg Ser Ser Leu Ala Leu Cys Ser Ala Pro Ala Leu 
130 135 140 

Thr Leu Pro Pro 
145 
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<210> 532 

<211> 65 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (24) 

<223> Xaa equals any amino acid 
<400> 532 

Met Cys Lys Gly Leu Lys Asn Pro Glu Gly Leu Leu Leu Leu Leu Leu 
15 10 15 

Leu Leu Leu Phe Thr Asp Thr Xaa Asn Ser His Cys Leu Pro Pro Tyr 

20 25 30 

Leu Ser Cys Phe Leu His Glu Arg Gin Pro Glu Leu Gin Ser Val Cys 
35 40 45 

lie Ser Ala Ala Tyr Val Leu Ala Pro Leu Gin Asn Pro Val Ser Ser 
50 55 60 

Leu 
65 



<210> 533 
<211> 299 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (172) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (174) 

<223> Xaa equals any amino acid 
<400> 533 

Gly Gly Glu Glu Glu Gly Glu Glu Gly Ala Glu lie Ser Gly Leu Gly 
15 10 15' 

Ala Gly Arg Arg Ser Ala Pro lie Ala Val Gly Leu Gly Phe Leu Gly 

20 25 30 

Val Gly Gly Arg Gly Gly Ser Asp Met Glu Ala Asn Gly Ser Gin Gly 
35 40 45 

Thr Ser Gly Ser Ala Asn Asp Ser Gin His Asp Pro Gly Lys Met Phe 
50 55 60 

lie Gly Gly Leu Ser Trp Gin Thr Ser Pro Asp Ser Leu Arg Asp Tyr 
65 70 75 80 

Phe Ser Lys Phe Gly Glu lie Arg Glu Cys Met Val Met Arg Asp Pro 
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85 90 95 

Thr Thr Lys Arg Ser Arg Gly Phe Gly Phe Val Thr Phe Ala Asp Pro 

100 105 110 

Ala Ser Val Asp Lys Val Leu Gly Gin Pro His His Glu Leu Asp Ser 
115 120 125 

Lys Thr lie Asp Pro Lys Val Ala Phe Pro Arg Arg Ala Gin Pro Lys 
130 135 140 

Met Val Thr Arg Thr Lys Lys lie Phe Val Gly Gly Leu Ser Ala Asn 
145 150 155 160 

Thr Val Val Glu Asp Val Lys Gin Tyr Phe Glu Xaa Phe Xaa Lys Val 

165 170 175 

Glu Asp Ala Met Leu Met Phe Asp Lys Thr Thr Asn Arg His Arg Gly 

180 185 190 

Phe Gly Phe Val Thr Phe Glu Asn Glu Asp Val Val Glu Lys Val Cys 
195 200 205 

Glu He His Phe His Glu He Asn Asn Lys Met Val Glu Cys Lys Lys 
210 215 220 

Ala Gin Pro Lys Glu Val Met Phe Pro Pro Gly Thr Arg Gly Arg Ala 
225 230 235 240 

Arg Gly Leu Pro Tyr Thr Met Asp Ala Phe Met Leu Gly Met Gly Met 

245 250 255 

Leu Gly Glu Ser Gly Gin Asp Arg Arg Ser Pro Trp Thr Gly Arg Ala 

260 265 270 

Met Glu Ala Ser Thr Pro Asn Trp Val Thr Tyr Gin Trp Gly Lys Leu 
275 280 285 

Leu His Leu Ser Lys Pro Gin Phe Pro Cys Leu 
290 295 



<210> 534 
<211> 306 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (171) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (180) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (182) 



337 



WO 02/076488 



PCT/US02/08276 



<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (188) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (208) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (210) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (211) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (218) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (219) 

<223> Xaa equals any amino acid 
<400> 534 

Met Ala Leu Arg Leu Leu Arg Arg Ala Ala Arg Gly Ala Ala Ala Ala 
15 10 15 

Ala Leu Leu Arg Leu Lys Ala Ser Leu Ala Ala Asp lie Pro Arg Leu 

20 25 30 

Gly Tyr Ser Ser Ser Ser His His Lys Tyr He Pro Arg Arg Ala Val 
35 40 45 

Leu Tyr Val Pro Gly Asn Asp Glu Lys Lys lie Lys Lys lie Pro Ser 
50 55 60 

Leu Asn Val Asp Cys Ala Val Leu Asp Cys Glu Asp Gly Val Ala Ala 
65 70 75 80 

Asn Lys Lys Asn Glu Ala Arg Leu Arg lie Val Lys Thr Leu Glu Asp 

85 90 95 

lie Asp Leu Gly Pro Thr Glu Lys Cys Val Arg Val Asn Ser Val Ser 

100 105 110 

Ser Gly Leu Ala Glu Glu Asp Leu Glu Thr Leu Leu Gin Ser Arg Val 
115 120 125 

Leu Pro Ser Ser Leu Met Leu Pro Lys Val Glu Ser Pro Glu Glu lie 
130 135 140 

Gin Trp Ala Val Cys Glu Glu Thr Leu Lys Val Gly Pro Gin Val Gly 
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145 150 155 160 

Leu Phe Leu Asp Ala Val Arg Phe Trp Arg Xaa Arg Leu Ser Ser His 

165 170 175 

lie Gly Ala Xaa Ser Xaa Lys Glu Thr Leu Asp Xaa Leu Tyr Ala Arg 

180 185 190 

Gin Lys lie Val Val He Ala Lys Ala Phe Gly Leu Gin Ala Val Xaa 
195 200 205 

Leu Xaa Xaa He Asp Phe Arg Asp Gly Xaa Xaa Leu Leu Arg Gin Ser 
210 215 220 

Arg Glu Gly Ala Ala Met Gly Phe Thr Gly Lys Gin Val He His Pro 
225 230 235 240 

Asn Gin He Ala Val Val Gin Glu Gin Phe Ser Pro Ser Pro Glu Lys 

245 250 255 

He Lys Trp Ala Glu Glu Leu He Ala Ala Phe Lys Glu His Gin Gin 

260 265 270 

Leu Gly Lys Gly Ala Phe Thr Phe Gin Gly Ser Met He Asp Met Pro 
275 280 285 

Leu Leu Lys Gin Ala Gin Asn Thr Val Thr Leu Ala Thr Ser He Lys 
290 295 300 

Glu Lys 
305 



<210> 535 
<211> 64 
<212> PRT 

<213> Homo sapiens 
<400> 535 

Met Val Ser Pro Leu He Ser Ala Leu Phe His Val Pro Phe Leu Trp 
15 10 15 

Leu Gly Met Phe Phe Pro His Ser Leu Ser Gly Pro Phe Pro Ser His 

20 25 30 

Leu Arg Arg Ala Ser Ser Ser Arg Lys Pro Leu Val Lys Pro Pro Arg 
35 40 45 

Ala Arg Gin Tyr Pro Pro Leu Ala Ser Ser Gly Tyr Arg Gly Arg He 
50 55 60 



<210> 536 
<211> 26 
<212> PRT 

<213> Homo sapiens 
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<400> 536 

Met Ser Phe Pro His Ala Ser Thr 
1 5 

Leu Gin His Thr Leu Pro Asn His 

20 



Leu Pro Phe His Lys Leu Ser Asp 
10 15 

Gin Gly 

25 



<210> 537 
<211> 50 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> SITE 
<222> (4) 

<223> Xaa equals any amino acid 



<220> 

<221> SITE 
<222> (10) 
<223> Xaa equals 

<220> 

<221> SITE 
<222> (22) 
<223> Xaa equals 

<220> 

<221> SITE 

<222> (35) 

<223> Xaa equals 

<220> 

<221> SITE 
<222> (39) 
<223> Xaa equals 

<220> 

<221> SITE 
<222> (42) 
<223> Xaa equals 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



any amino acid 



<400> 537 
Val His Ala Xaa 
1 

Leu Tyr Trp Cys 

20 

Arg Gly Xaa Glu 
35 



Thr Pro Phe Ala 
5 

Tyr Xaa Asn Pro 

Gin Arg Xaa Pro 

40 



Gly Xaa Cys Phe 
10 

Gly Thr His Cys 
25 

Ser Xaa Arg Ser 



Asp Pro Val Ser 
15 

Tyr Pro Thr Leu 
30 

His lie Val Leu 
45 



Arg Ser 
50 



<210> 538 
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<211> 57 
<212> PRT 

<213> Homo sapiens 
<400> 538 

Met Pro Pro His Arg Gin Thr Asp Gly Gin Met Gly Leu Pro Ala Pro 
15 10 15 

Ala Leu Trp Val Trp Gly Leu Leu Leu Ser Ser Ser Phe Gin Thr Leu 

20 25 30 

Leu Pro Ala Phe Pro Lys Pro Pro Ala Leu Asn Leu Gly Cys Ser Thr 
35 40 45 

Arg Pro lie Pro Ser Phe Leu Lys lie 
50 55 



<210> 539 
<211> 93 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (24) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (65) 

<223> Xaa equals any amino acid 
<400> 539 

Gin Val Ser Leu Pro Thr Arg Leu Leu Gin Met Pro Gly Met Gly Leu 
1 5 10 15 

Asp Ser Arg Phe Gin Ala Trp Xaa Pro Ser Pro Tyr Leu Gly Pro Gin 

20 25 30 

Pro Arg Ala Pro Arg Pro Gly Leu Gin Pro Gly Pro Ser Leu Arg Gly 
35 40 45 

Ala Glu Phe Arg Glu Ser Cys Pro Arg Ser Gin Lys Arg Gly Arg Glu 
50 55 60 

Xaa Gly Arg Pro Cys Pro Gly Cys Arg Pro Gly Gly Trp Gly Leu Pro 
65 70 75 80 

Ala Arg Leu Gly Gin Pro Gin Leu Gin Thr Gly Pro Gly 

85 90 



<210> 540 
<211> 172 
<212> PRT 

<213> Homo sapiens 
<220> 
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<221> SITE 
<222> (170) 

<223> Xaa equals any amino acid 

« 

<400> 540 

Met Arg Gly Ser Val Glu Cys Thr Trp Gly Trp Gly His Cys Ala Pro 
15 10 15 

Ser Pro Leu Leu Leu Trp Thr Leu Leu Leu Phe Ala Ala Pro Phe Gly 

20 25 30 

Leu Leu Gly Glu Lys Thr Arg Gin Leu Leu Glu Phe Asp Ser Thr Asn 
35 40 45 

Val Ser Asp Thr Ala Ala Lys Pro Leu Gly Arg Pro Tyr Pro Pro Tyr 
50 55 60 

Ser Leu Ala Asp Phe Ser Trp Asn Asn lie Thr Asp Ser Leu Asp Pro 
65 70 75 80 

Ala Thr Leu Ser Ala Thr Phe Gin Gly His Pro Met Asn Asp Pro Thr 

85 90 95 

Arg Thr Phe Ala Asn Gly Ser Leu Ala Phe Arg Val Gin Ala Phe Ser 

100 105 110 

Arg Ser Ser Arg Pro Ala Gin Pro Pro Arg Leu Leu His Thr Ala Asp 
115 120 125 

Thr Cys Gin Leu Glu Val Ala Leu lie Gly Ala Ser Pro Arg Gly Asn 
130 135 140 

Arg Ser Leu Phe Gly Leu Glu Val Ala Thr Leu Gly Gin Gly Pro Asp 
145 150 155 160 

Cys Pro Ser Met Gin Glu Gin His Ser Xaa Glu Arg 

165 170 



<210> 541 
<211> 131 
<212> PRT 
<213> Homo sapiens 

<400> 541 

Met Arg Gly Ser Val Glu Cys Thr Trp Gly Trp Gly His Cys Ala Pro 
15 10 15 

Ser Pro Leu Leu Leu Trp Thr Leu Leu Leu Phe Ala Ala Pro Phe Gly 

20 25 30 

Leu Leu Gly Glu Lys Thr Arg Gin Leu Leu Glu Phe Asp Ser Thr Asn 
35 40 45 

Val Ser Asp Thr Ala Ala Lys Pro Leu Gly Arg Pro Tyr Pro Pro Tyr 
50 55 60 

Ser Leu Ala Asp Phe Ser Trp Asn Asn lie Thr Asp Ser Leu Asp Pro 
65 70 75 80 
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Ala Thr Leu Ser Ala Thr Phe Gin Gly His Pro Met Asn Asp Pro Thr 

85 90 95 

Arg Thr Phe Ala Asn Gly Ser Leu Ala Phe Arg Ser Arg Pro Phe Pro 

100 105 110 

Gly Pro Ala Asp Gin Pro Asn Pro Leu Ala Ser Cys Thr Gin Gin Thr 
115 120 125 

Pro Val Ser 
130 



<210> 542 
<211> 121 
<212> PRT 
<213> Homo sapiens 

<400> 542 

Met Cys Phe Leu Met lie Phe Thr Phe Leu Val Cys Trp Met Pro Tyr 
1 '5 10 15 

He Val He Cys Phe Leu Val Val Asn Gly His Gly His Leu Val Thr 

20 25 30 

Pro Thr He Ser He Val Ser Tyr Leu Phe Ala Lys Ser Asn Thr Val 
35 40 45 

Tyr Asn Pro Val He Tyr Val Phe Met He Arg Lys Phe Arg Arg Ser 
50 55 60 

Leu Leu Gin Leu Leu Cys Leu Arg Leu Leu Arg Cys Gin Arg Pro Ala 
65 70 75 80 

Lys Asp Leu Pro Ala Ala Gly Ser Glu Met Gin He Arg Pro He Val 

85 90 95 

Met Ser Gin Lys Asp Gly Asp Arg Pro Lys Lys Ser Asp Phe Gin Leu 

100 105 110 

Phe Phe His His Phe Tyr His His Gin 
115 120 



<210> 543 

<211> 49 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals any amino acid 
<400> 543 

Met Gly Ala His Ser Phe Gly Phe Gin Leu Phe Met Ser Val Ser Val 
15 10 15 
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Leu Trp Gly Arg Leu Cys Leu Tyr Gly Arg Phe Ser Val lie Thr Phe 

20 25 30 

Ala Ser Pro Pro Thr Thr Phe Met Xaa lie Gin Cys Cys Ser His Cys 
35 40 45 

Ser 



<210> 544 
<211> 484 
<212> PRT 

<213> Homo sapiens 
<400> 544 

Met Pro Arg His Leu Ser Gly Leu Leu Leu Leu Leu Trp Pro Leu Leu 
15 10 15 

Leu Leu Leu Pro Pro Thr Pro Ala Ala Pro Gly Pro Leu Ala Arg Pro 

20 25 30 

Gly Leu Arg Arg Leu Gly Thr Arg Gly Pro Gly Gly Ser Pro Gly Arg 
35 40 45 

Arg Pro Gly Ser Ala Val Pro Thr Arg Ala Pro Tyr Ser Gly Ala Gly 
50 55 60 

Gin Pro Gly Gly Ala Arg Gly Ala Gly Val Cys Arg Ser Arg Pro Leu 
65 70 75 80 

Asp Leu Val Phe lie lie Asp Ser Ser Arg Ser Val Arg Pro Leu Glu 

85 90 95 

Phe Thr Lys Val Lys Thr Phe Val Ser Gin lie lie Asp Thr Leu Asp 

100 105 110 

lie Gly Ala Ala Asp Thr Arg Val Ala Val Val Asn Tyr Ala Ser Thr 
115 120 125 

Val Lys lie Glu Phe His Leu Gin Thr His Ser Asp Lys Gin Ser Leu 
130 " 135 140 

Lys Gin Ala Val Ala Arg lie Thr Pro Leu Ser Thr Gly Thr Met Ser 
145 150 155 160 

Gly Leu Ala lie Gin Thr Ala Met Asp Glu Ala Phe Thr Val Glu Ala 

165 170 175 

Gly Ala Arg Gly Pro Thr Ser Asn He Pro Lys Val Ala He He Val 

180 185 190 

Thr Asp Gly Arg Pro Gin Asp Gin Val Asn Glu Val Ala Ala Arg Ala 
195 200 205 

Arg Ala Ser Gly He Glu Leu Tyr Ala Val Gly Val Asp Arg Ala Asp 
210 215 220 

Met Glu Ser Leu Lys Met Met Ala Ser Glu Pro Leu Asp Glu His Val 
225 230 235 240 
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Phe Tyr Val Glu Thr Tyr Gly Val lie Glu Lys Leu Ser Ser Arg Phe 

245 250 255 

Gin Glu Thr Phe Cys Ala Leu Asp Pro Cys Val Leu Gly Thr His Arg 

260 265 270 

Cys Gin His Val Cys Val Ser Asp Gly Glu Gly Lys His His Cys Glu 
275 280 285 

Cys Ser Gin Gly Tyr Ser Leu Asn Ala Asp Gin Lys Thr Cys Ser Ala 
290 295 300 

lie Asp Lys Cys Ala Leu Asn Thr His Gly Cys Glu His lie Cys Val 
305 310 315 320 

Asn Asp Arg Thr Gly Ser Tyr His Cys Glu Cys Tyr Glu Gly Tyr Thr 

325 330 335 

Leu Asn Gin Asp Arg Lys Thr Cys Ser Ala Gin Asp Gin Cys Ala Phe 

340 345 350 

Gly Thr His Gly Cys Gin His lie Cys Val Asn Asp Arg Asp Gly Ser 
355 360 365 

His His Cys Glu Cys Tyr <31u Gly Tyr Thr Leu Asn Ala Asp Asn Lys 
370 375 380 

Thr Cys Ser Val Arg Ser Glu Cys Ala Gly Gly Ser His Gly Cys Gin 
385 390 395 400 

His Leu Cys Val Asp Asp Gly Pro Ala Ala Tyr His Cys Asp Cys Phe 

405 410 415 

Pro Gly Tyr Thr Leu Thr Glu Asp Arg Arg Thr Cys Ala Ala lie Glu 

420 425 430 

Glu Ala Arg Arg Leu Val Ser Thr Glu Asp Ala Cys Gly Cys Glu Ala 
435 440 445 

Thr Leu Ala Phe Gin Glu Arg Ala Ser Ser Tyr Leu Gin Arg Leu Asn 
450 455 460 

Ala Lys Leu Asp Asp lie Leu Gly Lys Leu Gin Ala Asp Ala Tyr Gly 
465 470 475 480 

Gin lie His Arg 



<210> 545 
<211> 266 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (45) 

<223> Xaa equals any amino acid 
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<220> 

<221> SITE 
<222> (47) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (51) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (134) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (183) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (222) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (224) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (255) 

<223> Xaa equals any amino acid 
<400> 545 

Met Pro Arg His Leu Ser Gly Leu Leu Leu Leu Leu Trp Pro Leu Leu 
15 10 15 

Leu Leu Leu Pro Pro Thr Pro Ala Ala Pro Gly Pro Leu Ala Arg Pro 

20 25 30 

Gly Leu Arg Arg Leu Gly Thr Arg Gly Pro Gly Gly Xaa Pro Xaa Arg 
35 40 45 

Arg Pro Xaa Ser Ala Val Pro Thr Arg Ala Pro Tyr Ser Gly Ala Gly 
50 55 60 

Gin Pro Gly Gly Ala Arg Gly Ala Gly Val Cys Arg Ser Arg Pro Leu 
65 70 75 80 

Asp Leu Val Phe lie lie Asp Ser Ser Arg Ser Val Arg Pro Leu Glu 

85 90 95 

Phe Thr Lys Val Lys Thr Phe Val Ser Gin lie lie Asp Thr Leu Asp 

100 105 110 

lie Gly Ala Ala Asp Thr Arg Val Ala Val Val Asn Tyr Ala Ser Thr 
115 120 125 

Val Lys lie Glu Phe Xaa Leu Gin Thr His Ser Asp Lys Gin Ser Leu 
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130 135 140 

Lys Gin Ala Val Ala Arg lie Thr Pro Leu Ser Thr Gly Thr Met Ser 
145 150 155 160 

Gly Leu Ala lie Gin Thr Ala Met Asp Glu Ala Phe Thr Val Glu Ala 

165 170 175 

Gly Ala Arg Gly Pro Thr Xaa Asn lie Pro Lys Val Ala He He Val 

180 185 190 

Thr Asp Gly Arg Pro Gin Asp Gin Val Asn Glu Val Ala Ala Arg Ala 
195 200 205 

Arg Ala Ser Gly He Glu Leu Tyr Ala Val Gly Val Asp Xaa Ala Xaa 
210 215 220 

Met Glu Ser Leu Gin Asp Glu Trp Pro Ala Lys Pro Leu Asp Glu His 
225 230 235 240 

Val Phe Tyr Val Glu Thr Tyr Gly Val He Glu Lys Pro Ser Xaa Arg 

245 250 255 

Phe Gin Glu Thr Leu Leu Arg Ser Trp Asn 

260 265 



<210> 546 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<400> 546 

Val Leu Leu lie Leu 
1 5 



<210> 547 
<211> 84 
<212> PRT 

<213> Homo sapiens 
<400> 547 

Lys Met His Phe Asn Lys Asn Lys Ser He Leu Lys Ser Phe Ser Phe 
15 10 15 

Val Arg Gly Asn Met Asn Glu He His Ser Tyr Leu Lys Thr Glu Tyr 

20 25 30 

Phe Thr Ala Lys Thr Leu Asn He Ser Arg Ala Tyr His He Leu Asn 
35 40 45 

Thr Leu Trp Ser Cys Ser Tyr Phe Asn lie Pro Gly Ser Gly Gly Gin 
50 55 60 

Leu Ala Cys Leu Trp Leu Arg lie Cys Phe His Ala Cys Phe Leu Ser 
65 70 75 80 

Phe Phe Tyr Leu 
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<210> 548 
<211> 67 
<212> PRT 

<213> Homo sapiens 
<400> 548 

Met Ala Pro Ser Gly Pro Leu Leu lieu Val Leu Leu Val Pro Leu Ala 
1 5 10 15 

Ala Ala Arg Pro Gly Pro Thr Ser Val Pro Ala Gly Ala Ala Ala Cys 

20 25 30 

Pro Cys Gly Gly Thr Ser Cys Arg Gly Trp Gly Ala Gly Pro Thr Pro 
35 40 45 

Gly Arg Thr Ser Thr Cys Pro His Leu Thr Cys Pro Arg Ala Gly Thr 
50 55 60 

Gly Ala Thr 
65 



<210> 549 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<400> 549 

Pro Gin Gly Pro Asn Asp Val Thr Ala Lys Leu Leu Cys Pro 
15 10 



<210> 550 
<211> 6 
<212> PRT 

<213> Homo sapiens 
<400> 550 

Met Leu Leu Leu Tyr Leu 
1 5 



<210> 551 
<211> 161 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (123) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
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<222> (129) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (145) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (146) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (157) 

<223> Xaa equals any amino acid 
<400> 551 

Met Thr Thr Trp Ser Cys Leu Val Ala Met lie Val Ser Gly Val lie 
15 10 15 

Thr Ala Val Trp Ala Val Arg Ala Ala Pro lie Trp Arg Ser Gin Val 

20 25 30 

Lys Gin Lys Met Arg lie Gly Lys Gin Gly Asn Cys Arg Pro Pro Arg 
35 40 45 

Cys lie Cys Ser Ala Leu Gly Leu Leu Ala Pro Trp Met Ala Val Val 
50 55 60 

Leu Ser Gin Leu Ser Val Arg Cys Val Val Ser Trp Val Gin Gly Lys 
65 70 75 80 

Pro Ser Ser Pro Arg Pro Arg Gly Ser Ala Ala Ser Pro Ala Pro Gly 

85 90 95 

Ala Thr Pro Pro Thr Pro Arg Lys Pro Val Ser Trp Leu Gly Tyr Arg 

100 105 110 

Glu Asn His Arg Pro Lys Lys Pro Lys Ser Xaa Thr Arg Cys Leu Val 
115 120 125 

Xaa Gin Asn Trp Ser Leu Pro Pro lie Ser Lys Asp Arg Thr Ala Gly 
130 135 140 

Xaa Xaa Asp Thr Asn Arg Thr Arg Arg Ser Gly Leu Xaa Leu Arg Leu 
145 150 155 160 

Gly 



<210> 552 
<211> 325 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
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<222> (10) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (136) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (186) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (234) 

<223> Xaa equals any amino acid 
<400> 552 

Val Pro Pro Ala Val Cys Pro Ala Gly Xaa Phe Cys Gin Asn Gin Cys 
15 10 15 

Phe Thr Lys Arg Gin Tyr Pro Glu Thr Lys He He Lys Thr Asp Gly 

20 25 30 

Lys Gly Trp Gly Leu Val Ala Lys Arg Asp He Arg Lys Gly Glu Phe 
35 40 45 

Val Asn Glu Tyr Val Gly Glu Leu He Asp Glu Glu Glu Cys Met Ala 
50 55 60 

Arg He Lys His Ala His Glu Asn Asp He Thr His Phe Tyr Met Leu 
65 70 75 80 

Thr He Asp Lys Asp Arg He He Asp Ala Gly Pro Lys Gly Asn Tyr 

85 90 95 

Ser Arg Phe Met Asn His Ser Cys Gin Pro Asn Cys Glu Thr Leu Lys 

100 105 110 

Trp Thr Val Asn Gly Asp Thr Arg Val Gly Leu Phe Ala Val Cys Asp 
115 120 125 

He Pro Ala Gly Thr Glu Leu Xaa Phe Asn Tyr Asn Leu Asp Cys Leu 
130 135 140 

Gly Asn Glu Lys Thr Val Cys Arg Cys Gly Ala Ser Asn Cys Ser Gly 
145 150 155 160 

Phe Leu Gly Asp Arg Pro Lys Thr Ser Thr Thr Leu Ser Ser Glu Glu 

165 170 175 

Lys Gly Lys Lys Thr Lys Lys Lys Thr Xaa Arg Arg Arg Ala Lys Gly 

180 185 190 

Glu Gly Lys Arg Gin Ser Glu Asp Glu Cys Phe Arg Cys Gly Asp Gly 
195 200 205 

Gly Gin Leu Val Leu Cys Asp Arg Lys Phe Cys Thr Lys Ala Tyr His 
210 215 220 
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Leu Ser Cys Leu Gly Leu Gly Lys Arg Xaa Phe Gly Lys Trp Glu Cys 
225 230 235 240 

Pro Trp His His Cys Asp Val Cys Gly Lys Pro Ser Thr Ser Phe Cys 

245 250 255 

His Leu Cys Pro Asn Ser Phe Cys Lys Glu His Gin Asp Gly Thr Ala 

260 265 270 

Phe Ser Cys Thr Pro Asp Gly Arg Ser Tyr Cys Cys Glu His Asp Leu 
275 280 285 

Gly Ala Ala Ser Val Arg Ser Thr Lys Thr Glu Lys Pro Pro Pro Glu 
290 295 300 

Pro Gly Lys Pro Lys Gly Lys Arg Arg Arg Arg Arg Gly Trp Arg Arg 
305 310 • 315 320 

Val Thr Glu Gly Lys 

325 



<210> 553 
<211> 40 
<212> PRT 

<213> Homo sapiens 
<400> 553 

Met Val Ala Met Val Phe Leu Lys lie Ser Val Leu Pro Leu Met Cys 
15 10 15 

Arg Gly Gin Thr Lys His Lys Val Leu Arg Asp His Ala Tyr Pro Arg 

20 25 30 

Val Ser Gin Lys Arg Gly His lie 
35 40 



<210> 554 
<211> 173 
<212> PRT 

<213> Homo sapiens. 
<400> 554 

Met Val Phe Leu Lys Phe Phe Cys Met Ser Phe Phe Cys His Leu Cys 
15 10 15 

Gin Gly Tyr Phe Asp Gly Pro Leu Tyr Pro Glu Met Ser Asn Gly Thr 

20 25 30 

Leu His His Tyr Phe Val Pro Asp Gly Asp Tyr Glu Glu Asn Asp Asp 
35 40 45 

Pro Glu Lys Cys Gin Leu Leu Phe Arg Val Ser Asp His Arg Arg Cys 
50 55 60 

Ser Gin Gly Glu Gly Ser Gin Val Gly Ser Leu Leu Ser Leu Thr Leu 
65 70 75 80 
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Arg Glu Glu Phe Thr Val Leu Gly His Gin Val Glu Gly Cys Trp Ala 

85 90 95 

Arg Ala Gly Gly His Gin Gin Lys His Leu Leu Arg Pro Arg Arg Gly 

100 105 110 

Arg Glu Leu Trp Gin Val Pro Ala Ala Gly Val Pro Pro Asp Arg Gly 
115 120 125 

Met Pro Thr Pro Thr Arg Thr Asn Pro Ser Leu Ser Trp Arg Ala Ser 
130 135 140 

Ser Ser Arg Ala Arg Asn Arg Thr Ala Gly Arg Arg Ala Gly Ser Thr 
145 150 155 160 

Arg Thr Phe Trp Glu Cys Trp Ser Thr Pro Gly Pro Cys 

165 170 



<210> 555 
<211> 48 
<212> PRT 

<213> Homo sapiens 
<400> 555 

Met Met Leu Tyr Gin Asn Met Leu Leu Tyr Phe Arg lie lie Gly Val 
1 5 10 15 

Leu Ala Leu Asn Phe Ser lie Ser Pro lie Phe Phe His Gly Ser Leu 

20 25 30 

Gly Lys Leu Tyr Val Tyr Ser Ala Ala Lys Tyr Ser Leu Glu Leu Lys 
35 40 45 



<210> 556 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 556 

lie Tyr Gin His Phe Ser Leu Trp Leu Gly 
15 10 



<210> 557 

<211> 4 

<212> PRT 

<213> Homo sapiens 

<400> 557 
Met Phe Lys Met 
1 
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<210> 558 
<211> 201 
<212> PRT 

<213> Homo sapiens 
<400> 558 

Met Lys Leu Leu lie Leu Phe Leu Ser His Leu Leu Ser Leu Ala Phe 
15 10 15 

Gly lie Leu Cys Leu Ser Val Thr Val lie Leu Ser Leu Leu Leu Ser 

20 25 30 

Phe Ser Lys Arg Gly Phe Ser Val Arg Ser Phe Gly Thr Gly Thr His 
35 40 45 

Val Lys Leu Pro Gly Pro Ala Pro Asp Lys Pro Asn Val Tyr Asp Phe 
50 55 60 

Lys Thr Thr Tyr Asp Gin Met Tyr Asn Asp Leu Leu Arg Lys Asp Lys 
65 70 75 80 

Glu Leu Tyr Thr Gin Asn Gly lie Leu His Met Leu Asp Arg Asn Lys 

85 90 95 

Arg He Lys Pro Arg Pro Glu Arg Phe Gin Asn Cys Lys Asp Leu Phe 

100 105 110 

Asp Leu He Leu Thr Cys Glu Glu Arg Val Tyr Asp Gin Val Val Glu 
115 120 125 

Asp Leu Asn Ser Arg Glu Gin Glu Thr Cys Gin Pro Val His Val Val 
130 135 140 

Asn Val Asp He Gin Asp Asn His Glu Glu Ala Thr Leu Gly Ala Phe 
145 150 155 160 

Leu He Cys Glu Leu Cys Gin Cys He Gin His Thr Glu Asp Met Glu 

165 170 175 

Asn Glu He Asp Glu Leu Leu Gin Glu Phe Glu Glu Lys Ser Gly Arg 

180 185 190 

Thr Phe Leu His Thr Val Cys Phe Tyr 
195 200 



<210> 559 
<211> 392 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (251) 

<223> Xaa equals any amino acid 
<400> 559 

Met Ala Pro Trp Pro Pro Lys Gly Leu Val Pro Ala Val Leu Trp Gly 
15 10 15 
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Leu Ser Leu Phe Leu Asn Leu Pro Gly Pro He Trp Leu Gin Pro Ser 

20 25 30 

Pro Pro Pro Gin Ser Ser Pro Pro Pro Gin Pro His Pro Cys His Thr 
35 40 45 

Cys Arg Gly Leu Val Asp Ser Phe Asn Lys Gly Leu Glu Arg Thr He 
50 55 60 

Arg Asp Asn Phe Gly Gly Gly Asn Thr Ala Trp Glu. Glu Glu Asn Leu 
65 70 75 80 

Ser Lys Tyr Lys Asp Ser Glu Thr Arg Leu Val Glu Val Leu Glu Gly 

85 90 95 

Val Cys Ser Lys Ser Asp Phe Glu Cys His Arg Leu Leu Glu Leu Ser 

100 105 110 

Glu Glu Leu Val Glu Ser Trp Trp Phe His Lys Gin Gin Glu Ala Pro 
115 120 125 

Asp Leu Phe Gin Trp Leu Cys Ser Asp Ser Leu Lys Leu Cys Cys Pro 
130 135 * 140 

Ala Gly Thr Phe Gly Pro Ser Cys Leu Pro Cys Pro Gly Gly Thr Glu 
145 150 155 160 

Arg Pro Cys Gly Gly Tyr Gly Gin Cys Glu Gly Glu Gly Thr Arg Gly 

165 170 175 

Gly Ser Gly His Cys Asp Cys Gin Ala Gly Tyr Gly Gly Glu Ala Cys 

180 185 190 

Gly Gin Cys Gly Leu Gly Tyr Phe Glu Ala Glu Arg Asn Ala Ser His 
195 200 205 

Leu Val Cys Ser Ala Cys Phe Gly Pro Cys Ala Arg Cys Ser Gly Pro 
210 215 220 

Glu Glu Ser Asn Cys Leu Gin Cys Lys Lys Gly Trp Ala Leu His His 
225 230 235 240 

Leu Lys Cys Val Asp Cys Ala Lys Ala Cys Xaa Gly Cys Met Gly Ala 

245 250 255 

Gly Pro Gly Arg Cys Lys Lys Cys Ser Pro Gly Tyr Gin Gin Val Gly 

260 265 270 

Ser Lys Cys Leu Asp Val Asp Glu Cys Glu Thr Glu Val Cys Pro Gly 
275 280 285 

Glu Asn Lys Gin Cys Glu Asn Thr Glu Gly Gly Tyr Arg Cys He Cys 
290 295 300 

Ala Glu Gly Tyr Lys Gin Met Glu Gly He Cys Val Lys Glu Gin He 
305 310 315 320 

Pro Glu Ser Ala Gly Phe Phe Ser Glu Met Thr Glu Asp Glu Leu Val 

325 330 335 
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Val Leu Gin Gin Met Phe Phe Gly lie He He Cys Ala Leu Ala Thr 

340 345 350 

Leu Ala Ala Lys Gly Asp Leu Val Phe Thr Ala He Phe He Gly Ala 
.355 360 365 

Val Ala Ala Met Thr Gly Tyr Trp Leu Ser Glu Arg Ser Asp Arg-Val 
370 375 380 

Leu Glu Gly Phe He Lys Gly Arg 
385 390 



<210> 560 
<211> 63 
<212> PRT 

<213> Homo sapiens 
<400> 560 

Met Thr Glu Asp Glu Leu Val Val Leu Gin Gin Met Phe Phe Gly He 
15 10 15 

He He Cys Ala Leu Ala Thr Leu Ala Ala Lys Gly Asp Leu Val Phe 

20 25 30 

Thr Ala He Phe He Gly Ala Val Ala Ala Met Thr Gly Tyr Trp Leu 
35 40 45 

Ser Glu Arg Ser Asp Arg Val Leu Glu Gly Phe He Lys Gly Arg 
50 55 60 



<210> 561 
<211> 102 
<212> PRT 

<213> Homo sapiens 
<400> 561 

Met Thr Val Arg Arg Leu Ser Leu Leu Cys Arg Asp Leu Trp Ala Leu 
1 5 10 15 

Trp Leu Leu Leu Lys Ala Gly Ala Val Arg Gly Ala Arg Ala Gly Pro 

20 25 30 

Arg Leu Pro Gly Arg Cys Cys Gly Ala Thr Cys Gly Asp Ala Gly Arg 
35 40 45 

Gly Trp Thr Phe Trp Ala Gin Pro Cys Pro Gin Lys Leu Leu Gly Gin 
50 55 60 

Lys Pro Gly Ala Gly Gly Cys Arg Gly Trp Val Leu Gly Trp Val Pro 
65 70 75 80 

Pro Arg Pro Glu Glu Pro Cys Ser Leu Ala Gly Lys Val Cys Thr Gly 

85 90 95 

Leu Ala Arg Trp Met Val 

100 
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<210> 562 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (41) 

<223> Xaa equals any amj.no acid 
<400> 562 

Met Cys Lys Ala Val Cys Lys His Arg Leu Arg Leu Phe Ala Val Ser 
15 10 15 

Ser Phe Ser Leu Gly Leu Gly Trp Val Cys Val Leu Val Leu Met Leu 

20 25 30 

Trp Pro Val Arg Leu Ser Leu Ala Xaa Arg Pro Val Gin Leu Gin Gin 
35 40 45 

Arg Arg Ser His Cys 
50 



<210> 563 
<211> 472 
<212> PRT 

<213> Homo sapiens 
<400> 563 

Met Lys Phe Leu lie Phe Ala Phe Phe Gly Gly Val His Leu Leu Ser 
15 10 15 

Leu Cys Ser Gly Lys Ala lie Cys Lys Asn Gly lie Ser Lys Arg Thr 

20 25 30 

Phe Glu Glu lie Lys Glu Glu lie Ala Ser Cys Gly Asp Val Ala Lys 
35 40 45 

Ala lie lie Asn Leu Ala Val Tyr Gly Lys Ala Gin Asn Arg Ser Tyr 
50 55 60 

Glu Arg Leu Ala Leu Leu Val Asp Thr Val Gly Pro Arg Leu Ser Gly 
65 70 75 80 

Ser Lys Asn Leu Glu Lys Ala lie Gin lie Met Tyr Gin Asn Leu Gin 

85 90 95 

Gin Asp Gly Leu Glu Lys Val His Leu Glu Pro Val Arg lie Pro His 

100 105 110 

Trp Glu Arg Gly Glu Glu Ser Ala Val Met Leu Glu Pro Arg lie His 
115 120 125 

Lys lie Ala lie Leu Gly Leu Gly Ser Ser lie Gly Thr Pro Pro Glu 
130 135 140 

Gly lie Thr Ala Glu Val Leu Val Val Thr Ser Phe Asp Glu Leu Gin 
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145 150 155 160 

Arg Arg Ala Ser Glu Ala Arg Gly Lys lie Val Val Tyr Asn Gin Pro 

165 170 175 

Tyr lie Asn Tyr Ser Arg Thr Val Gin Tyr Arg Thr Gin Gly Ala Val 

180 .185 190 

Glu Ala Ala Lys Val Gly Ala Leu Ala Ser Leu He Arg Ser Val Ala 
195 200 205 

Ser Phe Ser He Tyr Ser Pro His Thr Gly He Gin Glu Tyr Gin Asp 
210 215 220 

Gly Val Pro Lys He Pro Thr Ala Cys He Thr Val Glu Asp Ala Glu 
225 230 235 240 

Met Met Ser Arg Met Ala Ser His Gly He Lys He Val He Gin Leu 

245 250 255 

Lys Met Gly Ala Lys Thr Tyr Pro Asp Thr Asp Ser Phe Asn Thr Val 

260 265 270 

Ala Glu He Thr Gly Ser Lys Tyr Pro Glu Gin Val Val Leu Val Ser 
275 280 285 

Gly His Leu Asp Ser Trp Asp Val Gly Gin Gly Ala Met Asp Asp Gly 
290 295 300 

Gly Gly Ala Phe lie Ser Trp Glu Ala Leu Ser Leu Tie Lys Asp Leu 
305 310 315 320 

Gly Leu Arg Pro Lys Arg Thr Leu Arg Leu Val Leu Trp Thr Ala Glu 

325 330 335 

Glu Gin Gly Gly Val Gly Ala Phe Gin Tyr Tyr Gin Leu His Lys Val 

340 345 350 

Asn lie Ser Asn Tyr Ser Leu Val Met Glu Ser Asp Ala Gly Thr Phe 
355 360 365 

Leu Pro Thr Gly Leu Gin Phe Thr Gly Ser Glu Lys Ala Arg Ala He 
370 375 380 

Met Glu Glu Val Met Ser Leu Leu Gin Pro Leu Asn lie Thr Gin Val 
385 390 395 400 

Leu Ser His Gly Glu Gly Thr Asp lie Asn Phe Trp He Gin Ala Gly 

405 410 415 

Val Pro Gly Ala Ser Leu Leu Asp Asp Leu Tyr Lys Tyr Phe Phe Phe 

420 425 430 

His His Ser His Gly Asp Thr Met Thr Val Met Asp Pro Lys Gin Met 
435 440 445 

Asn Val Ala Ala Ala Val Trp Ala Val Val Ser Tyr Val Val Ala Asp 
450 455 460 

Met Glu Glu Met Leu Pro Arg Ser 
465 470 
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<210> 564 
<211> 178 
<212> PRT 

<213> Homo sapiens 
<400> 564 

Ser lie Tyr Ser Pro His Thr Gly He Gin Glu Tyr Gin Asp Gly Val 
15 10 15 

Pro Lys He Pro Thr Ala Cys He Thr Val Glu Asp Ala Glu Met Met 

20 25 30 

Ser Arg Met Ala Ser His Gly He Lys He Val He Gin Leu Lys Met 
35 40 45 

Gly Ala Lys Thr Tyr Pro Asp Thr Asp Ser Phe Asn Thr Val Ala Glu 
50 55 60 

He Thr Gly Ser Lys Tyr Pro Glu Gin Val Val Leu Val Ser Gly His 
65 70 75 80 

Leu Asp Ser Trp Asp Val Gly Gin Gly Ala Met Asp Asp Gly Gly Gly 

85 90 95 

Ala Phe He Ser Trp Glu Ala Leu Ser Leu He Lys Asp Leu Gly Leu 

100 105 110 

Arg Pro Lys Arg Thr Leu Arg Leu Val Leu Trp Thr Ala Glu Glu Gin 
115 120 125 

Gly Gly Val Gly Ala Phe Gin Tyr Tyr Gin Leu His Lys Val Asn He 
130 135 140 

Ser Asn Tyr Ser Leu Val Met Glu Ser Asp Ala Gly Thr Phe Leu Pro 
145 150 155 160 

Thr Gly Leu Gin Phe Thr Gly Ser Glu Lys Ala Arg Ala Ser Trp Arg 

165 170 175 

Arg Leu 



<210> 565 
<211> 199 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (142) 

<223> Xaa e<juals any amino acid 
<400> 565 

Met Lys Leu Gly Cys Val Leu Met Ala Trp Ala Leu Tyr Leu Ser Leu 
15 10 15 
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Gly Val Leu Trp Val Ala Gin Met Leu Leu Ala Ala Ser Phe Glu Thr 

20 25 30 

Leu Gin Cys Glu Gly Pro Val Cys Thr Glu Glu Ser Ser Cys His Thr 
35 40 45 

Glu Asp Asp Leu Thr Asp Ala Arg Glu Ala Gly Phe Gin Val Lys Ala 
50 55 60 

Tyr Thr Phe Ser Glu Pro Phe His Leu lie Val Ser Tyr Asp Trp Leu 
65 70 75 80 

lie Leu Gin Gly Pro Ala Lys Pro Val Phe Glu Gly Asp Leu Leu Val 

85 90 95 

Leu Arg Cys Gin Ala Trp Gin Asp Trp Pro Leu Thr Gin Val Thr Phe 

100 105 110 

Tyr Arg Asp Gly Ser Ala Leu Gly Pro Pro Gly Pro Asn Arg Glu Phe 
115 120 125 

Ser lie Thr Val Val Gin Lys Ala Asp Ser Gly His Tyr Xaa Cys Ser 
130 135 140 

Gly lie Phe Gin Ser Pro Gly Pro Gly lie Pro Glu Thr Ala Ser Val 
145 150 155 160 

Val Ala lie Thr Val Gin Glu Leu Phe Pro Ala Pro He Leu Leu Leu 

165 170 175 

Gin Gly Trp Lys Asp Ser Ala Lys Gin Gly Gly Ser Pro Gin Asn Ser 

180 185 190 

Arg Ser Pro Gin Leu Gin Lys 
195 



<210> 566 

<211> 2 

<212> PRT 

<213> Homo sapiens 

<400> 566 
Ser Trp 



<210> 567 

<211> 32 

<212> PRT 

<213> .Homo sapiens 

<400> 567 

Cys Leu Glu Thr Phe Trp Ser Leu Tyr Leu Gly Gly Trp Gly Met Val 
15 10 15 

Gly Cys Val Cys Tyr Trp His Pro Val Asn Arg Ser Gin Gly Cys Arg 

20 25 30 
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<210> 568 
<211> 283 
<212> PRT 

<213> Homo sapiens 
<400> 568 

Met Tyr Leu Ser Ala Leu Gin Ser Leu lie Pro Ser Leu Phe Ala Leu 
15 10 15 

Val Leu Gin Asn Ala Pro Phe Ser Ser Lys Ala Lys Leu His Gly Glu 

20 25 30 

Val Pro Gin lie Glu Val Thr Arg Phe Pro Arg Pro Met Ser Pro Leu 
35 40 45 

Gin Asp Val Ser Thr He He Gly Ser Arg Glu Gin Leu Ala Val Leu 
50 55 60 

Leu Gin Leu Tyr Asp Tyr Gin Leu Glu Gin Glu Gly Thr Thr Gly Trp 
65 70 75 80 

Glu Ser Leu Leu Trp Val Val Asn Gin Leu Leu Pro Gin Leu He Glu 

85 90 95 

He Val Gly Lys He Asn Val Thr Ser Thr Ala Cys Val His Glu Phe 

100 105 110 

Ser Arg Phe Phe Trp Arg Leu Cys Arg Thr Phe Gly Lys He Phe Thr 
115 120 125 

Asn Thr Lys Val Lys Pro Gin Phe Gin Glu He Leu Arg Leu Ser Glu 
130 135 140 

Glu Asn He Asp Ser Ser Ala Gly Asn Gly Val Leu Thr Lys Ala Thr 
145 150 155 160 

Val Pro He Tyr Ala Thr Gly Val Leu Thr Cys Tyr He Gin Glu Glu 

165 170 175 

Asp Arg Lys Leu Leu Val Gly Phe Leu Glu Asp Val Met Thr Leu Leu 

180 185 190 

Ser Leu Ser His Ala Pro Leu Asp Ser Leu Lys Ala Ser Phe Val Glu 
195 200 205 

Leu Gly Ala Asn Pro Ala Tyr His Glu Leu Leu Leu Thr Val Leu Trp 
210 215 220 

Tyr Gly Val Val His Thr Ser Ala Leu Val Arg Cys Thr Ala Ala Arg 
225 230 235 240 

Met Phe Glu Val Cys Gin His Met Pro Leu Leu Val Ser lie lie Met 

245 250 255 

He Phe Phe Phe Leu Arg Arg Arg Arg Glu Phe Phe Leu He Lys Arg 

260 265 270 
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Leu Cys lie Ser Lys Lys Lys Lys Lys Lys Lys 
275 280 



<210> 569 
<211> 286 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (204) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (224) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (228) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (264) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (271) 

<223> Xaa equals any amino acid 
<400> 569 

Met Tyr Leu Ser Ala Leu Gin Ser Leu lie Pro Ser Leu Phe Ala Leu 
15 10 15 

Val Leu Gin Asn Ala Pro Phe Ser Ser Lys Ala Lys Leu His Gly Glu 

20 25 ■ 30 

Val Pro Gin lie Glu Val Thr Arg Phe Pro Arg Pro Met Ser Pro Leu 
35 40 45 

Gin Asp Val Ser Thr lie lie Gly Ser Arg Glu Gin Leu Ala Val Leu 
50 55 60 

Leu Gin Leu Tyr Asp Tyr Gin Leu Glu Gin Glu Gly Thr Thr Gly Trp 
65 70 75 80 

Glu Ser Leu Leu Trp Val Val Asn Gin Leu Leu Pro Gin Leu lie Glu 

85 90 95 

He Val Gly Lys lie Asn Val Thr Ser Thr Ala Cys Val His Glu Phe 

100 105 110 

Ser Arg Phe Phe Trp Arg Leu Cys Arg Thr Phe Gly Lys He Phe Thr 
115 120 125 
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Asn Thr Lys Val Lys Pro Gin Phe Gin Glu lie Leu Arg Leu Ser Glu 
130 135 140 

Glu Asn lie Asp Ser Ser Ala Gly Asn Gly Val Leu Thr Lys Ala Thr 
145 150 155 160 

Val Pro He Tyr Ala Thr Gly Val Leu Thr Cys Tyr He Gin Glu Glu 

165 170 175 

Asp Arg Lys Leu Leu Val Gly Phe Leu Glu Asp Val Met Thr Leu Leu 

180 185 190 

Ser Leu Ser His Ala Pro Leu Asp Ser Leu Lys Xaa Ser Phe Val Glu 
195 200 205 

Leu Gly Ala Asn Gin Ala Tyr His Glu Leu Leu Leu Thr Val Leu Xaa 
210 215 220 

Tyr Gly Val Xaa His Thr Ser Ala Leu Val Arg Cys Thr Ala Ala Arg 
225 230 235 240 

Met Phe Glu Leu Leu Val Lys Gly Val Asn Glu Thr Leu Val Ala Gin 

245 250 255 

Arg Val Val Pro Ala Leu His Xaa Leu Ser Pro Val Asp Pro Xaa Asn 

260 265 270 

Leu Cys Gin Asp Cys His Asn Phe Gin Pro Leu Gly Leu Phe 
275 280 285 



<210> 570 
<211> 45 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (43) 

<223> Xaa equals any amino acid 
<400> 570 

Met Gin Ala Pro Leu Gin Asp Cys Gly Arg Ser Val Ser Leu Arg Leu 
15 10 15 

Ala Cys Val Leu Ala Pro Leu Thr Thr Ser Ser Arg Gly Cys His Leu 

20 25 30 

Gin Leu Pro Gin Asp Lys Gly Lys Ala Arg Xaa Asp Ser 
35 40 45 



<210> 571 
<211> 305 
<212> PRT 

<213> Homo sapiens 
<400> 571 

Met Gly He Leu Leu Gly Leu Leu Leu Leu Gly His Leu Thr Val Asp 
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15 10 15 

Thr Tyr Gly Arg Pro lie Leu Glu Val Pro Glu Ser Val Thr Gly Pro 

20 25 30 

Trp Lys Gly Asp Val Asn Leu Pro Cys Thr Tyr Asp Pro Leu Gin Gly 
35 40 45 

Tyr Thr Gin Val Leu Val Lys Trp Leu Val Gin Arg Gly Ser Asp Pro 
50 55 60 

Val Thr lie Phe Leu Arg Asp Ser Ser Gly Asp His lie Gin Gin Ala 
65 70 75 80 

Lys Tyr Gin Gly Arg Leu His Val Ser His Lys Val Pro Gly Asp Val 

85 90 95 

Ser Leu Gin Leu Ser Thr Leu Glu Met Asp Asp Arg Ser His Tyr Thr 

100 105 110 

Cys Glu Val Thr Trp Gin Thr Pro Asp Gly Asn Gin Val Val Arg Asp 
115 120 125 

Lys lie Thr Glu Leu Arg Val Gin Lys His Ser Ser Lys Leu Leu Lys 
130 135 140 

Thr Lys Thr Glu Ala Pro Thr Thr Met Thr Tyr Pro Leu Lys Ala Thr 
145 150 155 160 

Ser Thr Val Lys Gin Ser Trp Asp Trp Thr Thr Asp Met Asp Gly Tyr 

165 170 175 

Leu Gly Glu Thr Ser Ala Gly Pro Gly Lys Ser Leu Pro Val Phe Ala 

180 . 185 190 

He lie Leu He He Ser Leu Cys Cys Met Val Val Phe Thr Met Ala 
195 200 205 

Tyr He Met Leu Cys Arg Lys Thr Ser Gin Gin Glu His Val Tyr Glu 
210 215 220 

Ala Ala Arg Ala His Ala Arg Glu Ala Asn Asp Ser Gly Glu Thr Met 
225 230 235 240 

Arg Val Ala He Phe Ala Ser Gly Cys Ser Ser Asp Glu Pro Thr Ser 

245 250 255 

Gin Asn Leu Gly Asn Asn Tyr Ser Asp Glu Pro Cys He Gly Gin Glu 

260 265 270 

Tyr Gin He He Ala Gin He Asn Gly Asn Tyr Ala Arg Leu Leu Asp 
275 280 285 

Thr Val Pro Leu Asp Tyr Glu Phe Leu Ala Thr Glu Gly Lys Ser Val 
290 295 300 

Cys 
305 
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<210> 572 
<211> 72 
<212> PRT 

<213> Homo sapiens 
<400> 572 

Met Lys Phe Val Pro Cys Leu Leu Leu Val Thr Leu Ser Cys Leu Gly 
15 10 15 

Thr Leu Gly Gin Ala Pro Arg Gin Lys Gin Gly Ser Thr Gly Glu Glu 

20 25 30 

Phe His Phe Gin Thr Gly Gly Arg Asp Ser Cys Thr Met Arg Pro Ser 
35 40 45 

Ser Leu Gly Gin Gly Ala Gly Glu Val Trp Leu Arg Val Arg Leu Pro 
50 55 60 

Gin His Arg Pro Asp Leu Leu Val 
65 70 



<210> 573 
<211> 121 
<212> PRT 

<213> Homo sapiens 
<400> 573 

Met Gly Leu Trp Leu Gly Met Leu Ala Cys Val Phe Leu Ala Thr Ala 
15 10 15 

Ala Phe Val Ala Tyr Thr Ala Arg Leu Asp Trp Lys Leu Ala Ala Glu 

20 25 30 

Glu Ala Lys Lys His Ser Gly Arg Gin Gin Gin Gin Arg Ala Glu Ser 
35 40 45 

Thr Ala Thr Arg Pro Gly Pro Glu Lys Ala Val Leu Ser Ser Val Ala 
50 55 60 

Thr Gly Ser Ser Pro Gly He Thr Leu Thr Thr Tyr Ser Arg Ser Glu 
65 70 75 80 

Cys His Val Asp Phe Phe Arg Thr Pro Glu Glu Ala His Ala Leu Ser 

85 90 95 

Ala Pro Thr Ser Arg Leu Ser Val Lys Gin Leu Val He Arg Arg Gly 

100 105 110 

Ala Ala Leu Gly Ala Ala Ser Ala His 
115 120 



<210> 574 
<211> 509 
<212> PRT 

<213> Homo sapiens 
<400> 574 
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Met Thr Trp Arg Met Gly Pro Arg phe Thr Met Leu Leu Ala Met Trp 
15 10 15 

Leu Val Cys Gly Ser Glu Pro His Pro His Ala Thr lie Arg Gly Ser 

20 25 30 

His Gly Gly Arg Lys Val Pro Leu Val Ser Pro Asp Ser Ser Arg Pro 
35 40 45 

Ala Arg Phe Leu Arg His Thr Gly Arg Ser Arg Gly lie Glu Arg Ser 
50 55 60 

Thr Leu Glu Glu Pro Asn Leu Gin Pro Leu Gin Arg Arg Arg Ser Val 
65 70 75 80 

Pro Val Leu Arg Leu Ala Arg Pro Thr Glu Pro Pro Ala Arg Ser Asp 

85 90 95 

lie Asn Gly Ala Ala Val Arg Pro Glu Gin Arg Pro Ala Ala Arg Gly 

100 105 110 

Ser Pro Arg Glu Met lie Arg Asp Glu Gly Ser Ser Ala Arg Ser Arg 
115 120 125 

Met Leu Arg Phe Pro Ser Gly Ser Ser Ser Pro Asn lie Leu Ala Ser 
130 135 140 

Phe Ala Gly Lys Asn Arg Val Trp Val lie Ser Ala Pro His Ala Ser 
145 150 155 160 

Glu Gly Tyr Tyr Arg Leu Met Met Ser Leu Leu Lys Asp Asp Val Tyr 

165 170 175 

Cys Glu Leu Ala Glu Arg His He Gin Gin He Val Leu Phe His Gin 

180 185 190 

Ala Gly Glu Glu Gly Gly Lys Val Arg Arg He Thr Ser Glu Gly Gin 
195 200 205 

He Leu Glu Gin Pro Leu Asp Pro Ser Leu He Pro Lys Leu Met Ser 
210 215 220 

Phe Leu Lys Leu Glu Lys Gly Lys Phe Gly Met Val Leu Leu Lys Lys 
225 230 235 240 

Thr Leu Gin Val Glu Glu Arg Tyr Pro Tyr Pro Val Arg Leu Glu Ala 

245 250 255 

Met Tyr Glu Val He Asp Gin Gly Pro He Arg Arg He Glu Lys He 

260 265 270 

Arg Gin Lys Gly Phe Val Gin Lys Cys Lys Ala Ser Gly Val Glu Gly 
275 280 285 

Gin Val Val Ala Glu Gly Asn Asp Gly Gly Gly Gly Ala Gly Arg Pro 
290 295 300 

Ser Leu Gly Ser Glu Lys Lys Lys Glu Asp Pro Arg Arg Ala Gin Val 
305 310 315 320 

Pro Pro Thr Arg Glu Ser Arg Val Lys Val Leu Arg Lys Leu Ala Ala 
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325 330 335 

Thr Ala Pro Ala Phe Pro Gin Pro Pro Ser Thr Pro Arg Ala Thr Thr 

340 345 350 

Leu Pro Pro Ala Pro Ala Thr Thr Val Thr Arg Ser Thr Ser Arg Ala 
355 360 365 

Val Thr Val Ala Ala Arg Pro Met Thr Thr Thr Ala Phe Pro Thr Thr 
370 375 380 

Gin Arg Pro Trp Thr Pro Ser Pro Ser His Arg Pro Pro Thr Thr Thr 
385 390 395 400 

Glu Val lie Thr Ala Arg Arg Pro Ser Val Ser Glu Asn Leu Tyr Pro 

405 410 415 

Pro Ser Arg Lys Asp Gin His Arg Glu Arg Pro Gin Thr Thr Arg Arg 

420 425 430 

Pro Ser Lys Ala Thr Ser Leu Glu Ser Phe Thr Asn Ala Pro Pro Thr 
435 440 445 

Thr He Ser Glu Pro Ser Thr Arg Ala Ala Gly Pro Gly Arg Phe Arg 
450 455 460 

Asp Asn Arg Met Asp Arg Arg Glu His Gly His Arg Asp Pro Asn Val 
465 470 475 480 

Val Pro Gly Pro Pro Lys Pro Ala Lys Glu Lys Pro Pro Lys Lys Lys 

485 490 495 

Ala Gin Asp Lys He Leu Ser Asn Glu Tyr Glu Glu Val 

500 505 



<210> 575 
<211> 554 
<212> PRT 

<213> Homo sapiens 
<400> 575 

Met Gly Pro Arg Phe Thr Met Leu Leu Ala Met Trp Leu Val Cys Gly 
15 10 15 

Ser Glu Pro His Pro His Ala Thr He Arg Gly Ser His Gly Gly Arg 

20 25 30 

Lys Val Pro Leu Val Ser Pro Asp Ser Ser Arg Pro Ala Arg Phe Leu 
35 40 45 

Arg His Thr Gly Arg Ser Arg Gly He Glu Arg Ser Thr Leu Glu Glu 
50 55 60 

Pro Asn Leu Gin Pro Leu Gin Arg Arg Arg Ser Val Pro Val Leu Arg 
65 70 75 80 

Leu Ala Arg Pro Thr Glu Pro Pro Ala Arg Ser Asp He Asn Gly Ala 

85 90 95 
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Ala Val Arg Pro Glu Gin Arg Pro Ala Ala Arg Gly Ser Pro Arg Glu 

100 105 110 

Met lie Arg Asp Glu Gly Ser Ser Ala Arg Ser Arg Met Leu Arg Phe 
* 115 120 125 

Pro Ser Gly Ser Ser Ser Pro Asn lie Leu Ala Ser Phe Ala Gly Lys 
130 135 140 

Asn Arg Val Trp Val lie Ser Ala Pro His Ala Ser Glu Gly Tyr Tyr 
145 150 155 160 

Arg Leu Met Met Ser Leu Leu Lys Asp Asp Val Tyr Cys Glu Leu Ala 

165 170 175 

Glu Arg His He Gin Gin He Val Leu Phe His Gin Ala Gly Glu Glu 

180 185 190 

Gly Gly Lys Val Arg Arg He Thr Ser Glu Gly Gin He Leu Glu Gin 
195 200 205 

Pro Leu Asp Pro Ser Leu He Pro Lys Leu Met Ser Phe Leu Lys Leu 
210 215 220 

Glu Lys Gly Lys Phe Gly Met Val Leu Leu Lys Lys Thr Leu Gin Val 
225 230 235 240 

Glu Glu Arg Tyr Pro Tyr Pro Val Arg Leu Glu Ala Met Tyr Glu Val 

245 250 255 

He Asp Gin Gly Pro He Arg Arg He Glu Lys He Arg Gin Lys Gly 

260 265 270 

Phe Val Gin Lys Cys Lys Ala Ser Gly Val Glu Gly Gin Val Val Ala 
275 280 285 

Glu Gly Asn Asp Gly Gly Gly Gly Ala Gly Arg Pro Ser Gin Gly Ser 
290 295 300 

Glu Lys Lys Lys Glu Asp Pro Arg Arg Ala Gin Val Pro Pro Thr Arg 
305 310 315 320 

Glu Ser Arg Val Lys Val Leu Arg Lys Leu Ala Ala Thr Ala Pro Ala 

325 330 335 

Phe Pro Gin Pro Pro Ser Thr Pro Arg Ala Thr Thr Leu Thr Pro Ala 

340 345 350 

Pro Ala Thr Thr Val Thr Arg Ser Thr Ser Arg Ala Gly Asn Arg Cys 
355 360 365 

Cys Lys Thr Tyr Asp His His Trp Leu Ser His His Ala Glu Ala Leu 
370 375 380 

Asp Pro Leu Thr Leu Pro Thr Gly Pro Leu Gin Pro Leu Arg Val lie 
385 390 395 400 

Thr Ala Arg Arg Pro Ser Val Ser Arg Glu Ser Leu Pro Ser He Pro 

405 410 415 

Gly Arg He Ser Thr Gly Arg Gly His Arg Gin Pro Gly Gly Pro Ala 
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420 425 430 

Arg Pro Thr Ser Leu Glu Ser Phe Thr Asn Ala Pro Pro Thr Thr lie 
435 440 445 

Ser Glu Pro Ser Thr Arg Ala Ala Gly Pro Gly Arg Phe Arg Asp Asn 
450 455 460 

Arg Met Asp Arg Arg Glu His Gly His Arg Asp Pro Asn Val Val Pro 
465 470 475 480 

Gly Pro Pro Lys Pro Ala Lys Glu Lys Pro Pro Lys Lys Lys Ala Gin 

485 490 495 

Asp Lys lie Leu Ser Asn Glu Tyr Glu Glu Lys Tyr Asp Leu Ser Arg 

500 505 510 

Pro Thr Ala Ser Gin Leu Glu Asp Glu Leu Gin Val Gly Asn Val Pro 
515 520 525 

Leu Lys Lys Ala Lys Glu Ser Lys Lys His Glu Lys Leu Glu Lys Pro 
530 535 540 

Glu Lys Glu Lys Lys Lys Lys Lys Lys Lys 
545 550 



<210> 576 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 576 

Met Leu Ala Leu Leu Gly Leu Leu Ala Gly Thr Glu His Pro Pro Gly 
15 10 15 

Pro Gin Gly Pro Gly Pro Ser 

20 



<210> 577 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 577 

Met Val Asn lie Phe Gly Phe Val Ser Cys He Val Phe Val Val Ala 
15 10 15 

Val Gin Leu Cys Tyr Met Lys Gin Pro 

20 25 



<210> 578 
<211> 122 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> SITE 
<222> (92) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (100) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (109) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (116) 

<223> Xaa equals any amino acid 
<400> 578 

Met Leu Ala Leu Thr Leu Ala Lys Ala Asp Ser Pro Arg Thr Ala Leu 
15 10 15 

Leu Cys Ser Ala Trp Leu Leu Thr Ala Ser Phe Ser Ala Gin Gin His 

20 25 30 

Lys Gly Ser Leu Gin Val His Gin Thr Leu Ser Val Glu Met Asp Gin 
35 40 45 

Val Leu Lys Ala Leu Ser Phe Pro Lys Lys Lys Ala Ala Leu Leu Ser 
50 55 60 

Thr Ala lie Leu Cys Phe Leu Arg Thr Ala Leu Arg Gin Ser Phe Ser 
65 70 75 80 

Ser Ala Trp Asn Pro Gly Ala Leu Lys Gly Pro Xaa Thr Ala Ala Thr 

85 90 95 

Lys Asp Thr Xaa Leu Thr Ser Leu Arg Met Ser Lys Xaa Gly Pro Gly 

100 105 110 

His Trp Ala Xaa Lys Thr Ser Trp Cys Lys 
115 120 



<210> 579 
<211> 216 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (6) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (18) 

<223> Xaa equals any amino acid 
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<400> 579 

Cys Phe Pro Trp Gly Xaa Ala Leu Arg Gin Lys Leu Phe Pro Ser Ala 
15 10 15 

Leu Xaa Ala Leu Val Pro Ser Gly Ala Gin Pro Leu Pro Ala Thr Lys 

20 25 30 

Asp Thr Val Leu Ala Pro Leu Arg Met Ser Gin Val Arg Ser Leu Val 
35 40 45 

lie Gly Leu Gin Asn Leu Leu Val Gin Lys Asp Pro Leu Leu Ser Gin 
50 55 60 

Ala Cys Val Gly Cys Leu Glu Ala Leu Leu Asp Tyr Leu Asp Ala Arg 
65 70 75 80 

Ser Pro Asp lie Ala Leu His Val Ala Ser Gin Pro Trp Asn Arg Phe 

85 90 95 

Leu Leu Phe Thr Leu Leu Asp Ala Gly Glu Asn. Ser Phe Leu Arg Pro 

100 105 110 

Glu lie Leu Arg Leu Met Thr Leu Phe Met Arg Tyr Arg Ser Ser Ser 
115 120 125 

Val Leu Ser His Glu Glu Val Gly Asp Val Leu Gin Gly Val Ala Leu 
130 135 140 

Ala Asp Leu Ser Thr Leu Ser Asn Thr Thr Leu Gin Ala Leu His Gly 
145 150 155 160 

Phe Phe Gin Gin Leu Gin Ser Met Gly His Leu Ala Asp His Ser Met 

165 170 175 

Ala Gin Thr Leu Gin Ala Ser Leu Glu Gly Leu Pro Pro Ser Thr Ser 

180 185 190 

Ser Gly Gin Pro Pro Leu Gin Asp Met Leu Cys Leu Gly Gly Val Ala 
195 200 205 

Val Ser Leu Ser His lie Arg Asn 
210 215 



<210> 580 
<211> 127 
<212> PRT 

<213> Homo sapiens 
<400> 580 

Met Leu Pro Leu Leu lie lie Cys Leu Leu Pro Ala lie Glu Gly Lys 
15 10 15 

Asn Cys Leu Arg Cys Trp Pro Glu Leu Ser Ala Leu lie Asp Tyr Asp 

20 25 30 

Leu Gin lie Leu Trp Val Thr Pro Gly Pro Pro Thr Glu Leu Ser Gin 
35 40 45 
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Ser lie His Ser Leu Phe Leu Glu Asp Asn Asn Phe Leu Lys Pro Trp 
50 55 60 

Tyr Leu Asp Arg Asp His Leu Glu Glu Glu Thr Ala Lys Phe Phe Thr 
65 70 75 80 

Gin Val His Gin Ala lie Lys Thr Leu Arg Asp Asp Lys Thr Val Leu 

85 90 95 

Leu Glu Glu lie Tyr Thr His Lys Asn Leu Phe Thr Glu Arg Leu Asn 
I 100 105 110 

Lys lie Ser Asp Gly Leu Lys Glu Lys Glu Pro His Pro Ser Pro 
115 120 125 



<210> 581 
<211> 164 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (126) 

<223> Xaa equals any amino acid 
<400> 581 

Met Leu Pro Leu Leu lie lie Cys Leu Leu Pro Ala lie Glu Gly Lys 
1 5 10 15 

Asn Cys Leu Arg Cys Trp Pro Glu Leu Ser Ala Leu He Asp Tyr Asp 

20 25 30 

Leu Gin He Leu Trp Val Thr Pro Gly Pro Pro Thr Glu Leu Ser Gin 
35 40 45 

Ser He His Ser Leu Phe Leu Glu Asp Asn Asn Phe Leu Lys Pro Trp 
50 55 60 

Tyr Leu Asp Arg Asp His Leu Glu Glu Glu Thr Ala Lys Phe Phe Thr 
65 70 75 80 

Gin Val His Gin Ala He Lys Thr Leu Arg Asp Asp Lys Thr Val Leu 

85 90 95 

Leu Glu Glu He Tyr Thr His Lys Asn Leu Phe Thr Glu Arg Leu Asn 

100 105 110 

Lys He Ser Asp Gly Leu Lys Glu Lys Gly Ala Pro Pro Xaa Ser Met 
115 120 125 

Asn Ala Phe Pro Ala Pro Ser Pro Thr Cys Thr Pro Glu Pro Leu Gly 
130 135 140 

Ser Val Cys Leu Pro Ser Thr Ser Val Ser Leu Pro Ser His Leu Pro 
145 150 155 160 

Gly Ser Leu Gin 
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<210> 582 
<211> 71 
<212> PRT 

<213> Homo sapiens 
<400> 582 

Met Val Gin Gly Pro Leu Thr His Leu Met Leu Val Leu Leu lie Ser 
15 10 15 

Leu lie Phe Leu Ser Arg Gly Ser Gly Arg Ala Trp Ala Phe Ser His 

20 25 * 30 

Ser Cys Phe Lys Thr Ser Asp Leu Leu Pro Cys Arg Asn Arg Trp Glu 
35 40 45 

Val lie Glu Phe Leu His Tyr Ser Asn Leu His Ser His He Ser Leu 
50 55 60 

Ser Val Thr Lys Thr Phe Leu 
65 70 



<210> 583 
<211> 140 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (136) 

<223> Xaa equals any amino acid 
<400> 583 

Met Ala Ser Leu Gly Leu Gin Leu Val Gly Tyr He Leu Gly Leu Leu 
15 10 15 

Gly Leu Leu Gly Thr Leu Val Ala Met Leu Leu Pro Ser Trp Lys Thr 

20 25 30 

Ser Ser Tyr Val Gly Ala Ser He Val Thr Ala Val Gly Phe Ser Lys 
35 40 45 

Gly Leu Trp Met Glu Cys Ala Thr His Ser Thr Gly He Thr Gin Cys 
50 55 60 

Asp He Tyr Ser Thr Leu Leu Gly Leu Pro Ala Asp He Gin Ala Ala 
65 70 75 80 

Gin Ala Met Met Val Thr Ser Ser Ala He Ser Ser Leu Ala Cys He 

85 90 95 

He Ser Val Val Gly Met Arg Cys Thr Val Phe Cys Gin Glu Ser Arg 

100 105 110 

Ala Lys Asp Arg Val Ala Val Ala Gly Gly Val Phe Phe He Leu Gly 
115 120 125 

Ser Leu Leu Gly Phe He Pro Xaa Ala Trp Asn Leu 
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130 135 140 



<210> 


584 


<211> 


86 


<212> 


PRT 


<213> 


Homo sapiens 


<220> 




<221> 


SITE 


<222> 


(33) 


<223> 


Xaa equals any amino acid 


<220> 




<221> 


SITE 


<222> 


(43) 


<223> 


Xaa equals any amino acid 


<400> 


584 


Arg Arg Phe Tyr Ser Pro Leu Val 


1 


5 



10 15 

Gly Glu Ala Leu Tyr Leu Gly lie lie Ser Ser Leu Phe Ser Leu lie 

20 25 30 

Xaa Gly lie lie Leu Cys Phe Ser Cys Ser Xaa Gin Arg Asn Arg Ser 
35 40 45 

Asn Tyr Tyr Asp Ala Tyr Gin Ala Gin Pro Leu Ala Thr Arg Ser Ser 
50 55 60 

Pro Arg Pro Gly Gin Pro Pro Lys Val Lys Ser Glu Phe Asn Ser Tyr 
65 70 75 80 

Ser Leu Thr Gly Tyr Val 

85 



<210> 585 
<211> 42 
<212> PRT 

<213> Homo sapiens 
<400> 585 

Met Phe Leu Phe lie Thr Phe Thr lie Leu Ala He Phe He He Glu 
15 10 15 

Pro Arg Asn Leu Arg Val Asp Leu Asn Leu He Lys Phe Gin Thr Ser 

20 25 30 

Trp Pro Lys Thr Leu Val Glu Glu Gin Asn 
35 40 



<210> 586 
<211> 76 
<212> PRT 

<213> Homo sapiens 
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<400> 586 

lie Asn Phe Thr Tyr Lys Arg Leu Ser Leu Asp Phe lie Tyr lie Tyr 
15 10 15 

Met Cys Val Cys Val Cys Val Cys Val Cys Val Cys Val Cys Val Tyr 

20 25 30 

Leu Lys Arg Thr Cys Ala Ser lie Lys Gly Asn Lys Met Arg Glu Tyr 
35 40 45 

He He Asp Phe Val Lys Ser Lys Tyr Leu Asn Tyr Gly Phe Ser lie 
50 55 60 

Phe Lys Asn Ser Cys Ser Phe Cys Thr Tyr Phe Phe 
65 70 75 



<210> 587 
<211> 53 
<212> PRT 

<213> Homo sapiens 
<400> 587 

Met Val Thr Phe He Asn Ala Thr Leu Trp He Ala Val Phe Ser Tyr 
15 10 15 

He Met Val Trp Leu Val Thr He He Gly Tyr Thr Leu Gly He Pro 

20 25 30 

Asp Val He Met Gly He Thr Phe Leu Ala Ala Gly Gin Val Phe Gin 
35 40 45 

Thr Ala Trp Pro Ala 
50 



<210> 588 
<211> 169 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (6) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (39) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
<222> (44) 

<223> Xaa equals any amino acid 
<220> 

<221> SITE 
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<222> (71) 

<223> Xaa equals any amino acid 
<400> 588 

Met Val Thr Phe lie Xaa Ala Thr Leu Trp lie Ala Val Phe Ser Tyr 
15 10 15 

lie Met Val Trp Leu Val Thr lie lie Gly Tyr Thr Leu Gly lie Pro 

20 25 30 

Asp Val lie Met Gly He Xaa Phe Leu Ala Ala Xaa Thr Ser Val Pro 
35 40 45 

Asp Cys Met Ala Ser Leu He Val Ala Arg Gin Gly Leu Gly Asp Met 
50 55 60 

Ala Val Ser Asn Thr He Xaa Ser Asn Val Phe Asp He Leu Val Gly 
65 70 75 80 

Leu Gly Val Pro Trp Gly Leu Gin Thr Met Val Val Asn Tyr Gly Ser 

85 90 95 

Thr Val Lys He Asn Ser Arg Gly Leu Val Tyr Ser Val Val Leu Leu 

100 105 110 

Leu Gly Ser Val Ala Leu Thr Val Leu Gly He His Leu Asn Lys Trp 
115 120 125 

Arg Leu Asp Arg Lys Leu Gly Val Tyr Val Leu Val Leu Tyr Ala He 
130 135 140 

Phe Leu Cys Phe Ser He Met lie Glu Phe Asn Val Phe Thr Phe Val 
145 150 155 160 

Asn Leu Pro Met Cys Arg Glu Asp Asp 

165 



<210> 589 
<211> 15090 
<212> DNA 

<213> Homo sapiens 
<400> 589 

acgttcccta cttcctgtgc tcttgcggag acgcgcgcgt cggggtttaa cgcgtttctg 60 

ggccgccgta agcccggcct aggggcagct ttgactcgag agccggctat aggcgcatgg 120 

aaggttccct ggaacgggag gcgccagcgg gggcgctggc cgccgtgcta aagcacagct 180 

cgacgttgcc gcccgaaagc acccaggtcc ggggctacga cttcaaccgc ggtgtgaatt 240 

accgcgcact gctggaggcc ttcggcacca ccggcttcca agcaaccaac ttcgggcgcg 300 

ctgtacagca agtcaatgcc atggtgagga ccgggcggaa tttctaggga cgcggagggg 360 

cgtggcttgt agaaccaacg cggtactaga cgggggcagc gtttccagtg gaggggatat 420 

gtcttttatt tgagttgccc aatagttgga ggaaggcggg acctattctg ggcgggagtt 480 

tctgtcctgg gaaggggatt ttgcactctg gtagttacat gctggtacgg taacctgagg 540 

aggcgaggac tgattcttgg tgtgggggcg ggttctaggt acatttaaag ctttctggaa 600 

tgggcggagc ctggggcaag acaaattaag ggaggatatg ggaggaggag cctaagtctg 660 

ggcggttctt gaatttagat ttgcttttcc cagcggggaa gggaccggat ctgaaaggag 720 

atgctctctg attcctaaaa gggtgggggc tggctgggcg cggtggcgca tgcctataat 780 

cccagcattt tgggaagccg aggcgggtgg atcaagagaa gaggagttcg agacaagcct 840 

ggccaacatg gtgaaaccct gtctctacta aaatgcaaaa aattagccgg gcatggtgtt 900 

gcgcgcctgt agtcccagct actcgggagg ctgaggcagg agaatcgctt gaacccgcga 960 
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ggtggaggtt gcagtgagct gagattgcgc cactgcactc cagcctggtg acagagcgat 1020 

actctgtctc aaaaaaaaaa aaaaaaaaag gccgggcacg gtggctcatg cctgtaatct 1080 

cagcattttg ggaggccgag gcgggcggat cacctgaggt cgggagttcg agaccagcct 1140 

gaccaacatg gagaaacccc gtctctacta aaatgcaaaa aattagccgg gcatggtggt 1200 

gcgcgcctgt agtcccagct actcgggagg ctgaggcagg agaatcgctt gaacccggga 1260 

ggtggagctt gcagtgagcc gagatcgcgc cattgcactc tagcctgggt aacaagagtg 1320 

aaactccgtc caaaaaaaaa aaaaaaaggg tgggggcaaa tcctgaacgt gtgtcttgag 1380 

atctggtctg ggaaggggca gagtttacac aggagatgtt ccttggtccc ctgtagaact 1440 

ggtcctgtat ccctgaatgt gcagggcctg gggtgaaatc tgtttctgga gcccacctgg 1500 

agtcctgagt ctgggattta aggccagtgc ttaacttgac ttggccctta ctcacagatc 1560 

gagaagaagc tggaaccact gtcacaggat gaagaccagc acgcggacct gacccagagc 1620 

cgccgcccac ttaccagctg caccattttc ctgggatata catccaacct catcagttca 1680 

ggcatccgtg agaccattcg ctaccttgtg cagcacaaca tggtggggac ctggtgaggc 1740 

cgtggccttg gcctctgggt caatgggcaa tgcagttatt gatcgtttaa gtgagatggg 1800 

cagatagggg tctgttataa gcagaggaat agaatcaagt tttgttttgt agcagagctc 1860 

cctcataaga gtcactggac aaggatgagg ttagaagttc tgcccagaat gagggcaggg 1920 

gcttccttct tgggccaaat tcactgcctt gtggctgcag gtggacgtat tggtgaccac 1980 

agctggcggc gtggaggaag acctcatcaa gtgcctggcg cccacatact tgggcgagtt 2040 

tagcctcagg gggaaggagc tccgggagaa cgggatcaat aggtgagaac cctgagtggt 2100 

gttgggcagg ggagctggac caagggtcct ggggcctgat gcctacatgc ctcctgttct 2160 

caggatcgga aacctgctgg tgcccaatga gaattactgc aagtttgagg actggctgat 2220 

gcccattctg gaccagatgg tgatggagca gaacacagag gtggggctgg ggcaacctgg 2280 

aggggccagt tcagtgggag tcaggggagg catggcctga aggtcacatc ctctcctagg 2340 

gtgtaaagtg gacgccttct aagatgatcg cccggctggg caaggagatc aacaacccag 2400 

agtccgtgta ttactgggcc cagaaggtga ggacctaagc gggagcaaag tagccagact 2460 

ttggcatgtg ttaacttatt tcatctttcc agcctcctgg ggattcagta ctgttattct 2520 

cagcatcctc actcttagat gaggagactc aaacacagat aggcgtggta attttttttt 2580 

tttttgagac aggatcttgc tctgtcacct aggctggagt gcagtggtgt gatcacagct 2640 

cattgcagcc tcaacctcct ggggccaagc agtcctccca cctcagcctc tcgagtagct 2700 

gggaccacag gcacatgcca tcatgcccag ctaattttaa aaattttttg tagagatggg 2760 

ggtctccctg tgtttcccag gctggtcttg aactcctggc gtcaggcagt cctcccacct 2820 

tggcctccca aagtgctggg attacaggcg tgagccactg tgcccagcca ggctaggtaa 2880 

ttttgtctga agtcacacag ataactagtg gggcctcagc atccttatgc tcctcccaga 2940 

ctatctctgg aacagtagag gcaacactta gcaactctcc tggggcctgg ctcagggtcc 3000 

cacactccca gatgctataa aataagagct acccactccc tttaattaga gaatcagaac 3060 

ctgaaaaaaa aaaaaaaaaa gctacatagt cttttttttt tttttttttt ttttgagaca 3120 

gagtcttgct ctgtcgccca ggctggagtg taatggtgtg atttcagctc actgcaccct 3180 

ccacctccca ggttcaagtg attctcccgc ctcaccctcc tgagtagctg ggactatagg 3240 

cacccaccat catgcccggc caatttttgt atttttctac agatggggtt tcaccatgtt 3300 

ggccaggctg gtcttgaact cctgacctca agtgatccac ccgccttggc ctcccaaagt 3360 

gctgggatta caggggtgag ccaccacacc cagcctactt actctttctt ttttacgaga 3420 

cagggtctca ttctgtttcc caggctggag tgcaatggca caataatggc tcactgctgt 3480 

cttgagcttc tgggctcagt cgattctcct gcctcagcct gctgagtagc tgggactaca 3540 

ggtgcgtgcc accatgcctg gctaattgtt tttgtagaga tgggaactca ccatgttgca 3600 

caagctggct cttttttttt ttttgagacg gagttttgct cttgttaccc aggccagagt 3660 

ccaatggcgc aatcttggct cacagcaacc tctacctcct gggttcaagc aattctcctg 3720 

cctcagcatc aagttcccaa gtagctggga ttataggcat gtgccaccac ccctggctaa 3780 

ttttgtattt ttagtagagg cagggtttct ccatgttggt caggctggtg tcgagctcct 3840 

gacctcaggt gatccacccg cctcggcctc ccaaagtgct gggattacaa gcgtgagcca 3900 

ccgcgcccgg ccacaagctg gctctttaaa aaagaattag acggccgggc acggtggctc 3960 

atgcttgtaa tcccagtact ttgggggctg aggcgggtgg atcacctgaa gtcaggagtt 4020 

caagaccagc ctggccaaca tggtggaacc ctgtctctac taaaaaaaca aaatgcaaaa 4080 

ttagctgggc atagtggcat gcgcctgtaa tcctagctac tcgggaggct gaagcaggag 4140 

aatcactgga acccaggagg cagaggttgc agtgagctga gatagtgcca ttgcactcca 4200 

gcctgggcaa cgagtgaaac tccgtctcag ggaataaaaa aaaagagaga gaaacagctc 4260 

agcaagctaa gaataggctg gttctttatg tgccaggcac tatactaagt gttttctgtg 4320 

ggttttctca tataggcctc atctgtgaga ggtgtttctg ttatgcttat aattgaggaa 4380 

acagtcctag agaaattagg caccccatcc acagtcagat agtcatatag ccagcatgga 4440 

gacgaggctc tgagctccag ccactgtcct gtcatgttct ctgcagaacc acatccctgt 4500 

gtttagtccc gcacttacag acggctcgct gggcgacatg atcttcttcc attcctacaa 4560 

gaacccgggc ctggtcctgg acatcgttga gggtgaggcg ctagggccac agaggagagg 4620 
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ggaaggaggg ctggctgagt ccaaggcctg acttcggcgc tcttccccca gacctgaggc 4680 

tcatcaacac acaggccatc tttgccaagt gcactgggat gatcattctg ggcgggggcg 4740 

tggtcaagca ccacattgcc aatgccaacc tcatggtgag tgggggtggc gcttcggccc 4800 

actctgcaga catgctgtgt ggtgggccta tgccatgtgc tgggtagaca gcatgaacta 4860 

gacaggccag gatccctgct ccccgggact gatgttctag taggggagat agcaaacaag 4920 

tgacatccac gtcagggggc agtcattgct acggggaata ctaaatgatg gcatgagaag 4980 

agaaaggggt gctgtccaac aagaggtttt ggaaggaggc ttctggaagt gtgagctggg 5040 

agcaggggtt tggggaggca tctcccagga gctatgtggt tgtctcccgt gccaggagtt 5100 

tgtgctgctg tactagcatt gagcttctgg attgcaaagt tcccaggaga gggaacagcc 5160 

attgcaaagg ctctggggcg cctgaagtaa tccaggaaca gctgggtacc tggtgagggg 5220 

aaggtggagt ggtgagggcc aggaggtgat gcagggaggt gccacaggga gcctgctggg 5280 

ccctgtgggc tgaggcgggg ctttggcttt tgcactgaga gaagtgggag aggacttgac 5340 

tcgggttccc atgctccccc tggtggccat gtggtgaaca gatggtggca ggtgagtgag 5400 

aactccagtc caggacagtg gtgaccatgc agggatcagg aggcaggcgg tggttggatt 5460 

tggagggctt ctgaagatag agttgctggc agctcctgcc ttgcagatgg ggcagggttg 5520 

ttgacacctg gctgttgggt ctggggaaac acagccatcg ttgcccagca gctctagtga 5580 

ccccagacat tgacatgggc cctgtctggg agcgaatgtc tgtgaagggc ttccttcatc 5640 

tgggacacac ctctcagcct gtttggctac ggtgtcctcc ctctgtgtct gtctcctgtg 5700 

actggctgac tgggcccacc gtgcccccgc ctccccacag cggaacgggg ccgactacgc 5760 

tgtttacatc aacacagccc aggagtttga tggctctgac tcaggtgccc gaccagacga 5820 

ggctgtctcc tggggcaaga tccgggtgga tgcacagccc gtcaaggtaa gcgctggctg 5880 

ggtggggcat agggtctctg ggacgatgag tgtgggtccc atggcttacc caggttcccc 5940 

cctacccagg tctatgctga cgcctccctg gtcttccccc tgcttgtggc tgaaaccttt 6000 

gcccagaaga tggatgcctt catgcatgag aagaacgagg actgagcggc tgcggtccca 6060 

ggaaggtctt accccctctt ctatttatta atttgcagac ccagcccctc ccctactttt 6120 

tggtcagcta cgtctctaga ataagatggt atctgaagtc cttccatgtc tgtgtctcgg 6180 

tccttggtcc tgttggtggg tcccctggct tcagcctgct ccatcctctg cctcataggc 6240 

ctcctctcgc cagcactgga cgctgcctcc catggcggtc agcaggcagg gctctgttgg 6300 

gtggtaggcc agcgactgca ccacaccgga acccacaggc agggccagag ccagcgcacc 6360 

ctgtggagcg aggggtaggg agtggttatc tgtggctggg ctgtaccctg gtttctgagg 6420 

tccctgggca ccccccatct cagtcctgcc tgaagccctc agtcacctct ccccgacctt 6480 

cagccacctg cctctaattg cctcctagac actgcaaaat taatctacct cagactgagc 6540 

ccctcagtag ctctgtccta ccaggtgttc aggccgcaag. tcttgggagc ctcttattga 6600 

ttttttattt tttaaaatag cagagatggg gtcttaccat gttgcccagg ctggtctcga 6660 

actcctggcc tcaagtgatc ctcctgcctt ggcctcccaa agtgctagga ctacaggtgt 6720 

gagccaccat gcctggcttt gggagtcttt ttctttttct gttttttttt gagatggagt 6780 

cttgctctgt cgccaggctg gagtgcagtg gcgccatctg ggctcactgc aacctccgac 6840 

tccctagttc aagcgattct cctgcctcag cctcccgagt aggtgggatt acaggcacgc 6900 

gctaccatgc ccagctaatt tttgtatttt tagtacagat ggggtttcac catgttggcc 6960 

aggatgatct ctatctcctg acctcgtgat ccacccccct cggcctccca aagtgctggg 7020 

attacaggtg tgaaccacca tgcctggccg ggagtctttt tcttgtacca catgttcctc 7080 

tatcagttat tcctcttggc tctgcttcta gaccatgtcc agagactgct gattttctgc 7140 

cactaccatg tttggagcat gccttgctca cagcttgcct gttccctgcc accagctggg 7200 

ctccatcctg tccatttggc agccccaatg atccccactt cccctggtcc caacacaatg 7260 

taaacagttc ctcaagcaag ccatgcttcc tctttccacc ttgaaaatgt cccttgatgt 7320 

gcctgccctc caaggcaact tctgttctac tgggcccata agaaataggt ggaacattct 7380 

tgctacctta gttcctaaag caagagcttg tccgccaagt cttcagggcc cccccggttc 7440 

cctccatcct cttatgggcc ttagttcccc tggccaagcc cagcgctgat gtctcctttg 7500 

ccgagattcc cccggctctg accccactac accaggatgg tctgtttctg gcctgactcc 7560 

ccaccaggct gggacatgtc aaggacaggg ctcagggctt gaaaccatgc acagggatgc 7620. 

taagatgtag ctccagtttc ttgaatgact gcagccctat ctcacccaag gttagacatg 7680 

gccgtcggct cccccttgcc caggggtcag tgccagttgc tcggcctggc atctgcagcc 7740 

cagcctgacc ccggatcttg ccgcacctgc cgccaccctg aagcctttgt ttcctgactg 7800 

atgcatgcgc aggctacacc tcttaacata tgctgttccc tttgcctgga atgcccttcc 7860 

tgcctcatct tctctctagc ccaagctggg cacggtgtct caggcctgta atcccagcac 7920 

tttgggaggc cgaggcgggt gaatcacttg aggtcaggag tttgagacca gcctggccaa 7980 

catggtgaaa ccctgtctct actaaaaata caaaaattag ccaggtatgg tggtaggggc 8040 

ctgtaatccc agctactcag gaggctgaga catgagaatc acctgaaccc tggggggaag 8100 

aggttgcagt gagctgagat ggcgccactg catttgcctg ggcaatagtc tcaaaaaaag 8160 

aaaaaagggc cgggtgtggt ggcttacgcc tgtaatccta gcacttggga aggctgaggt 8220 

gggtggatca tgaggtcagg agtttaagac cagcctgacc aacatagtga aaccctgtct 8280 
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ctactaaaaa 

aggagactga 

ttgcgccact 

aaaaaaaaaa 

gatggggcag 

agaagaacac 

agcagtccag 

gggggcttgg 

actcactcgc 

ctggacacca 

gggaagggtg 

cctgtcccac 

cttaggtcat 

ggaaccttga 

ctgctcttcc 

ggccaattac 

gtccttctga 

aggctggagt 

gattctcctt 

ctaattttgt 

tcccaacctc 

gccaccccac 

ctcagtccca 

gcacccccca 

cggtttgaac 

gacaccaaga 

tgacgatctc 

ggctcacgcc 

agatggagac 

agctgggcgt 

tcgcctgaac 

ctggcaacag 

tatgtattta 

aggagtgtag 

ctcctgcctc 

ttttttgtat 

ctgacctcat 

cagcgcccgg 

ggagtctcac 

tccaactccc 

gctcccgtca 

ttggccaggc 

gtgctgggat 

gtctgtcttt 

cttccacttc 

gagtaggttt 

ccaggtcctg 

acgccatctc 

caacagcgga 

tggaggggga 

taactgtggc 

cagaggaggc 

caagggcgaa 

cctagcctgg 

gggggtggcc 

tccaggcttt 

acatcccaca 

gagctggagt 

catcccagcc 

gtacgtccgc 

gcacgtcagg 



tacaaaaatt 

ggcaggagaa 

gcactctagc 

aagatcctag 

gcatctaggt 

cttcccgtcc 

cttgtattcc 

tcctctccag 

ccagcagctc 

gggtgcactg 

ggtaggtgag 

atccccagcc 

agcgtctcac 

ctacactcac 

agtgcacagg 

gtcctactcc 

ttcccccttt 

gcaatggcat 

cctcagcctc 

atttttagta 

aggtgatccg 

ctggcccaat 

ccactttccc 

cagccaccag 

tgaacgcgtc 

tcaatccctc 

ttcacttcca 

tgtgattcca 

cctcctggcc 

ggtggcacgc 

ctagaagtta 

agcgagactg 

tttatttaat 

tggtgcgatg 

agcctcctga 

ttttagtaga 

gatccgcctg 

cctattttta 

tttgtcgccc 

gggttcaggc 

ccacacctgg 

tgatctcaac 

tacaggtgtg 

cacccagcat 

agcctcccaa 

taatctgagc 

gctccactca 

tggcctcatc 

tactggaatc 

gggcagcacc 

ctcttcatta 

tggactttgg 

ggggaggtga 

catgcggagt 
gtgtcaccac 
cactcacccc 
gaaccaccgc 
aaaggaggag 
tggtccccgg 
agcagcgtcc 
cagtaattgc 



agctgggtgt 

tcgcttgaac 

ctgggcgaca 

ccttcaggcc 

atgaagtagg 

tcagaacagc 

tggttcttat 

gggtgggaga 

ccctgtgtct 

cccatcccgg 

tgagggtggg 

acactcactg 

gcggccatcc 

atggctccag 

agtggaattc 

atcatttcca 

tcttcctttt 

gatcttggct 

ccgagtagat 

gagacggggt 

cccacctcag 

tccccctttt 

cactacccag 

agggagcact 

cacttatatg 

ctcttcctcc 

cttaatgaat 

gctctttggg 

aacatggcga 

gactgtagtc 

gaggttgcag 

cgtctcaaaa 

ttatttactt 

tcggctcaca 

gtagctggga 

gacggggttt 

cctctgcctc 

tttattttta 

aggctggagt 

gattctcctg 

ctaatttttg 

ctcctgacct 

agccaccgtg 

ggagtgcaat 

gtagctggga 

atgggggcag 

ccctgccagg 

cagcgtctgc 

aatagagcct 

ctgggccccc 

aactgcaccg 

aggactggga 

aaggttttgg 

gggggatcgt 

ctatgcgcta 

tgcgtggccc 

cttgtccccg 

gaggatcagc 

ctcacccggc 

cccgaagcgg 

catccactgg 



ggtggcacgc 

ttgggaggcg 

gagcaagact 

ctggagtgac 

gctgggggca 

tgaccacatg 

ggcccttgta 

ggagggaagg 

ttgtccagga 

ctgaagcagg 

gtgcccctgg 

cccacgtagt 

acggagctgc 

gcagccctgt 

tgaagctctg 

agacccacac 

gtttgagaca 

caccggaacc 

gggaatacag 

ttctccatgt 

cctcccaaag 

cctgcattct 

caccagcatc 

tttcaaagta 

tggattttct 

tcctctgctt 

agaaaatata 

aggccagggc 

aaccccgtct 

ccagctacta 

tgagccgaga 

aaaacaaaaa 

attttgagac 

gcaagctctg 

ctacaggcgc 

caccgtgtta 

ccaaagtgct 

gaaatcattt 

gcagtggcac 

cctccgcctc 

catttttagt 

caagtgatct 

cccggccaga 

ggtgcctcga 

ttacagctgt 

ctatgattcc 

atctcgtggt 

actggctcag 

gtgaggccag 

gtgcttaggc 

tgttcacctt 

taaaggcagg 

gggaacagca 

catctgggga 

atacgggtcc 

cggaatttgc 

ccgccggagc 

atcgacctcg 

cgcatccagc 

gttccacagc 

ggaacaccag 



acctataatc 

gagtttacag 

tgtctcaaaa 

tgaggtaagg 

cctcacctcc 

tgtgtcacgc 

cctaagggtg 

ggatccccac 

gccgcaatgt 

tgcaggtgat 

ttggcccccc 

ctgagaagag 

aggagagggt 

gccttcccag 

gccttgagac 

cccctttcca 

gagttttgct 

tctgcctccc 

gcatgcacca 

tggtcaggct 

tgctgggatt 

ccagctcctc 

tagactgttc 

cagttgaccc 

tctgcctcta 

tctcaacacg 

ttttttccac 

aggtggatca 

ctactaaaaa 

gggaggctga 

tcacgtcact 

ttcttcctta 

ggagtcttgc 

cctcctgggt 

ctgccactat 

gccagaatga 

gggattacag 

tatctttttt 

aatctcggct 

ccgagtggct 

agagacgggg 

gcccaccttg 

ttattattat 

actcttggac 

gtgccactgt 

ccgtagcacc 

ctgacacctt 

gcctccgtga 

catgatggtg 

cccagactca 

ctgagaaagt 

gttcccagtg 

aggacagggg 

agggatcctt 

cctccactca 

gcacgacctg 

agagactact 

gatcccagcc 

acctcgtagc 

ttcagcgtct 

gcgggggtta 



ccaactactc 
tgagccaaga 
aaaaaaaaaa 
acagggctga 
accaggtccc 
tcgctcaggc 
gacaggatcg 
gaccacaagg 
ggagtccagg 

ggggccttgg 

atcttccctg 

ctgccccatc 

agate cage g 

cccagtgggc 

tcaggcccag 

agcccttgga 

cttgttgctc 

gggttcaaga 

ccacgccctg 

ggtctcgaac 

acaggtgtga 

ctcctcagta 

tggtctccct 

atgaacaaca 

tcagctgtga 

aagatcttta 

caggegeggt 

cgaggtcaag 

tacaaaaatt 

ggcaggagaa 

gcactccagc 

tgattatttt 

tctatcgccc 

tcactccatt 

gcccggctaa 

tctcgatctc 

geatgageca 

tttttgagat 

cactgcaacc 

gggattacag 

tttcaccatg 

gcctcccaaa 

tttgtagaca 

tcaagtgatc 

gcccagctga 

tggggtgaga 

cacactggac 

gcggcaatcc 

aggtcaggtt 

ccggacagga 

tattgecact 

tcggttaaag 

ttggtgactg 

acggtgtaat 

aatgagggtc 

ccctgatgcc 

gttgtcaaag 

tcagcgctcc 

cgtggccgct 

tgtcactgcc 

ctgggccgtc 
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gatctcagga 

acggctcgca 

ggaggccgcg 

aaatcggtct 

attgttcttc 

gtgatcgggt 

aaggaagact 

tgtcatgagg 

gcacgccgga 

tgtccactaa 

agcgtcgtgg 

tatcccgggg 

cgccgagcga 

gcatgatctt 

tgtagttgga 

cattgctgat 

gagaaagccc 

tggtctgcga 

gggtcttaag 

gtaggactag 

ggtggggcaa 

aggtggcttc 

cctagctgaa 

ctcggagctc 

gaacaagtga 

cctctctcct 

cagcccatga 

atccactagg 

ccctgaatga 

ggaacaccct 

tgfcccatgtt 

tttcactctg 

ggctctagcc 

gcttacttca 

cctctggcaa 

tttttttttc 

ggcacaatct 

gcctgccacg 

tttgagacag 

cggctcactg 

tagctgggat 

tgtaaaatta 

ggctggtctc 

gattgcaggc 

tggtaggaat 

tttctctccg 

caggggtatt 

cttaggcaag 

ggattcccat 

ctgctcctac 

gtcttcatag 

cattttccta 

tgagcctcca 



ggcgggagga 

ctgccccctg 

gctttggctc 

gtacggaaaa 

ctgacctact 

tggaactatt 

tctggcttct 

aacggaaatg 

agctgcgatt 

caatatgtcg 

gcacgggttc 

gcaggacctg 

atgtaacccg 

cagcatgtgc 

cctgagaacg 

ggggcgggac 

cagtctttgg 

gtggggccag 

catgtgggat 

aaaccacgta 

ggtcccagct 

ggggactcat 

gtggtgtgct 

ggaggacacg 

ctcttgagta 

gtctcaggct 

cgcccccatg 

cctgggcttt 

ggccgtctcg 

aggcttaccc 

tctaggggta 

gctgtcagcc 

ccataggcct 

taaaccttgt 

ctcaccagat 

ctttttaaga 

atgttcactg 

taactgggac 

agttttgctc 

caacctccgc 

tacaggcatg 

caaattttgt 

aaactcccag 

ttgagccact 

tgggaaaggc 

gaacacccag 

tacccaacgt 

ctaggtcctt 

ggacacacta 

cggcattaag 

cccgtgagac 

ccttcctgta 

gaccgcgccc 



ggacccggaa 
cccgcagtcc 
agggaaagcc 
ttccgcgcct 
tcctgtttat 
gaagcccgct 
cctaaatctc 
cgagggcctc 
cctgcagcgg 
gacccacgga 
gggttgtggg 
agcgcccctg 
gccttggacg 
ggcctcatgc 
aggcccgggg 
ctgagatcgg 
tgaagtcaag 
gctttgaagc 
gtcagagtct 
tctagagtat 
gcatcagatg 
tgttgtgcct 
tgggtcgctg 
aagcaaatga 

aggggtgggg 

gtccatctct 
gtgataccag 
ggctgctaaa 
gtgcccccag 
ctcctgcctc 
ttcatttgct 
tcttctgttc 
ggtgtggggc 
gtcttaagtc 
ggccctgctg 
aagaaaatgg 
cagcctcaaa 
cacaaatgcg 
ttgttgccca 
ctcccaggtt 
tgccaccatg 
gttttcaaac 
gcccaagtga 
gcgcctgccc 
atgaaagact 
ggatccttct 
cccaatcccc 
ccaggaccct 
caccggctag 
aggcattccg 
gccagtgtca 
acgacccccg 
tggccccgcc 



tgaagacgaa 

agcgtcttca 

atgctcccag 

tagaaaccca 

cacttccggg 

ttcaggttct 

cgttctctgg 

gggtgttact 

aagaggcgtg 

ggccgaacaa 

tgtggatcgg 

accctcgagc 

acccgacgcc 

ttaaggtggg 

gcgggtttgg 

ccgggagtgg 

gattcgagtc 

aacttggaga 

ctcttggtcc 

gccatgagca 

gaatcagtgg 

ttcactaacc 

tctactgctc 

tgagtagctt 

ggaccccagc 

gccgtggtga 

cctagaaggg 

cctgctgcct 

ctggatagag 

ccttcccctg 

ttctcgttga 

agtggggtgt 

cccagtctgg 

aggtgctagg 

acctagataa 

tcttgatctg 

ctcctgggct 

tgctaccatg 

ggctggaggg 

caagtgattc 

cccagctaat 

ccgtctctac 

ttcaccagcc 

tgggctttgt 

cggggtcctt 

caaccaggct 

acagtatagg 

agcgctgaag 

gaaaacccgg 

tcttggaatt 

cctttagccc 

ggattgccca 



ggcgctcacc 

accgtttctg 

gactccttcc 

cgcttgggtg 

ttcatcattt 

tttccccatt 

gtaaggggag 

ctaaaatccg 

atctggcctt 

agtgctgagg 

ggccctggga 

ctgtcgcagg 

ggactacatg 

cggggttgag 

aatgaagcgg 

ggctgcgatg 

aggttgggga 

ggagttgtgg 

tgagagacgc 

ggtgggactg 

gcatgatctg 

tgcccaccca 

cttcatcagc 

catgtgagac 

ctggccatcc 

tgtcctatct 

tcacattttg 

tcagctgcca 

ggaacctggc 

cctgctgctg 

aacctgttgt 

tcctcctacc 

aggcttctgg 

gacaagacag 

tctaggcttt 

tcatgcaggc 

caagcaatcc 

cccagttaat 

caatggcatg 

tcctgcctca 

ttttgtattt 

aaaacaaccc 

tggccctccc 

tttcttatca 

ccccacttgt 

ggatcccatc 

ac tcttcatc 

gtccatggga 

cccccttagg 

ccggcattaa 

aaccagtgcc 

gggctacagc 



attaaatcgt 

cggcagctct 

ttgcagcctt 

taaccttatt 

tggcatttcg 

ttccctttga 

tccaagcctc 

ccctcagctt 

cgactcgcta 

tgaggacccc 

agcgcctgtc 

tacaagcccc 

aacctgctgg 

cttctatggg 

ggtgtcttgt 

acgctgggcc 

tgtggcagat 

aaaaagggca 

aggccagaca 

aggaattccg 

gtgtctggaa 

cttacctttc 

tttgccaact 

ttgccctaca 

tagactgaca 

gcagaatcct 

gaccctgtct 

tcctggactt 

cctttcctag 

ggggagatgc 

taataaagtt 

tggcgggaag 

ctgttcattt 

tgacttctgc 

ggtttccttt 

tgagtgcagt 

tcccacctca 

tcttgttttt 

atcttgatct 

gtctcccaag 

ttacattttg 

atgttgccca 

aaagtgctgg 

gagaatgaac 

cagaccctct 

cctggtactc 

agatcctcct 

cggacaccct 

aaaagcactt 

gaggcattcc 

ctgagggtgg 

ctctctcccg 
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ggccgccgta agcccggcct aggggcagct ttgactcgag agccggctat aggcgcatgg 120 

aaggttccct ggaacgggag gcgccagcgg gggcgctggc cgccgtgcta aagcacagct 180 

cgacgttgcc gcccgaaagc acccaggtcc ggggctacga cttcaaccgc ggtgtgaatt 240 

accgcgcact gctggaggcc ttcggcacca ccggcttcca agcaaccaac ttcgggcgcg 300 

ctgtacagca agtcaatgcc atggtgagga ccgggcggaa tttctaggga cgcggagggg 360 

cgtggcttgt agaaccaacg cggtactaga cgggggcagc gtttccagtg gaggggatat 420 

gtcttttatt tgagttgccc aatagttgga ggaaggcggg acctattctg ggcgggagtt 480 

tctgtcctgg gaaggggatt ttgcactctg gtagttacat gctggtacgg taacctgagg 540 

aggcgaggac tgattcttgg tgtgggggcg ggttctaggt acatttaaag ctttcfcggaa 600 

tgggcggagc ctggggcaag acaaattaag ggaggatatg ggaggaggag cctaagtctg 660 

ggcggttctt gaatttagat ttgcttttcc cagcggggaa gggaccggat ctgaaaggag 720 

atgctctctg attcctaaaa gggtgggggc tggctgggcg cggtggcgca tgcctataat 780 

cccagcattt tgggaagccg aggcgggtgg atcaagagaa gaggagttcg agacaagcct 840 

ggccaacatg gtgaaaccct gtctctacta aaatgcaaaa aattagccgg gcatggtgtt 900 

gcgcgcctgt agtcccagct actcgggagg ctgaggcagg agaatcgctt gaacccgcga 960 

ggtggaggtt gcagtgagct gagattgcgc cactgcactc cagcctggtg acagagcgat 1020 

actctgtctc aaaaaaaaaa aaaaaaaaag gccgggcacg gtggctcatg cctgtaatct 1080 

cagcattttg ggaggccgag gcgggcggat cacctgaggt cgggagttcg agaccagcct 1140 

gaccaacatg gagaaacccc gtctctacta aaatgcaaaa aattagccgg gcatggtggt 1200 

gcgcgcctgt agtcccagct actcgggagg ctgaggcagg agaatcgctt gaacccggga 1260 

ggtggagctt gcagtgagcc gagatcgcgc cattgcactc tagcctgggt aacaagagtg 1320 

aaactccgtc caaaaaaaaa aaaaaaaggg tgggggcaaa tcctgaacgt gtgtcttgag 1380 

atctggtctg ggaaggggca gagtttacac aggagatgtt ccttggtccc ctgtagaact 1440 

ggtcctgtat ccctgaatgt gcagggcctg gggtgaaatc tgtttctgga gcccacc tgg 1500 

agtcctgagt ctgggattta aggccagtgc ttaacttgac ttggccctta ctcacagatc 1560 

gagaagaagc tggaaccact gtcacaggat gaagaccagc acgcggacct gacccagagc 1620 

cgccgcccac ttaccagctg caccattttc ctgggatata catccaacct catcagttca 1680 

ggcatccgtg agaccattcg ctaccttgtg cagcacaaca tggtggggac ctggtgaggc 1740 

cgtggccttg gcctctgggt caatgggcaa tgcagttatt gatcgtttaa gtgagatggg 1800 

cagatagggg tctgttataa gcagaggaat agaatcaagt tttgttttgt agcagagctc 1860 

cctcataaga gtcactggac aaggatgagg ttagaagttc tgcccagaat gagggcaggg 1920 

gcttccttct tgggccaaat tcactgcctt gtggctgcag gtggacgtat tggtgaccac 1980 

agctggcggc gtggaggaag acctcatcaa gtgcctggcg cccacatact tgggcgagtt 2040 

tagcctcagg gggaaggagc tccgggagaa cgggatcaat aggtgagaac cctgagtggt 2100 

gttgggcagg ggagctggac caagggtcct ggggcctgat gcctacatgc ctcctgttct 2160 

caggatcgga aacctgctgg tgcccaatga gaattactgc aagtttgagg actggctgat 2220 

gcccattctg gaccagatgg tgatggagca gaacacagag gtggggctgg ggcaacctgg 2280 

aggggccagt tcagtgggag tcaggggagg catggcctga aggtcacatc ctctcctagg 2340 

gtgtaaagtg gacgccttct aagatgatcg cccggctggg caaggagatc aacaacccag 2400 

agtccgtgta ttactgggcc cagaaggtga ggacctaagc gggagcaaag tagccagact 2460 

ttggcatgtg ttaacttatt tcatctttcc agcctcctgg ggattcagta ctgttattct 2520 

cagcatcctc actcttagat gaggagactc aaacacagat aggcgtggta attttttttt 2580 

tttttgagac aggatcttgc tctgtcacct aggctggagt gcagtggtgt gatcacagct 2640 

cattgcagcc tcaacctcct ggggccaagc agtcctccca cctcagcctc tcgagtagct 2700 

gggaccacag gcacatgcca tcatgcccag ctaattttaa aaattttttg tagagatggg 2760 

ggtctccctg tgtttcccag gctggtcttg aactcctggc gtcaggcagt cctcccacct 2820 

tggcctccca aagtgctggg attacaggcg tgagccactg tgcccagcca ggctaggtaa 2880 

ttttgtctga agtcacacag ataactagtg gggcctcagc atccttatgc tcctcccaga 2940 

ctatctctgg aacagtagag gcaacactta gcaactctcc tggggcctgg ctcagggtcc 3000 

cacactccca gatgctataa aataagagct acccactccc tttaattaga gaatcagaac 3060 

ctgaaaaaaa aaaaaaaaaa gctacatagt cttttttttt tttttttttt ttttgagaca 3120 

gagtcttgct ctgtcgccca ggctggagtg taatggtgtg atttcagctc actgcaccct 3180 

ccacctccca ggttcaagtg attctcccgc ctcaccctcc tgagtagctg ggactatagg 3240 

cacccaccat catgcccggc caatttttgt atttttctac agatggggtt tcaccatgtt 3300 

ggccaggctg gtcttgaact cctgacctca agtgatccac ccgccttggc ctcccaaagt 3360 

gctgggatta caggggtgag ccaccacacc cagcctactt actctttctt ttttacgaga 3420 

cagggtctca ttctgtttcc caggctggag tgcaatggca caataatggc tcactgctgt 3480 

cttgagcttc tgggctcagt cgattctcct gcctcagcct gctgagtagc tgggactaca 3540 

ggtgcgtgcc accatgcctg gctaattgtt tttgtagaga tgggaactca ccatgttgca 3600 

caagctggct cttttttttt ttttgagacg gagttttgct cttgttaccc aggccagagt 3660 

ccaatggcgc aatcttggct cacagcaacc tctacctcct gggttcaagc aattctcctg 3720 
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cctcagcatc aagttcccaa gtagctggga ttataggcat gtgccaccac ccctggctaa 3780 

ttttgtattt ttagtagagg cagggtttct ccatgttggt caggctggtg tcgagctcct 3840 

gacctcaggt gatccacccg cctcggcctc ccaaagtgct gggattacaa gcgtgagcca 3900 

ccgcgcccgg ccacaagctg gctctttaaa aaagaattag acggccgggc atggtggctc 3960 

atgcttgtaa tcccagtact ttgggggctg aggcgggtgg atcacctgaa gtcaggagtt 4020 

caagaccagc ctggccaaca tggtggaacc ctgtctctac taaaaaaaca aaatgcaaaa 4080 

ttagctgggc atagtggcat gcgcctgtaa tcctagctac tcgggaggct gaagcaggag 4140 

aatcactgga acccaggagg cagaggttgc agtgagctga gatagtgcca ttgcactcca 4200 

gcctgggcaa cgagtgaaac tccgtctcag ggaataaaaa aaaagagaga gaaacagctc 4260 

agcaagctaa gaataggctg gttctttatg tgccaggcac tatactaagt gttttctgtg 4320 

ggttttctca tataggcctc atctgtgaga ggtgtttctg ttatgcttat aattgaggaa 4380 

acagtcctag agaaattagg caccccatcc acagtcagat agtcatatag ccagcatgga 4440 

gacgaggctc tgagctccag ccactgtcct gtcatgttct ctgcagaacc acatccctgt 4500 

gtttagtccc gcacttacag acggctcgct gggcgacatg atcttcttcc attcctacaa 4560 

gaacccgggc ctggtcctgg acatcgttga gggtgaggcg ctagggccac agaggagagg 4620 

ggaaggaggg ctggctgagt ccaaggcctg acttcggcgc tcttccccca gacctgaggc 4680 

tcatcaacac acaggccatc tttgccaagt gcactgggat gatcattctg ggcgggggcg 4740 

tggtcaagca ccacattgcc aatgccaacc tcatggtgag tgggggtggc gcttcggccc 4800 

actctgcaga catgctgtgt ggtgggccta tgccatgtgc tgggtagaca gcatgaacta 4860 

gacaggccag gatccctgct ccccgggact gatgttctag taggggagat agcaaacaag 4920 

tgacatccac gtcagggggc agtcattgct acggggaata ctaaatgatg gcatgagaag 4980 

agaaaggggt gctgtccaac aagaggtttt ggaaggaggc ttctggaagt gtgagctggg 5040 

agcaggggtt tggggaggca tctcccagga gctatgtggt tgtctcccgt gccaggagtt 5100 

tgtgctgctg tactagcatt gagcttctgg attgcaaagt tcccaggaga gggaacagcc 5160 

attgcaaagg ctctggggcg cctgaagtaa tccaggaaca gctgggtacc tggtgagggg 5220 

aaggtggagt ggtgagggcc aggaggtgat gcagggaggt gccacaggga gcctgctggg 5280 

ccctgtgggc tgaggcgggg ctttggcttt tgcactgaga gaagtgggag aggacttgac 5340 

tcgggttccc atgctccccc tggtggccat gtggtgaaca gatggtggca ggtgagtgag 5400 

aactccagtc caggacagtg gtgaccatgc agggatcagg aggcaggcgg tggttggatt 5460 

tggagggctt ctgaagatag agttgctggc agctcctgcc ttgcagatgg ggcagggttg 5520 

ttgacacctg gctgttgggt ctggggaaac acagccatcg ttgcccagca gctctagtga 5580 

ccccagacat tgacatgggc cctgtctggg agcgaatgtc tgtgaagggc ttccttcatc 5640 

tgggacacac ctctcagcct gtttggctac ggtgtcctcc ctctgtgtct gtctcctgtg 5700 

actggctgac tgggcccacc gtgcccccgc ctccccacag cggaacgggg ccgactacgc 5760 

tgtttacatc aacacagccc aggagtttga tggctctgac tcaggtgccc gaccagacga 5820 

ggctgtctcc tggggcaaga tccgggtgga tgcacagccc gtcaaggtaa gcgctggctg 5880 

ggtggggcat agggtctctg ggacgatgag tgtgggtccc atggcttacc caggttcccc 5940 

cctacccagg tctatgctga cgcctccctg gtcttccccc tgcttgtggc tgaaaccttt 6000 

gcccagaaga tggatgcctt catgcatgag aagaacgagg actgagcggc tgcggtccca 6060 

ggaaggtctt accccctctt ctatttatta atttgcagac ccagcccctc ccctactttt 6120 

tggtcagcta cgtctctaga ataagatggt atctgaagtc cttccatgtc tgtgtctcgg 6180 

tccttggtcc tgttggtggg tcccctggct tcagcctgct ccatcctctg cctcataggc 6240 

ctcctctcgc cagcactgga cgctgcctcc catggcggtc agcaggcagg gctctgttgg 6300 

gtggtaggcc agcgactgca ccacaccgga acccacaggc agggccagag ccagcgcacc 6360 

ctgtggagcg aggggtaggg agtggttatc tgtggctggg ctgtaccctg gtttctgagg 6420 

tccctgggca ccccccatct cagtcctgcc tgaagccctc agtcacctct ccccgacctt 6480 

cagccacctg cctctaattg cctcctagac actgcaaaat taatctacct cagactgagc 6540 

ccctcagtag ctctgtccta ccaggtgttc aggccgcaag tcttgggagc ctcttattga 6600 

ttttttattt tttaaaatag cagagatggg gtcttaccat gttgcccagg ctggtctcga 6660 

actcctggcc tcaagtgatc ctcctgcctt ggcctcccaa agtgctagga ctacaggtgt 6720 

gagccaccat gcctggcttt gggagtcttt ttctttttct gttttttttt gagatggagt 6780 

cttgctctgt cgccaggctg gagtgcagtg gcgccatctg ggctcactgc aacctccgac 6840 

tccctagttc aagcgattct cctgcctcag cctcccgagt aggtgggatt acaggcacgc 6900 

gctaccatgc ccagctaatt tttgtatttt tagtacagat ggggtttcac catgttggcc 6960 

aggatgatct ctatctcctg acctcgtgat ccacccccct cggcctccca aagtgctggg 7020 

attacaggtg tgaaccacca tgcctggccg ggagtctttt tcttgtacca catgttcctc 7080 

tatcagttat tcctcttggc tctgcttcta gaccatgtcc agagactgct gattttctgc 7140 

cactaccatg tttggagcat gccttgctca cagcttgcct gttccctgcc accagctggg 7200 

ctccatcctg tccatttggc agccccaatg atccccactt cccctggtcc caacacaatg 7260 

taaacagttc ctcaagcaag ccatgcttcc tctttccacc ttgaaaatgt cccttgatgt 7320 

gcctgccctc caaggcaact tctgttctac tgggcccata agaaataggt ggaacattct 7380 



381 



WO 02/076488 



tgctacctta gttcctaaag caagagcttg 
cctccatcct cttatgggcc ttagttcccc 
ccgagattcc cccggctctg accccactac 
ccaccaggct gggacatgtc aaggacaggg 
taagatgtag ctccagtttc ttgaatgact 
gccgtcggct cccccttgcc caggggtcag 
cagcctgacc ccggatcttg ccgcacctgc 
atgcatgcgc aggctacacc tcttaacata 
tgcctcatct tctctctagc ccaagctggg 
tttgggaggc cgaggcgggt gaatcacttg 
catggtgaaa ccctgtctct actaaaaata 
ctgtaatccc agctactcag gaggctgaga 
aggttgcagt gagctgagat ggcgccactg 
aaaaaagggc cgggtgtggt ggcttacgcc 
gggtggatca tgaggtcagg agtttaagac 
ctactaaaaa tacaaaaatt agctgggtgt 
aggagactga ggcaggagaa tcgcttgaac 
ttgcgccact gcactctagc ctgggcgaca 
aaaaaaaaaa aagatcctag ccttcaggcc 
gatggggcag gcatctaggt atgaagtagg 
agaagaacac cttcccgtcc tcagaacagc 
agcagtccag cttgtattcc tggttcttat 
gggggcttgg tcctctccag gggtgggaga 
actcactcgc ccagcagctc ccctgtgtct 
ctggacacca gggtgcactg cccatcccgg 
gggaagggtg ggtaggtgag tgagggtggg 
cctgtcccac atccccagcc acactcactg 
cttaggtcat agcgtctcac gcggccatcc 
ggaaccttga ctacactcac atggctccag 
ctgctcttcc agtgcacagg agtggaattc 
ggccaattac gtcctactcc atcatttcca 
gtccttctga ttcccccttt tcttcctttt 
aggctggagt gcaatggcat gatcttggct 
gattctcctt cctcagcctc ccgagtagat 
ctaattttgt atttttagta gagacggggt 
tcccaacctc aggtgatccg cccacctcag 
gccaccccac ctggcccaat tccccctttt 
ctcagtccca ccactttccc cactacccag 
gcacccccca cagccaccag agggagcact 
cggtttgaac tgaacgcgtc cacttatatg 
gacaccaaga tcaatccctc ctcttcctcc 
tgacgatctc ttcacttcca cttaatgaat 
ggctcacgcc tgtgattcca gctctttggg 
agatggagac cctcctggcc aacatggcga 
agctgggcgt ggtggcacgc gactgtagtc 
tcgcctgaac ctagaagtta gaggttgcag 
ctggcaacag agcgagactg cgtctcaaaa 
tatgtattta tttatttaat ttatttactt 
aggagtgtag tggtgcgatg tcggctcaca 
ctcctgcctc agcctcctga gtagctggga 
ttttttgtat ttfctagtaga gacggggttt 
ctgacctcat gatccgcctg cctctgcctc 
cagcgcccgg cctattttta tttattttta 
ggagtctcac tttgtcgccc aggctggagt 
tccaactccc gggttcaggc gattctcctg 
gctcccgtca ccacacctgg ctaatttttg 
ttggccaggc tgatctcaac ctcctgacct 
gtgctgggat tacaggtgtg agccaccgtg 
gtctgtcttt cacccagcat ggagtgcaat 
cttccacttc agcctcccaa gtagctggga 
gagtaggttt taatctgagc atgggggcag 
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tccgccaagt cttcagggcc cccccggttc 7440 

tggccaagcc cagcgctgat gtctcctttg 7500 

accaggatgg tctgtttctg gcctgactcc 7560 

ctcagggctt gaaaccatgc acagggatgc 7620 

gcagccctat ctcacccaag gttagacatg 7680 

tgccagttgc tcggcctggc atctgcagcc 7740 

cgccaccctg aagcctttgt ttcctgactg 7800 

tgctgttccc tttgcctgga atgcccttcc 7860 

cacggtgtct caggcctgta atcccagcac 7920 

aggtcaggag tttgagacca gcctggccaa 7980 

caaaaattag ccaggtatgg tggtaggggc 8040 

catgagaatc acctgaaccc tggggggaag 8100 

catttgcctg ggcaatagtc tcaaaaaaag 8160 

tgtaatccta gcacttggga aggctgaggt 8220 

cagcctgacc aacatagtga aaccctgtct 8280 

ggtggcacgc acctataatc ccaactactc 8340 

ttgggaggcg gagtttacag tgagccaaga 8400 

gagcaagact tgtctcaaaa aaaaaaaaaa 8460 

ctggagtgac tgaggtaagg acagggctga 8520 

gctgggggca cctcacctcc accaggtccc 8580 

tgaccacatg tgtgtcacgc tcgctcaggc 8640 

ggcccttgta cctaagggtg gacaggatcg 8700 

ggagggaagg ggatccccac gaccacaagg 8760 

ttgtccagga gccgcaatgt ggagtccagg 8820 

ctgaagcagg tgcaggtgat ggggccttgg 8880 

gtgcccctgg ttggcccccc atcttccctg 8940 

cccacgtagt ctgagaagag ctgccccatc 9000 

acggagctgc aggagagggt agatccagcg 9060 

gcagccctgt gccttcccag cccagtgggc 9120 

tgaagctctg gccttgagac tcaggcccag 9180 

agacccacac cccctttcca agcccttgga 9240 

gtttgagaca gagttttgct cttgttgctc 9300 

caccggaacc tctgcctccc gggttcaaga 9360 

gggaatacag gcatgcacca ccacgccctg 9420 

ttctccatgt tggtcaggct ggtctcgaac 9480 

cctcccaaag tgctgggatt acaggtgtga 9540 

cctgcattct ccagctcctc ctcctcagta 9600 

caccagcatc tagactgttc tggtctccct 9660 

tttcaaagta cagttgaccc atgaacaaca 9720 

tggattttct tctgcctcta tcagctgtga 9780 

tcctctgctt tctcaacacg aagatcttta 9840 

agaaaatata ttttttccac caggcgcggt 9900 

aggccagggc aggtggatca cgaggtcaag 9960 

aaccccgtct ctactaaaaa tacaaaaatt 10020 

ccagctacta gggaggctga ggcaggagaa 10080 

tgagccgaga tcacgtcact gcactccagc 10140 

aaaacaaaaa ttcttcctta tgattatttt 10200 

attttgagac ggagtcttgc tctatcgccc 10260 

gcaagctctg cctcctgggt tcactccatt 10320 

ctacaggcgc ctgccactat gcccggctaa 10380 

caccgtgtta gccagaatga tctcgatctc 10440 

ccaaagtgct gggattacag gcatgagcca 10500 

gaaatcattt tatctttttt tttttgagat 10560 

gcagtggcac aatctcggct cactgcaacc 10620 

cctccgcctc ccgagtggct gggattacag 10680 

catttttagt agagacgggg tttcaccatg 10740 

caagtgatct gcccaccttg gcctcccaaa 10800 

cccggccaga ttattattat tttgtagaca 10860 

ggtgcctcga actcttggac tcaagtgatc 10920 

ttacagctgt gtgccactgt gcccagctga 10980 

ctatgattcc ccgtagcacc tggggtgaga 11040 
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ccaggtcctg 
acgccatctc 
caacagcgga 
tggaggggga 
taactgtggc 
cagaggaggc 
caagggcgaa 
cctagcctgg 

gggggtggcc 

tccaggcttt 

acatcccaca 

gagctggagt 

catcccagcc 

gtacgtccgc 

gcacgtcagg 

gatctcagga 

acggctcgca 

ggaggccgcg 

aaatcggtct 

attgttcttc 

gtgatcgggt 

aaggaagact 

tgtcatgagg 

gcacgccgga 

tgtccactaa 

agcgtcgtgg 

tatcccgggg 

cgccgagcga 

gcatgatctt 

tgtagttgga 

cattgctgat 

gagaaagccc 

tggtctgcga 

gggtcttaag 

gtaggactag 

ggtggggcaa 

aggtggcttc 

cctagctgaa 

ctcggagctc 

gaacaagtga 

cctctctcct 

cagcccatga 

atccactagg 

ccctgaatga 

ggaacaccct 

tgtccatgtt 

tttcactctg 

ggctctagcc 

gcttacttca 

cctctggcaa 

tttttttttc 

ggcacaatct 

gcctgccacg 

tttgagacag 

cggctcactg 

tagctgggat 

tgtaaaatta 

ggctggtctc 

gattgcaggc 

tggtaggaat 

tttctctccg 



gctccactca 

tggcctcatc 

tactggaatc 

gggcagcacc 

ctcttcatta 

tggactttgg 

ggggaggtga 

catgcggagt 

gtgtcaccac 

cactcacccc 

gaaccaccgc 

aaaggaggag 

tggtccccgg 

agcagcgtcc 

cagtaattgc 

ggcgggagga 

ctgccccctg 

gctttggctc 

gtacggaaaa 

ctgacctact 

tggaactatt 

tctggcttct 

aacggaaatg 

agctgcgatt 

caatatgtcg 

gcacgggttc 

gcaggacctg 

atgtaacccg 

cagcatgtgc 

cctgagaacg 

ggggcgggac 

cagtctttgg 

gtggggccag 

catgtgggat 

aaaccacgta 

ggtcccagct 

ggggactcat 

gtggtgtgct 

ggaggacacg 

ctcttgagta 

gtctcaggct 

cgcccccatg 

cctgggcttt 

ggccgtctcg 

aggcttaccc 

tctaggggta 

gctgtcagcc 

ccataggcct 

taaaccttgt 

ctcaccagat 

ctttttaaga 

atgttcactg 

taactgggac 

agttttgctc 

caacctccgc 

tacaggcatg 

caaattttgt 

aaactcccag 

ttgagccact 

tgggaaaggc 

gaacacccag 



ccctgccagg 
cagcgtctgc 
aatagagcct 
ctgggccccc 
aactgcaccg 
aggactggga 
aaggttttgg 

gggggatcgt 

ctatgcgcta 
tgcgtggccc 
cttgtccccg 
gaggatcagc 
ctcacccggc 
cccgaagcgg 
catccactgg 
ggacccggaa 
cccgcagtcc 

agggaaagcc 

ttccgcgcct 
tcctgtttat 
gaagcccgct 
cctaaatctc 
cgagggcctc 
cctgcagcgg 
gacccacgga 
gggttgtggg 
agcgcccctg 
gccttggacg 
ggcctcatgc 
aggcccgggg 
ctgagatcgg 
tgaagtcaag 
gctttgaagc 
gtcagagtct 
tctagagtat 
gcatcagatg 
tgttgtgcct 
tgggtcgctg 
aagcaaatga 
aggggtgggg 
gtccatctct 
gtgataccag 
ggctgctaaa 
gtgcccccag 
ctcctgcctc 
ttcatttgct 
tcttctgttc 
ggtgtggggc 
gtcttaagtc 
ggccctgctg 
aagaaaatgg 
cagcctcaaa 
cacaaatgcg 
ttgttgccca 
ctcccaggtt 
tgccaccatg 
gttttcaaac 
gcccaagtga 
gcgcctgccc 
atgaaagact 
ggatccttct 



atctcgtggt 
actggctcag 
gtgaggccag 
gtgcttaggc 
tgttcacctt 
taaaggcagg 
gggaacagca 
catctgggga 
atacgggtcc 
cggaatttgc 
ccgccggagc 
atcgacctcg 
cgcatccagc 
gttccacagc 
ggaacaccag 
tgaagacgaa 
agcgtcttca 
atgctcccag 
tagaaaccca 
cacttccggg 
ttcaggttct 
cgttctctgg 

gggtgttact 
aagaggcgtg 
ggccgaacaa 
tgtggatcgg 
accctcgagc 
acccgacgcc 
ttaaggtggg 
gcgggtttgg 
ccgggagtgg 
gattcgagtc 
aacttggaga 
ctcttggtcc 
gccatgagca 
gaatcagtgg 
ttcactaacc 
tctactgctc 
tgagtagctt 
ggaccccagc 
gccgtggtga 
cctagaaggg 
cctgctgcct 
ctggatagag 
ccttcccctg 
ttctcgttga 
agtggggtgt 
cccagtctgg 
aggtgctagg 
acctagataa 
tcttgatctg 
ctcctgggct 
tgctaccatg 
ggctggaggg 
caagtgattc 
cccagctaat 
ccgtctctac 
ttcaccagcc 
tgggctttgt 
cggggtcctt 
caaccaggct 



ctgacacctt 

gcctccgtga 

catgatggtg 

cccagactca 

ctgagaaagt 

gttcccagtg 

aggacagggg 

agggatcctt 

cctccactca 

gcacgacctg 

agagactact 

gatcccagcc 

acctcgtagc 

ttcagcgtct 

gcgggggtta 

ggcgctcacc 

accgtttctg 

gactccttcc 

cgcttgggtg 

ttcatcattt 

tttccccatt 

gtaaggggag 

ctaaaatccg 

atctggcctt 

agtgctgagg 

ggccctggga 

ctgtcgcagg 

ggactacatg 

cggggttgag 

aatgaagcgg 

ggctgcgatg 

aggttgggga 

ggagttgtgg 

tgagagacgc 

ggtgggactg 

gcatgatctg 

tgcccaccca 

cttcatcagc 

catgtgagac 

ctggccatcc 

tgtcctatct 

tcacattttg 

tcagctgcca 

ggaacctggc 

cctgctgctg 

aacctgttgt 

tcctcctacc 

aggcttctgg 

gacaagacag 

tctaggcttt 

tcatgcaggc 

caagcaatcc 

cccagttaat 

caatggcatg 
tcctgcctca 
ttttgtattt 
aaaacaaccc 
tggccctccc 
tttcttatca 
ccccacttgt 
ggatcccatc 



cacactggac 
gcggcaatcc 
aggtcaggtt 
ccggacagga 
tattgccact 
tcggttaaag 
ttggtgactg 
acggtgtaat 
aatgagggtc 
ccctgatgcc 
gttgtcaaag 
tcagcgctcc 
cgtggccgct 
tgtcactgcc 
ctgggccgtc 
attaaatcgt 
cggcagctct 
ttgcagcctt 
taaccttatt 
tggcatttcg 
ttccctttga 
tccaagcctc 
ccctcagctt 
cgactcgcta 
tgaggacccc 
agcgcctgtc 
tacaagcccc 
aacctgctgg 
cttctatggg 
ggtgtcttgt 
acgctgggcc 
tgtggcagat 
aaaaagggca 
aggccagaca 
aggaattccg 
gtgtctggaa 
cttacctttc 
tttgccaact 
ttgccctaca 
tagactgaca 
gcagaatcct 
gaccctgtct 
tcctggactt 
cctttcctag 

ggggagatgc 

taataaagtt 
tggcgggaag 
ctgttcattt 
tgacttctgc 
ggtttccttt 
tgagtgcagt 
tcccacctca 
tcttgttttt 
atcttgatct 
gtctcccaag 
ttacattttg 
atgttgccca 
aaagtgctgg 
gagaatgaac 
cagaccctct 
cctggtactc 



11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 
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caggggtatt 
cttaggcaag 
ggattcccat 
ctgctcctac 
gtcttcatag 
cattttccta 
tgagcctcca 



tacccaacgt 
ctaggtcctt 
ggacacacta 
cggcattaag 
cccgtgagac 
ccttcctgta 
gaccgcgccc 



cccaatcccc 
ccaggaccct 
caccggctag 
aggcattccg 
gccagtgtca 
acgacccccg 
tggccccgcc 



acagtatagg 
agcgctgaag 
gaaaacccgg 
tcttggaatt 
cctttagccc 
ggattgccca 



actcttcatc 
gtccatggga 
cccccttagg 
ccggcattaa 
aaccagtgcc 
gggctacagc 



agatcctcct 
cggacaccct 
aaaagcactt 
gaggcattcc 
ctgagggtgg 
ctctctcccg 



14760 
14820 
14880 
14940 
15000 
15060 
15090 



<210> 591 
<211> 3830 
<212> DNA 

<213> Homo sapiens 



<400> 591 

ctgggcacgg 

acgaggtcag 

atacaaaaaa 

aggcaggaga 

tacactccag 

aagctctctg 

caggctatgg 

cagacatgct 

cctcattttt 

tggagtgcag 

ctcctgcctc 

tttttgtatt 

ctaacctcag 

caccgcgccc 

tttgctgaga 

ggtccaagcg 

gcctcacggc 

acatacgggg 

tcacccatcc 

ttaaatcctg 

ccgaggcagg 

accccatctc 

cage tac teg 

gagtggagat 

aaaaaacaaa 

cccaatcaga 

aatccttagt 

agtgaataat 

acccagggtc 

gccactgacc 

agctataagg 

agggcacgaa 

tgtctaactc 

cggccgggcg 

gtcacgagtt 

aaaacagaaa 

gctgaggcag 

ccactgcact 

aaaagtgacc 

gggtgggagg 

agcactccag 

gtggctcatg 

ggagatcgag 

attagceggg 

gaatggcgtg 

agectgggag 

aacaacaaaa 



tggctcatgc 

gagatcaaga 

ttagctgggc 

ctggcgtgaa 

cctgggcgac 

attttagctg 

ggataaagaa 

gagaaggatg 

aatttttatt 

tggcaccatc 

agcctcctga 

ttttagtaga 

gtgatctgee 

agccccaatc 

tgcagaaaga 

aggatgtcaa 

ttttgctgtt 

attttctcag 

ctccaactaa 

attgegctgg 

cagatcacct 

tactaaaaat 

ggaggctgag 

cgcgccattg 

aaacaaaaaa 

ttgtgggctc 

gcctagcacg 

cctgttggcc 

tcacattctt 

cagtgggagt 

gaaggaagaa 

cegggctcac 

tgcctggtga 

cagtggctca 

caggagatcg 

aaattagccg 

gagaatggca 

ccagcctggg 

ctcataaaaa 

atgacctgag 

cctggcgaca 

cctgtaatcc 

actaccctgg 

cgtggtggcc 

aacctgggag 

acagegcaag 

ccatcctccc 



ctgtaatccc 
ccatcctggc 
gtgtggcggg 
cccgggaggt 
agagcaagac 
ttaggtggga 
aacacctgga 
gatgtggggt 
tatctatttt 
tcagctcact 
gtagctggga 
gacggggttt 
ctccttggca 
cctatttttc 
tcagggatag 
ggaggatgtc 
ctgtcttccc 
ccttctggct 
aatcccgaac 
gcgcggtggc 
gaggtcagga 
acaagaatta 
gcaggagaat 
cactccagcc 
ggectgatag 
cagtagggca 
gagcccacca 
gtatgcactg 
gcaggaggga 
tcagggaagg 
cagagegtta 
tatcccatta 
ccgaaatctt 
cgcctgtaat 
agaccatcct 
ggtgtggtgg 
tgaacctggg 
egacagageg 
aattagctgg 
teegggagtc 
gagtgagacc 
cagcactttg 
etaacaeggt 
ggcgcctgta 
gcggaggttg 
actctagctc 
ctcccgaggg 



agcactttgg 

taacatggtg 

cgcctgtagt 

ggagcttgta 

tccgtctcaa 

gatgggttgg 

ttteggaect 

gtgagaaaag 

tgagaeggag 

gcaacctcca 

ttatgggtgt 

cgccatgttg 

tcccaaagtg 

agtctagtgg 

gtggagaggg 

tgtctcctcc 

atgtgcagga 

ttggctcaaa 

tcctccacct 

tcacacctgt 

gttcgagact 

geegggegtg 

tgcttgaaac 

tgggcaacaa 

cactcaccac 

gggtccacat 

tagagaatag 

ttaggagggt 

aagagacgtc 

cttcccagag 

tgcagcatgt 

gacaaaagtg 

gtccatgggt 

cctggcactt 

ggctaacacg 

cgggcacctg 

aggcagagct 

agactctgtc 

gcacgatggt 

agaggttgea 

ctatcaaaaa 

ggaggctgag 

gaaaccccgt 

gtcccagtta 

cagtgagccg 

aaacaaacaa 

gacagaacag 



gaggctgagg 
aaaccccgtc 
cccagctact 
gtgagccgaa 
aaaaaaaaaa 
aggataccaa 
actttatagg 
gggagacagc 
tttcgctctg 
cctcccgggt 
gcgccaccac 
tccaggctgg 
ctgggattac 
atgagtggag 
agctgagggg 
aacaaggtag 
aatggcctct 
tctcacctcc 
aaagactcaa 
aatcccagca 
agcctgacca 
atggctcatg 
tgggaggcag 
gagegaaact 
tattgttaac 
cttggtcttg 
actacatgaa 
gtgtttgaga 
agccctgggt 
gaggtggagg 
gaaggctttg 
gctgaggaag 
gaegcttaag 
tgggaggeca 
gtgaaacccc 
tagtcccagt 
tgcagtgagc 
tcaaaaaaaa 
gcacactagt 
gtgggccgag 
atagcagcag 
gegggeggat 
ctctactaaa 
ctggggaggc 
agategtgee 
acaaacaaaa 
aaacgaatgg 



tgggeggate 
tctactaaaa 
tgggaggctg 
atcacaccac 
aaaaaaaaaa 
agecagcttg 
tagaggttgg 
aagctgeget 
ttgcccaggc 
tcaagtgatt 
acccggctaa 
tctctaactc 
aggtgtgagc 
ggtggggccg 
gcaactttga 
gctcattcca 
ctgcatcttc 
ttggcgtcct 
attatcctgt 
ctttgggagg 
acatagtgaa 
cctgtaatcc 
aggttgegat 
ctgtctccaa 
tttctatctt 
ttcaccacta 
ttgtagagtg 
tacagatgac 
cccagaagag 
tgacagctgc 
gagacttgtg 
gatgaaactg 
aagtgaccct 
aggegggegg 
atctctacta 
tactegggag 
egagattgeg 
aaaaaaaaaa 
eggaatgetg 
atcgcgtcac 
gccaggcgcg 
cacgaggtca 
aatacaaaat 
tgaggcagaa 
actgcactcc 
cagcaacaac 
gcgagtgccg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
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ggccaagcag tgggtctcca gcaggtggca ttaaaatagg aattttggct ggggacggtg 2880 

gctcacacct gtaatctcag cactttggaa agcccaggcg ggcggtcacc tgagggcaga 2940 

accagcctcg ccaacatggt gaaatgccat ctctactaaa aataaaaaat tagccaggcc 3000 

tggtggtggg tgtctgtaat cccagcaact cgggaggctg aggcaggaga atcgcttgaa 3060 

ccaggggggc agaggttgca atgagtcaag attgcaccac cgcactccag cctgcgtaac 3120 

aagagcgtgt aactcttgtc tcaaaaataa attaaataaa taaataataa aaataaaaaa 3180 

gaatcttcat tcatgggaag tcgagaacac atgaaaacaa gtaaaggccg aagcgcagtg 3240 

gctcacgcct gtaatcccag cactttggga ggctgaggcg ggcggataac ctgaggtcgg 3300 

aagttcgaga ccagcctgac caacagggag aaaccccgcc tctactaaaa atacaaaatt 3360 

agccgggcat ggcggtgcat gccagtagtc ccagctactc gggaggctga ggcaggagaa 3420 

tcgcttgaac ccggaatgtg gaggttgtgg tgagctgaga tcgggcaatt gcactccagc 3480 

ctgggcaaca agagcgaaac cctgtatcaa aaaaaaaaaa aagaaaaaaa aaggaagaaa 3540 

aggcctaaag gcgccgggcg cggtgggtca cgcctgtaat cccagcactt tgggaggccg 3600 

aggcgggcga atcacgaggt caggagatcg agaccaggtg aaaccccgtc tctactaaaa 3660 

atacaaaaaa attagccggg cgtggtggtg ggcgcctgta gtcccagcta ctcgggaggc 3720 

tgaggcagga gaacagcgtg aacccggaaa gcggagctgg cagtgagctg agatcgcgcc 3780 

actgcactcc agcctgggtg acagagcgag actccgtctc aaaaaaaaaa 3830 

<210> 592 

<211> 20245 

<212> DNA 

<213> Homo sapiens 

<400> 592 

gcctttccag ggccggggaa ccccaggagg aagctgctga gccatgggcg cctacgcgcg 60 

ggcttcgggg gtctgcgctc gcggctgcct ggactcagca ggcccctgga ccatgtcccg 120 

cgccctgcgg ccaccgctcc cgcctctctg ctttttcctt ttgttgctgg cggctgccgg 180 

tgctcgggcc gggggatacg aggtgagtgg ggcctccgag ctgaaacgta caggaggcag 240 

agtgaaaccc agaatacagt ctagaggtgt gggtgggtct gtcctgtggg tgtctagtga 300 

atggctgatg atatgacagt gtggtctgag tgcgtgcttt gtgtcattgc gaggtctggc 360 

tgtgcgcatc tgagtatagg actgtgtctg attgtacctg cctccgtgtc tccgggatgc 420 

ttgcctagac tttgtctgca cttaactgtg ggattggagg ggcaggaggt ggcagggggt 4 BO 

gagcagtgta tgtgtggggg gaggtgctgg ctgagagctg ggactctgga gtctgcctga 540 

aattccagcc tagctcttac acttcctgag tgtgtgacgt tgggcaagtc acctatcctc 600 

tctaagcctc agtgccttca tctggaaact ggggataaca tcaccccacc tcccacggtg 660 

gccgtctgct gtcggcaaat gctgaacaaa catcagctac ttctattatt attttcccgg 720 

agtgtgaatg agagctgccc tgtggggttg tgcaaagtgg agttgtatct atgagcacaa 780 

ctatgtatgt gtgtgtcctg cctgggaggg cctggcctcc tctgcacata ggcgagatca 840 

ggcctctctg agtcactcac ctctagatca agacttactc tgcagccccc gacctcgggg 900 

gttattgaca aggtatgtgt gtttggggtc cctgtgcaga catgccccac agtgcagccg 960 

aacatgctga acgtgcacct gctgcctcac acacatgatg acgtgggctg gctcaaaacc 1020 

gtggaccagt acttttatgg aagtgagtag aggatgggga ctggtccctg ggatccccat 1080 

ggtccctgta atccctctgg gtcctggaca ttagggtggg gccagtgcta ccctaatatc 1140 

cagggtttgg gctcctctgt ctaggaataa cccccttggc tctgctgttc cctgagagcc 1200 

ttatccctgt tatccacagt caagaatgac atccagcacg ccggtgtgca gtacatcctg 1260 

gactcggtca tctctgcctt gctggcagat cccacccgtc gcttcattta cgtggagatt 1320 

gccttcttct cccgttggtg gcaccagcag acaaatgcca cacaggaagt cgtgcgagac 1380 

cttgtgcgcc agggtgagcc taccccaagg aagtgaaaag aggaagccca gcccagcttc 1440 

tgcttctgca tctctggttt ctgagatttg tcatgccacg tgcaagctgt ataacatgcg 1500 

tgtcgctccg cctgcctgga ctctccattt ggagacctcc tatacatccc acaaagcccc 1560 

acctgctgtg catcctctgg gaagcctgcc atgcaggggg cctctttccc atatctggga 1620 

actatggcct gggagcgacc cctcttgtcc ttccgccaga aatgtgtaca caggcaggct 1680 

ttcacattcc cagtatcacc caccctactc ccactcctgg ctctgaccgc tgaccctgac 1740 

cttgcctgtc ctggcacagg gcgcctggag ttcgccaatg gtggctgggt gatgaacgat 1800 

gaggcagcca cccactacgg tgccatcgtg gaccagatga cacttgggct gcgctttctg 1860 

gaggacacat ttggcaatga tgggcgaccc cgtgtggcct ggcacattga ccccttcggc 1920 

cactctcggg agcaggcctc gctgtttgcg caggtgcgac ccgggacctc tcttgggccc 1980 

acttcttcac tcactctggc tcctccctcg cccagtcaaa ccccgccctc tccctgcaat 2040 

ctcacaagga ccaggcccag gcctaggcct gttgaagccc tgccccttga gtgagccgta 2100 

aagccagtgg cttttgagct ctggcctcag ccggctatgc ccagcccagg ctgacccagc 2160 

tccggctggc tccgccctct cccctaatag gcccctcttg gtgttctggc cccacccact 2220 
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agctcgggtc ctggctcctc cctaaaccgg gtggcaagtg gatgcctagg ctgccttaaa 2280 

aacaggttca ttacctgtgc tcagacccca tccatcctca ggctgtggaa gggggaacct 2340 

cattcctgga gtcaggcctg ctctgcgctt tgacagtgct ggggaggtga acctgggttc 2400 

tgatgttgga cccgccctct ccctgctagc ccaaggtggt gagttctgaa acttccccaa 2460 

gcttggaaat aagctggagg cctctctgtt tcagccctac gtgtttttgt ttttgttttt 2520 

tgagacaggg tcttgctgtc atccaggctg gagtgcagtg gtgcaatcct aactcactat 2580 

agcctcaatc tcccggattc aagcgattct cctgtctcag cccccctagt agatgagact 2640 

acaagagcgc accactacgc ctggctaatt tttaaatttt ttgtagacag tctgcccgtg 2700 

ttgtgcaggc tggtctcaaa aactcctggg tttaagtgat cctcctgttt tggcctccca 2760 

gaaggctggg atcataggca agagccacca catcgaccta gcactgcttt ttaacctgtg 2820 

ctctgacctg cccctcccaa gcagagggga gtttggtggt gagagggctg ggcactaatt 2880 

cacactgcct tttcctccct catccccaga tgggcttcga cggcttcttc tttgggcgcc 2940 

ttgattatca agataagtgg gtacggatgc agaagctgga gatggagcag gtgtggcggg 3000 

ccagcaccag cctgaagccc ccgaccgcgg acctcttcac tggtaggggg cttggtgagg 3060 

gcagggccag ccatggtgcc acacactcag aagggccctg ggcttgatat ctgctctgtt 3120 

gtcactgtcc tggaattcct atagtctggg aacaaaggcc ctgcatttcc ttttgcattg 3180 

ggacacaaat tctgaagccc atcctgggtg ggacatggcc ggctttgaaa ccagggaagg 3240 

tctgggtgat gggccacccc ttgaacttgg tgtgacctgc aggtgtgctt cccaatggtt 3300 

acaacccgcc aaggaatctg tgctgggatg tgctgtgtgt cgatcagccg ctggtggagg 3360 

accctcgcag ccccgagtac aacgccaagg agctggtcga ttacttccta aatgtggcca 3420 

ctgcccaggt aaccctggtg tccagaacct tcgagtccgg tatatacaat acaatgagct 3480 

ctttccatgg tacaggcatc ccctaacacg ttctccttat ttttaatttt tttgagacag 3540 

tcttactctg tcacccaggc tggagtgcag tggtgcgatc tcggctcact gcaacctctg 3600 

cctcctgggt tcaaacaagc acatccagct aatttttgta tttttgcact ggggtctcat 3660 

catgttgtcc aggctggtct caaactcctg agctcaagtg atctgcttgc ctcggcctcc 3720 

caaagtgccg ggattacagg catgagccac cgcacctggc ctctattttt aacattttta 3780 

tttatttatt attatttttt tttttttgag acagagtctt gctctgtcac ccaggctgga 3840 

gtgcagtggc atgatctcag ctcactgcaa cctccgcctc ccgggttcaa gcgattctcc 3900 

tgcctcagcc tcctaagaag ctgggattag aggcacctgc caccacaccc agctaatttt 3960 

tgtattttta gtagacaggg tttcgtcatg ttgaccaggc tggtcttgac ctcaggtgat 4020 

ctgcccgcct caacctccca aagtgctggg attacaggtg tgggccactg tgcctagtct 4080 

atttttaaca tttttattga gaattccttt tttatttttt ttagactcac tctgtcgccc 4140 

aggctggagt gcagtggcac gatctcggct cactgcaacc tccacctact gggtccaagc 4200 

gattcttctg cctcagcctc ccgagtagct gggattacag gtgcccacca ccatgctcgg 4260 

ctaagtttta tgtcttttta gtagaaaggg ggtttcacca tattggccag gctggtctcg 4320 

aactcctaac cttttgaccc acagccttgg cctctcaaag tgctgggatt acaggcgtga 4380 

gccaccgcgt ccagccttta acatttttat aattaaaaaa cattattttt tcacagagat 4440 

aaggtctcac catgtggccc aggctggtct caaactcctg aactcaagtg atcctcctgc 4500 

cttggcctcc caaagtgcta ggatataggt gtgagccacc atgcctggca taacacgttc 4560 

tccttaaaaa aatttttttt tcctttcttt aaaaattgat ggctgggcat ggtggctcac 4620 

gcctataatc ccagcatttt gagaggccga gatgggcaga tcatctgaag tcaggagttc 4680 

aagaccagca tagccaaaat gacgaaaccc tgtctctact aaaaatacaa aaattagtcg 4740 

ggtgtggtgg cgcacgcctg taatcccagc tacttgggag gctaaggcaa gagagtcgct 4800 

tgaacctggc aggtggaggt tgcagtgagc cgagatcacg tcacttcact ctagcttggg 4860 

cagcagagtg aaactctgtc tcaaaaaaaa aattgtttta gattaatttt tttttttttt 4920 

cttgagacag ggtcttgctg tgtggcccag gttggtctta aactcctaga ctcaagcgat 4980 

cctcctgcct cagtctcctg agtagctggg attacaggtg tgagcccctg taatcatgtt 5040 

ctcatgcccc ctggaggaag atgctattct attcaccatc acaatgtccc cctcctggat 5100 

ttatgtgcat tcctcattag aagagatagc ataggctggg caagagatag catggcatgg 5160 

tgactggatg ccctcttcta ggtagtgggt ccaagagaac tgctcaacaa ctggtggcta 5220 

cttttatctg tgtccccacg ctcccaagtg ccataccccc acccatgcct gtgcacccaa 5280 

catccttgac cccatataca tcaaaacaca gctatacaca gggatggccc aggatcctct 5340 

ggcttcagga ctcccctctt gcctgcaggg ccggtattac cgcaccaacc acactgtgat 5400 

gaccatgggc tcggacttcc aatatgagaa tgccaacatg tggttcaaga accttgacaa 5460 

gctcatccgg ctggtaaatg cgcaggtcag tgcgcctacc ctgtggtacc cttgtgcaca 5520 

tgtgcgcttg catccggggg ccttgggtta tgtgcatagc tctcagtgct gtctttgttt 5580 

tctattgttc tattgtggtc attctataac aaatgaccac acacttagca gctcaaaaca 5640 

acagaaatac attgtcttac agttctgtag gtaagaagtc cagcatgagg ccgggcgcaa 5700 

tggctcatgc ctgtaatccc agcatgttgg gaggctgagc cgggcagatc acgaggtcag 5760 

gaattcgaga ccagcctgac caacatggtg aaacgctgtc tctactaaaa atacagaaat 5820 

tagctgggtg tgatggtgcg tgcctgtaat cccagctact cgggagcctg aggcagggga 5880 
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atctcttgaa tccgggaggc ggaggttgca gtgagcggag attgtaccac tgcactccag 5940 

cctgggccac agagaaagac tctgtctcaa aaaaaaaaaa aaaaaaaaaa aatccagcct 6000 

gagtctcacc aggctataat caaggtgttg gcaaggctgt gttccttctg cagtctctag 6060 

gagagaatat aatttccttg ctttttccaa catctagaag ttacccacat tcaaaatcta 6120 

ttcttggctc catcttcaaa gccagccaca tagcatcttt ctgaccctgt ttctgtaatc 6180 

acatcccttt ctctcattct cacctcttct gcctctctct tccacatttt atttatttac 6240 

ttagagacgg agtctcgctc tgtcgcccag gctggagtgc agtggcgtgc tctcggctca 6300 

ctgcaacctc cgcctcctgg gttcaagtga ttcttctgtc tcagcctccc aagtagctgg 6360 

gactacagtc gcgtgccacc acgcccagct aatttttgta tttttagtag acagggtttc 6420 

accatgttgg ccaggatggt ctcgatttct tgacctcgtg atccaccccg ctcggcctcc 6480 

caaggtgctg ggattacaga tgtgagccac tgtgtggcct aatttcccta ctttaagttt 6540 

ggctgatgaa caacctcaat tccatctgca accttaattc cccttttgcc atgtaatcta 6600 

atgtggtaat aggttctggg gatcaggaca tggacacttt tgggcagtta tcattttacc 6660 

caacacagat gtgttagtgt tttgcactaa gtggcctgtg gctgtggctg tgtgcacagt 6720 

cagtcactct catcagcaca gaaacttgtc tcccctctcc catctcagac accccttccc 6780 

atcaatcttg tcctcactac ggtgcaatct ccattcccac cttccactcg ctcccagcac 6840 

tgctatttca ccaaaacagc tcttttatgt catttatttt tatttatttt ttctttttaa 6900 

aattttattt atttatttat ttattgagac aagagtcttg ctttgtcacc caggctggag 6960 

tgcagtggca tgatcttgcc tcatggcaac ctctgcctcc tgggttctag tgattctcct 7020 

gcctcagcct cccaagtagc tgggactaca ggcatgtgcc accaaccctg gctaattttt 7080 

gtatttttag tagaggcagg gtttcactat gttggccagg ctggtctcaa aactcctgac 7140 

ctcaggtgat ccgtccgcct tggcctccca aagtgttggg attgtaatct gaggtgggcg 7200 

gatcacttga agacaggagt tggagaccag cctggccaac atggtgaaac cttgtctcta 7260 

ctaaaaatac tacaaattag gtgggcgtga tggcactcat ctgtaagacc agctactcgg 7320 

caggctgagg caggagaatc gctggaacct gggaggcgga gtttgcagcc agctgagatc 7380 

gtgccactgc actccagctt gggcgacaga gtcagactca gtctcaaaaa aaaaaaaaaa 7440 

aaaaaaagtt tattgagcac ctactgtgta cattggggga cacagctcgt gcaaaacaaa 7500 

catccttttc ctcacatagg tcactttctt gttcctccac cttgctcagg tgaaagtgca 7560 

cctccattca ttataaaaat tgtgtctagg ccgggcacgg tggctcatgc ctgtaatccc 7620 

agcactttgg gaggctgagg cgggcggatc atgaggtcag gagatcgaga ccatcctggc 7680 

taacacggtg aaactccgtc tctactaaaa aatgcaaaaa attagccggg tgtggtggcg 7740 

ggcacctgta gtcccaggta ctcatgaggc tgaggctgga gaatggcgtg aacccaagag 7800 

gtggagcttg cagtgagctg agattgagcc actgcactac agcctgggca acaaagtaag 7860 

actccgtccc aaaaaaaaaa agttgtgttt tttggtaaca atctgacagt cctgcaaaac 7920 

gattattcct gccctataat gtcacactta gtttttccac aaaagttttt aataataaag 7980 

ttggaattat tgtaaaagtt tagtaaaatt tcaggtttat tcttgcatct ctgaaatcct 8040 

tttaaaaaag gctagtggtc atgtttgact tcagccaaaa ttcattttca cacccaacca 8100 

ttggcttgca cgccaaatat gtatttagag aaactgaacc tatgggatgc atgaatgtgc 8160 

acacatgtgt ggaagtgtgg gcccccagga aggtgcggac tcccgagggc tcactccgtc 8220 

gcctccccca gcagcaggca aaaggaagca gtgtccatgt tctctactcc acccccgctt 8280 

gttacctctg ggagctgaac aaggccaacc tcacctggta tttggggaaa ctggggagct 8340 

tgggggggtt ggcatgcccc gtgggtcatg accctgccct caatgcccct gccgctgtag 8400 

gtcagtgaaa catgacgact tcttccctta cgcggatggc ccccaccagt tctggaccgg 8460 

ttacttttcc agtcggccgg ccctcaaacg ctacgagcgc ctcagctaca acttcctgca 8520 

ggtgggtagg agccgggcta gagggggcat gcagccccga ggcccgacag gctgggcgcc 8580 

ccaacatacc cctctgcctc caggtgtgca accagctgga ggcgctggtg ggcctggcgg 8640 

ccaacgtggg accctatggc tccggagaca gtgcacccct cagtaagtgt cgggcccaag 8700 

aggggaagag gtttgcggct gaagttggaa accaccccta ggccgccccc ctcgagtttc 8760 

ttcttttttt tttttttttt tttttttttt gagacggagt ctcgatttgt ctcccaggct 8820 

ggagtgcagt ggtgcgatct cggctcactg caagctccac ctcccgtgtt cacgccattc 8880 

tcctgactca gcctcccgag tagctgggac tacaggcgcc caccaccacg cccggctaat 8940 

tttttgtatt tttagtagag acggggtttc accatgttag ccaggatggt ctcgatctga 9000 

cctcgtgatc cgcccgcctc ggtctctcaa agtgctggga ttacaggcgt gagccaccgc 9060 

ccccagccgt cctcgagttc cttcttaaag cctctaagaa tcctgcccgc cagcaccgga 9120 

cctttcgctt ccccttgggg tctcagctga gtcccacaga acctcaccgg actcattgtc 9180 

tatgagcaga tgaggcgatg gctgtgctcc agcatcacga cgccgtcagc ggcacctccc 9240 

gccagcacgt ggccaacgac tacgcgcgcc agcttgcggc aggctggggg ccttgcgagg 9300 

tgcgcggggc gagacttggg agacacgggg gtggagacag gaaggggcgg ggccagggcc 9360 

tgggaaaggg gacagagaca ggtgtgaggc gtagccgaga gccctgtggc ggggctacaa 9420 

gggctcgtgg gggcggggct tgtaggaggc ggggaaagat acaggaacgg ggcggggctt 9480 

tggagggggg aaggaggcgg ggcgtgggca agaggaggcg gagacagcta tggggtatag 9540 
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tcaagggcag 
agagaccgga 
ccaccggtcc 
aggcttcaaa 
ccagacggcg 
gagggggtag 
gaagctgggt 
ccacactagg 
ctgtgaccca 
cctggggcgg 
gaaggacccc 
cgctgggact 
catgaatacc 
cttaatatca 
ttttttttga 
gctcacttca 
ccattttaat 
tccagaaaac 
tcccctactt 
ttttttccac 
ccacacacac 
gtctccatgc 
ccgaggtggt 
cagcctcact 
agccccaggc 
tcgaaaatga 
tgtcatggtg 
acaactcccc 
tcctgggact 
gttggtggca 
taaaggtgac 
gagatccagg 
attccaggag 
accttgaggg 
tgtgtctact 
gtcccagcta 
cagtgagtaa 
aaagaataat 
ggaggctgag 
gtgaaaccca 
ctacttgggg 
cgagatcatg 
aaaataaaag 
tttttattaa 
gagttctggc 
gagggctttg 
tgtaggcagc 
aggccctggt 
cacacctgta 
ggagaccatc 
caggcgtggt 
cgtgaacccg 
ggcaacagag 
aggctggggt 
ggcaggtcac 
tgtaccaaaa 
cgggagggag 
atcatgccac 
aaaaaaaatc 
tagccccagg 
tttaatttag 



cagggtgggg 
acggggcggg 
ctttgcgctc 
gatcacttca 
gcgcgcgtga 
ttggaaaatg 
cagcagagag 
accagccagg 
tgccctctct 
aaggtgaatt 
aatggcagga 
cctcccccag 
tcccactcat 
gtctcccctt 
gacggagtct 
aactccgcct 
accccgcttc 
gtcctccccc 
cctccttcac 
ctggtgaatc 
ctccatttcc 
atgtatagct 
aatatttccc 
gcccgccctg 
ccgcgcacca 
ggtgagaccc 
agctgtgtgg 
tcattgtgtg 
gttatgagga 
ggtggtggtt 
atttaagata 
tggggagaga 
ccacaaggag 
ccttgaggac 
cctagctctt 
attgggaggc 
tgattgtccc 
aataaaatag 
gtgggtggat 
gtctctacaa 
gctgaggaag 
ccactgcact 
aagttctgca 
aataaaataa 
attgtggttc 
atgggactta 
cagtgtggaa 
aagaggtggt 
atcccagcac 
ctggctaaca 
ggcgagcatc 
gaaggcggag 
tgagactccg 
gggcacgttg 
ctgaggtcag 
atacaaaaat 
aggcaggaga 
tgcactccag 
aggttgaatg 
caacatagtg 
tagagatgga 



ctagaagggg 
gcctgaggag 
ttccgcaggt 
ccttttgcca 
gccgggacgg 
tttttggagg 
aacaacgcat 
ggtgtttctg 
gacccccgcc 
ggatggtacg 
cagtgcccag 
tgggcatttc 
gcatatctct 
gggaacatat 
cgctctgtca 
gctggattat* 
cccaggaaaa 
aaacctcttc 
tcccccgact 
ttctctgcaa 
tttccccatc 
ttccatgggt 
agctcagaca 
ggcttcagca 
cagcccatcc 
catttcaatc 
actctgggtc 
tccttgagca 
cccagtgtga 
cgcaattttg 
tgacctgagg 
ataacaggca 
ctcagtgtgg 
caacaggaag 
tgggaggctg 
ccagatggga 
actgtactcc 
gccaggcatc 
ctactgagct 
aaaattagct 
gaggattgct 
ccagcctggg 
gtaggactgt 
aagaagttaa 
acacatcaga 
gggtatcaca 
gtgcagggac 
gaggttgaaa 
tttgggaggc 
cggtgaaacc 
tgcagtccca 
cttgcagtga 
tctcaaaaaa 
gctcatgcct 
gagtttgaga 
taactgggcg 
atcacttgaa 
cctgggggac 
gctgggtgtg 
ggacccccat 
gtttcactat 



ttttggggcg 
aggggaggag 
tcttctgagc 
acagctaaac 
gaggggtgga 
actatacagg 
ctcgaggggg 
tgcaaagtgg 
tccccagttc 
gctgccggtc 
cgatgtgagc 
ctcatcgccc 
tcccccctct 
gcctcctgtg 
cccaggctgg 
aggcatgagc 
tttattgctg 
ctcccccgag 
ccatggtctc 
tcatttcctt 
tcaggactct 
actcttgact 
gccaggcgca 
cctattcagt 
ccagaagatc 
ccctttcctg 
agcaggtccc 
agtgaccttt 
tcatgggtga 
gcagggacgt 
gagggagccg 
taaaagcctt 
ctgaagcagg 
actttggctt 
aagcgggagg 
ggtttgtttg 
atcctggcaa 
gtggattata 
caggagtttg 

gggggtggtg 

tgagcccagg 
tgacaaagtg 
tcatgttagg 
aacgttccca 
tgcccaagtt 

ggtgtgctct 

ctggaagggg 
ataaggttgg 
cgaggcaggc 
ctgactctac 
gttactcggg 
cctgagatgg 
aaaaaaaaga 
gtaatcccag 
ccagcctggc 
tggtggtaca 
cccaggaggt 
agagcaagac 
gtggactgct 
ctctagaaaa 
gttggccagg 



actcttgagg 

tcaggcctgg 

aacgcgctgg 

atcagcatct 

tctagggcag 

agtgaaatta 

cttggcctta 

gtgggtttgg 

caggtcatcg 

agcgaaggcg 

ccaaacaacg 

ccatggatat 

tgtatcttct 

agtggtcctc 

agtgcagtgg 

caccgtgcct 

tttatggaca 

aacctgcctc 

tccaccctcc 

ggaatctgac 

tttccttcct 

cagtttccct 

ccctccggag 

agcccaggtg 

ctggtcccct 

ctcctgtgac 

taggcttatg 

gtctgagcct 

cacttgacag 

ttcagggaag 

tgtggctatt 

cgagcaggag 

gtgaacgggt 

ttgccagtca 

atcacttgag 

agctggggag 

cagagtgaaa 

cctgtaattc 

agaccagctt 

gtacacgtct 

tggcagaggt 

agaccctgtc 

aagaaaacat 

caggccccta 

ggccctggtc 

ggctgttgtg 

ttgactgcac 

gggggccggg 

agatcacgag 

aaaaatacaa 

aggctgaggc 

cgccactgca 

aaaaaaaaga 

gactttggga 

caacacagtg 

cacttgtagt 

ggaggttgca 

tctgtctcaa 

tgagcccagg 

caaatttaaa 

ctggtatcaa 



gaggcgggac 

cg.tcctgaac 

cgcggctcag 

gcccgctcag 

atgggcttta 

cgtgggctgc 

agtgccgtga 

agaaggcctg 

tttataatcc 

ttttcgttgt 

aatattcccc 

cgcctgcttc 

ccctctgccc 

cctttttttt 

cgcgatcttg 

ggctggtcct 

tctccttcat 

aagggtttct 

cggtgggttt 

tgtcccccca 

tgagtttttt 

cctcctttat 

ctgctgttct 

cctcgctgga 

gctttaacca 

aaatttgaag 

acctcctgtc 

cagtgtcctt 

acatgtcaga 

gtgttcccga 

tgaaggaaga 

agtgtgcaga 

gacaggtcag 

ttgttgctca 

cccaggagtt 

ttcgaggttg 

ccctgtctga 

tagcaccacg 

gggcaacatg 

gtaatcccag 

tgcagtgagc 

tcaataaaat 

ctttttcata 

aaagtcttgt 

cgcagcagag 

gggaacagac 

tggccctgga 

cgcggtggct 

gtcaggagat 

aaaatttagc 

aggagaatgg 

ttccagcctg 

aaaaaagaaa 

ggctgaggct 

aaacctcgtc 

cctagctact 

gtgagccgag 

aagcaaacaa 

agttggacac 

tttttttttt 

actcctcacc 
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tcaggttatc 
gctcggccct 
ctgtaatccc 
gaccagcatg 
tgtggtggca 
aacccgggag 
ggcgatagag 
cagtggctca 
gtcaggagtt 
aaaattagcc 
ggagaattgc 
ccagcctggg 
cacagtggct 
agttcagggg 
tgaaaattag 
cacgagaatc 
actctagcct 
attaaaaaaa 
ccagcacatc 
gaatcagcaa 
gcctgagggt 
tgccccaact 
gcctacatct 
cacctggtga 
ggagcccaag 
ttcagatcag 
agaattgata 
cattcacatc 
ccacttggcc 
gtagtaagag 
gtttgggtga 
cagggcctct 
aagagcctcg 
tttgcctgtt 
tttcttctaa 
ccaggacctc 
ggtccaagat 
gacaggggca 
actcagctac 
tgtgccaagc 
ctcaagagtg 
gagactgaag 
gattaggtgc 
tgcagtggct 
aggtcaggag 
tacaaaaatt 
gcaggagaat 
cactccatcc 
ttatgtgggc 
gggctgatca 
ctactaaaaa 
cgggaggctg 
atcgcgccac 
aaaaaaagaa 
tatggccagg 
cacctcaccc 
ttggtgttcc 
ggtggggccg 
gcagaggggt 
gggaaggagg 
gacagcaatg 



cacccaactt 
aaaaaaattt 
agcactttgg 
gccaatattg 
tgtgcctgta 
gcggaggttg 
cgagactcag 
cgcctgtaat 
ggagaccagc 
aggcatggtg 
tggaacccag 
cgacacagca 
tatgtctgta 
tttgagacca 
ccaggcgggg 
acttgaaccc 
gggctacaga 
gaaaacacac 
cgggcaacgt 
ctcctgctgc 

ggggtggtgg 

ccaggtacaa 
tcagacccaa 
aggtcaggga 
agggtggtgg 
tgggctaaga 
ttcatagctg 
tgctgtggac 
catgaatcag 
gagcttctat 
gtctcagggt 
cattgatgtt 
cgtttcatgt 
ctaacagagt 
caaaaagtct 
ctcttctatc 
agctgctgaa 
aggaacattg 
tgatagcctc 
tagaatccta 
atgtggtaag 
caggaggatc 
ctacatagtg 
cacgcctgta 
ttcaagacaa 
agccaggggt 
tgcttgaacc 
tgggcaacaa 
caggcgcggt 
cgaggtcagg 
tacaaaaaat 
aggcaggaga 
tgcactcccg 
ttatgtggcg 
gttgggggac 
gtgtgcccgc 
caggtggttc 
atacctgtgg 
ttatccaagg 
tcatcagccg 
gccgggagat 



ggtctcccaa 
taaattgaga 
gaggctgagg 
tgaaaccctg 
gtcccagcta 
cagtgagcca 
tctcaaaaaa 
cccagccctt 
ctgaccaaca 
gcatgtgcct 
gggcagaggc 
agtttctgtc 
atcctagtac 
gcctggccaa 
tggtgggcac 
gggaggtggg 
gtgagactca 
acacatacac 
ttgatcctga 
ctgttcgcca 
tgctcggcat 
cgccagtata 
ccaacagaaa 
ctaggaatga 
gccatgtgta 
cccacagatc 
gtcgtatgag 
gtttacatac 
gagagccaaa 
ctggagtgaa 
gaatgcctat 
tgactttcat 
gagtcccttt 
tacccaaatt 
ggctgttagt 
ttgttgctct 
gctccagccg 
ttgacaccac 
agggaagact 
ttctatgggg 
ccagatgcag 
gctgagcata 
atgccctgtc 
atcccagcac 
gtctggccaa 
ggtggcaggt 
caggaggcgg 
gagcaaaact 
ggctcacgcc 
agatcgagac 
tagccgggtg 
atggtgtgaa 
cctgggcaaa 
gtgactgaag 
atttgtagga 
agacaccctt 
gcctgtaccc 
ggtgagtggc 
ctcacaacct 
ttttgacaca 
cctggagagg 



agttctggga 
aaaaaagggg 

ggggggtgga 

tctctaataa 
ttcaggaggc 
agatcacgcc 
aaaaaaaaaa 
tgggaggcca 
tggcgaaacc 
gtaatcccag 
tgtagtgagc 
tcaaaaaaaa 
tttgggaggc 
catggtgaaa 
ctgtaatccc 
gggtgcagtg 
gtctcaaaaa 
acacaaaccc 
cacagggctg 
gaccttcttc 
ggagctaggg 
ggtgacaacg 
ccgctgcctg 
tgagtgggca 
gagtgagggt 
agcgggggaa 
agtctcccca 
ttagttttgg 
tatcactgct 
tatttgagag 
gggtgcatct 
agatgacaaa 
gagatgaatg 
aactgtgact 
ccagagaaag 
gccttcccca 
ccacatttgt 
ttggatcact 
ggacaatata 
caatggtgaa 
tggctcatgc 
ggagttcaag 
tcaaaaaaaa 
tttgcgaggc 
catggataga 
acctgcaatc 
aggttgcagt 
ctgtctcaaa 
tgtaatccca 
catccttgct 
tggtggcggg 
cccgggaggc 
agagcgagac 
gttgaccttg 
ctcacctctg 
ggtgcaggag 
aggacagcgg 
acaggctggg 
tgccatccca 
ccgctggaga 
aggtgggggg 



ttacaggtgt 

ccaggcgtgt 

tcatgaggtc 

aaatacaaaa 

tgagtcagga 

actttctgca 

caaaaaaccc 

aggcgggtgg 

ccgtctctac 

ctacccagga 

agagattgcg 

aaaataaaaa 

caaggtgagc 

ccctgtctct 

agctacttgg 

agccaagatt 

aaaaaaaaga 

atctgtggac 

ttgatggaga 

tggtaaggga 

ccccttacct 

aaagtgacca 

tgagccgctg 

gttgggaatg 

aggggaggat 

atatttgagg 

taggacacat 

aaattggggg 

tgaagaattc 

tccacccttg 

ggctaggaac 

actcacattt 

tctgcatatc 

agaataagat 

tggggtggtt 

atattcagtt 

gtttgagtca 

ttctatgggg 

gtctttatcc 

caggaggcgt 

ctgtaatcac 

accaggtggg 

aaaaaaaaaa 

agaggcaggc 

aaccccgtct 

cctctactcg 

gaggcaagat 

aaaaaaaaaa 

gcactttggg 

aacacggtga 

tgcctgtagt 

ggagcttgca 

tccgtctcaa 

atgtaggcct 

ctcaggtaca 

gtgcaccaga 

cacctggagc 

agaggggtgt 

ttgggtacag 

caaagggacg 

tgactgagag 



gagccaccac 
tggctcatgc 
aggagttcga 
attagccagg 
gaattgcttg 
ctccagcctg 
aggccaggtg 
attacctgag 
taaaaataca 
ggctgaggca 
cccctgcact 
tccggctggg 
agatcacttg 
actaaaaata 
atagctgagg 
gcaccactgt 
aaaaagaaaa 
ccttttctgc 
ttatgaacat 
agatcaccag 
gactctcacc 
ggcctcaggt 
ggctcagatc 
gggaagttat 
ttacttttcc 
gtccttccat 
catacaaagc 
tggggcattc 
accaaagtct 
gattgattgg 
agtggtttcc 
gtccacattc 
tgtatatagt 
agaagattat 
ccataatcac 
tccacctcat 
gtagcagagg 
caaactggtc 
caggtagcca 
ttatgccacg 
agcactttgg 
agcatttgct 
agaggccggg 
agatcatctg 
ctactaaaaa 
ggaggctgag 
tgtgccactg 
gaaaaaagaa 
aggccgaggc 
aaccccatct 
cccagctact 
gtgagccgag 
aaaaaaaaaa 
taatggggtt 
gactgacatc 
acttctcagc 
tagagtggtc 
ggaagcaagg 
cgacacctgg 
cttctacaca 
cactgagggg 
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16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
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gtggtctgtg 

tgggagacag 

cacgcctgta 

cgagactatc 

ccgggtgtgg 

gcgtgaagcc 

gggtgacaga 

agatcccaag 

actgaaccag 

cacggtagct 

tctgcagcat 

actcccagga 

gcctgagaga 

gtcctcccaa 

ttcaggcccc 

ttgctcttgt 

ctcctgggtt 

caccatcacg 

caggctggtc 

gggattacag 

cctctccgtt 

aggcccccct 

gcctctgctg 

tttaatttct 

aaacccacga 

aaggacgatg 

cgagggcgcc 

ctggcggagc 

tacaatctcg 

cggattgaag 

tccatttggc 

gcccagtgcc 

gcccctaaac 

ccgcccccag 

caaagtctca 

acccatctcc 

gccttgcccc 

tcggccccgc 

ccgcagttct 

agctggggcc 

tccggacgta 

agaaggagat 

ctcatctggg 

tgcaggagac 

caacaaacac 

cggcaggtgg 

gggtatgggg 

aaaccaggaa 

aatccccatc 

tacgaggaca 

gccggggaag 

agctgtgtct 

cctcactcct 

gctggacccg 

tcaatggaag 

tgctccgggg 

ctactaagac 



gtgtgttggg 

aggacgaagg 

atcccagcac 

ctggctaaca 

tggcgggcac 

gggaggcgga 

gcgagactcc 

cctgatcagc 

acggagcccg 

ctcccccatc 

ctcaaagctg 

tggaaacatg 

tggctcgctg 

cctggacccc 

actaagctga 

tgtccaagct 

caagtgattg 

cccggctaat 

tcgaactcct 

gcgtgaacca 

tttctccatc 

gttaggcatt 

gggttgccct 

tttgacattc 

ctctcacccc 

gacgcggagt 

acctggtgct 

aggaggtcct 

gggctcctcc 

tctaccaggg 

cataattagc 

agcccagtcc 

caccaactca 

gctctgtcct 

ggtcacatca 

ccattggctc 

tctccggcga 

ccttctccat 

cagggctgcg 

ccgaaatggt 

acctgagcgc 

cggagagagg 

cccatcccct 

cacgctggtg 

aggtggggcc 

gggcaatgtg 

cagagtccag 

tgggtcttga 

catgtctagt 

gtggccagat 

caggttaaga 

agacccaggg 

ccttccccct 

gccaacatca 

gaggtggatg 

gcagaccaga 

tcaggtcgct 



gcccaggggt 

gagagtgaag 

tttgggaagc 

tggtgaaacc 

ctgtagtccc 

gcttgcagtg 

atctcaaaaa 

tcacccccaa 

tggcaggaaa 

ctgcacctcc 

cctggggttg 

cagctgactg 

gagctcatgg 

tgctggacct 

ggactccgtt 

ggagtgcaat 

tcctgcctca 

tttggtattt 

tacctcaggt 

ccgctcccgg 

tcctgcccag 

tacctccttc 

accagaccct 

ccacacgccc 

ctcctggcca 

atcggagcca 

gctggacaca 

ggcccctcag 

gcgcacgcag 

agccgggttt 

cccaaaccct 

caacccccct 

cctccggccc 

gcctcaccca 

gttctgctcc 

agtcctccct 

aaccccgccc 

ccaggccccg 

cagggacctg 

gctgctgcgc 

ccccgttacc 

caagagagag 

tccccgcagg 

gccaaccagc 

ctggtcaggg 

tgaggcatgg 

acccaggtta 

ggtttgtggg 

tctgggaggg 

tgagggtatg 

gtctgggatt 

gaggaagcat 

gcacctctcc 

cgctggaacc 

gttaggtctg 

ctctgactct 

ctgtg 



ggtgagggaa 

ggcgggggag 

cgaggcgggc 

ccgtctctac 

age tact egg 

agecgagate 

aaaaaaaaaa 

ccccaggcgg 

ctactatcca 

ccacctcgat 

gggttgactg 

tgctgactga 

tgagtgggtc 

tgaaggctgt 

tctttctttt 

agtactatct 

gccttccgag 

ttagtagaga 

gatccactca 

ccaaggactc 

gctccctcca 

cggcctggtt 

caccttcatc 

ttacccagtt 

ctctcccccc 

ctaatggaga 

gcccaggctg 

gtggtgctgg 

gtgaggggca 

gggtgcggag 

cgggccaacc 

ggactgtgta 

cgcccatttc 

gccccgcggc 

ategggaetc 

gtcacggccc 

ctacaaccta 

cccctctcca 

ccgccctcgg 

ttggagcacc 

ttgaacttga 

ggagagaaga 

acctgttctc 

tccgcgaggc 

gtagggaagg 

gatgttggcc 

ggggcttaaa 

tgctgtttag 
tgttcatcac 
aggacattgg 
gaggecctgg 
cgtggggtga 
aggccccaca 
catggaaatc 
ctgggatggg 
cctcttgggg 



atttgetgat 

ccaggcaggg 

agatcacaag 

taaaaataca 

gaggctgagg 

gcgccactgc 

aaaaaaagag 

gattatcgac 

gtcaacaccc 

agaaagggaa 

ccctctactt 

ccgctcccag 

agagccccat 

ttctggccca 

cttctctttg 

cagctcactg 

tagctgggct 

tggggtttca 

cctcggcctc 

catttctgtg 

cgctctggcg 

cccatgccta 

ctcatccgtt 

tctggcccaa 

aggtgcaccg 

aegggteggg 

cagccgccgg 

ccccgggtgg 

geggggtagg 

tttcctaggc 

ccatcccctg 

gaggcacagc 

actttagccc 

ttcagccccg 

gaaccctaag 

tggccgctcc 

gccctgcctc 

actcagcccc 

tgcacctgct 

agtttgccgt 

gggtgagaag 

gaaacctggc 

caccttcacc 

agcctccagg 

ggtggagtct 

aggacccaaa 

ggatctgaaa 

tcccaagaac 

ccctggggtc 

acctgggggt 

agacagattc 

gtcagaggaa 

ccccaccaaa 

cgcactttcc 

ccctccaagc 

ctgctgccat 



tacatgagtg 
tgcagtggct 
gtcaggagat 
aaaaaatcag 
caggagaatg 
actccagcct 
ttggggagcc 
ccacctggaa 
ggatttacat 
tcacccctta 
tcacccttca 

gggggcagca 

ccgagccagg 
gttctctget 
agaeggagtt 
caacctctgc 
tacaggcaca 
ccatgttggc 
ecaaagtget 
tgtggctttt 
tgccatatcc 
cttggatcct 
cctcccaccc 
ggacacccac 
aaggctgctg 
ggcgtgggtg 
acaccggctc 
cggcgccgcc 
cagagaggac 
teagegggga 
ggcagtcccc 
cctagctcat 
cgccctctca 
ccgtaactcc 
cttgggaccg 
teteggctga 
ctgacctggc 
gccctctctc 
cacgctggcc 
aggagaggat 
ggcaaaattg 
tttgccccaa 
atcacccgcc 
ctcaagtgga 
tacctggggc 
aaggtcatga 
tgggtgcggc 
aagatttgaa 
agattgaggg 
ggtattcggg 
ggggatatgc 
gtctgcacct 
ctccgtacca 
tggcctcagt 
ccaagcctcc 
taaaaegcta 



16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20245 
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<213> Homo sapiens 
<400> 593 

gcctttccag ggccggggaa ccccaggagg aagctgctga gccatgggcg cctacgcgcg 60 

ggcttcgggg- gtctgcgctc gcggctgcct ggactcagca ggcccctgga ccatgtcccg 120 

cgccctgcgg ccaccgctcc cgcctctctg ctttttcctt ttgttgctgg cggctgccgg 180 

tgctcgggcc gggggatacg aggtgagtgg ggcctccgag ctgaaacgta caggaggcag 240 

agtgaaaccc agaatacagt ctagaggtgt gggtgggtct gtcctgtggg tgtctagtga 300 

atggctgatg atatgacagt gtggtctgag tgcgtgcttt gtgtcattgc gaggtctggc 360 

tgtgcgcatc tgagtatagg actgtgtctg attgtacctg cctccgtgtc tccgggatgc 420 

ttgcctagac tttgtctgca cttaactgtg ggattggagg ggcaggaggt ggcagggggt 480 

gagcagtgta tgtgtggggg gaggtgctgg ctgagagctg ggactctgga gtctgcctga 540 

aattccagcc tagctcttac acttcctgag tgtgtgacgt tgggcaagtc acctatcctc 600 

tctaagcctc agtgccttca tctggaaact ggggataaca tcaccccacc tcccacggtg 660 

gccgtctgct gtcggcaaat gctgaacaaa catcagctac ttctattatt attttcccgg 720 

agtgtgaatg agagctgccc tgtggggttg tgcaaagtgg agttgtatct atgagcacaa 780 

ctatgtatgt gtgtgtcctg cctgggaggg cctggcctcc tctgcacata ggcgagatca 840 

ggcctctctg agtcactcac ctctagatca agacttactc tgcagccccc gacctcgggg 900 

gttattgaca aggtatgtgt gtttggggtc cctgtgcaga catgccccac agtgcagccg 960 

aacatgctga acgtgcacct gctgcctcac acacatgatg acgtgggctg gctcaaaacc 1020 

gtggaccagt acttttatgg aagtgagtag aggatgggga ctggtccctg ggatccccat 1080 

ggtccctgta atccctctgg gtcctggaca ttagggtggg gccagtgcta ccctaatatc 1140 

cagggtttgg gctcctctgt ctaggaataa cccccttggc tctgctgttc cctgagagcc 1200 

ttatccctgt tatccacagt caagaatgac atccagcacg ccggtgtgca gtacatcctg 1260 

gactcggtca tctctgcctt gctggcagat cccacccgtc gcttcattta cgtggagatt 1320 

gccttcttct cccgttggtg gcaccagcag acaaatgcca cacaggaagt cgtgcgagac 1380 

cttgtgcgcc agggtgagcc taccccaagg aagtgaaaag aggaagccca gcccagcttc 1440 

tgcttctgca tctctggttt ctgagatttg tcatgccacg tgcaagctgt ataacatgcg 1500 

tgtcgctccg cctgcctgga ctctccattt ggagacctcc tatacatccc acaaagcccc 1560 

acctgctgtg catcctctgg gaagcctgcc atgcaggggg cctctttccc atatctggga 1620 

actatggcct gggagcgacc cctcttgtcc ttccgccaga aatgtgtaca caggcaggct 1680 

ttcacattcc cagtatcacc caccctactc ccactcctgg ctctgaccgc tgaccctgac 1740 

cttgcctgtc ctggcacagg gcgcctggag ttcgccaatg gtggctgggt gatgaacgat 1800 

gaggcagcca cccactacgg tgccatcgtg gaccagatga cacttgggct gcgctttctg 1860 

gaggacacat ttggcaatga tgggcgaccc cgtgtggcct ggcacattga ccccttcggc 1920 

cactctcggg agcaggcctc gctgtttgcg caggtgcgac ccgggacctc tcttgggccc 1980 

acttcttcac tcactctggc tcctccctcg cccagtcaaa ccccgccctc tccctgcaat 2040 

ctcacaagga ccaggcccag gcctaggcct gttgaagccc tgccccttga gtgagccgta 2100 

aagccagtgg cttttgagct ctggcctcag ccggctatgc ccagcccagg ctgacccagc 2160 

tccggctggc tccgccctct cccctaatag gcccctcttg gtgttctggc cccacccact 2220 

agctcgggtc ctggctcctc cctaaaccgg gtggcaagtg gatgcctagg ctgccttaaa 2280 

aacaggttca ttacctgtgc tcagacccca tccatcctca ggctgtggaa gggggaacct 2340 

cattcctgga gtcaggcctg ctctgcgctt tgacagtgct ggggaggtga acctgggttc 2400 

tgatgttgga cccgccctct ccctgctagc ccaaggtggt gagttctgaa acttccccaa 2460 

gcttggaaat aagctggagg cctctctgtt tcagccctac gtgtttttgt ttttgttttt 2520 

tgagacaggg tcttgctgtc atccaggctg gagtgcagtg gtgcaatcct aactcactat 2580 

agcctcaatc tcccggattc aagcgattct cctgtctcag cccccctagt agatgagact 2640 

acaagagcgc accactacgc ctggctaatt tttaaatttt ttgtagacag tctgcccgtg 2700 

ttgtgcaggc tggtctcaaa aactcctggg tttaagtgat cctcctgttt tggcctccca 2760 

gaaggctggg atcataggca agagccacca catcgaccta gcactgcttt ttaacctgtg 2820 

ctctgacctg cccctcccaa gcagagggga gtttggtggt gagagggctg ggcactaatt 2880 

cacactgcct tttcctccct catccccaga tgggcttcga cggcttcttc tttgggcgcc 2940 

ttgattatca agataagtgg gtacggatgc agaagctgga gatggagcag gtgtggcggg 3000 

ccagcaccag cctgaagccc ccgaccgcgg acctcttcac tggtaggggg cttggtgagg 3060 

gcagggccag ccatggtgcc acacactcag aagggccctg ggcttgatat ctgctctgtt 3120 

gtcactgtcc tggaattcct atagtctggg aacaaaggcc ctgcatttcc ttttgcattg 3180 

ggacacaaat tctgaagccc atcctgggtg ggacatggcc ggctttgaaa ccagggaagg 3240 

tctgggtgat gggccacccc ttgaacttgg tgtgacctgc aggtgtgctt cccaatggtt 3300 

acaacccgcc aaggaatctg tgctgggatg tgctgtgtgt cgatcagccg ctggtggagg 3360 

accctcgcag ccccgagtac aacgccaagg agctggtcga ttacttccta aatgtggcca 3420 

ctgcccaggt aaccctggtg tccagaacct tcgagtccgg tatatacaat acaatgagct 3480 
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ctttccatgg tacaggcatc ccctaacacg ttctccttat ttttaatttt tttgagacag 3540 

tcttactctg tcacccaggc tggagtgcag tggtgcgatc tcggctcact gcaacctctg 3600 

cctcctgggt tcaaacaagc acatccagct aatttttgta tttttgcact ggggtctcat 3660 

catgttgtcc aggctggtct caaactcctg agctcaagtg atctgcttgc ctcggcctcc 3720 

caaagtgccg ggattacagg catgagccac cgcacctggc ctctattttt aacattttta 3780 

tttatttatt attatttttt tttttttgag acagagtctt gctctgtcac ccaggctgga 3840 

gtgcagtggc atgatctcag ctcactgcaa cctccgcctc ccgggttcaa gcgattctcc 3900 

tgcctcagcc tcctaagaag ctgggattag aggcacctgc caccacaccc agctaatttt 3960 

tgtattttta gtagacaggg tttcgtcatg ttgaccaggc tggtcttgac ctcaggtgat 4020 

ctgcccgcct caacctccca aagtgctggg attacaggtg tgggccactg tgcctagtct 4080 

atttttaaca tttttattga gaattccttt tttatttttt ttagactcac tctgtcgccc 4140 

aggctggagt gcagtggcac gatctcggct cactgcaacc tccacctact gggtccaagc 4200 

gattcttctg cctcagcctc ccgagtagct gggattacag gtgcccacca ccatgctcgg 4260 

ctaagtttta tgtcttttta gtagaaaggg ggtttcacca tattggccag gctggtctcg 4320 

aactcctaac cttttgaccc acagccttgg cctctcaaag tgctgggatt acaggcgtga 4380 

gccaccgcgt ccagccttta acatttttat aattaaaaaa cattattttt tcacagagat 4440 

aaggtctcac catgtggccc aggctggtct caaactcctg aactcaagtg atcctcctgc 4500 

cttggcctcc .caaagtgcta ggatataggt gtgagccacc atgcctggca taacacgttc 4560 

tccttaaaaa aatttttttt tcctttcttt aaaaattgat ggctgggcat ggtggctcac 4620 

gcctataatc ccagcatttt gagaggccga gatgggcaga tcatctgaag tcaggagttc 4680 

aagaccagca tagccaaaat gacgaaaccc tgtctctact aaaaatacaa aaattagtcg 4740 

ggtgtggtgg cgcacgcctg taatcccagc tacttgggag gctaaggcaa gagagtcgct 4800 

tgaacctggc aggtggaggt tgcagtgagc cgagatcacg tcacttcact ctagcttggg 4860 

cagcagagtg aaactctgtc tcaaaaaaaa aattgtttta gattaatttt tttttttttt 4920 

cttgagacag ggtcttgctg tgtggcccag gttggtctta aactcctaga ctcaagcgat 4980 

cctcctgcct cagtctcctg agtagctggg attacaggtg tgagcccctg taatcatgtt 5040 

ctcatgcccc ctggaggaag atgctattct attcaccatc acaatgtccc cctcctggat 5100 

ttatgtgcat tcctcattag aagagatagc ataggctggg caagagatag catggcatgg 5160 

tgactggatg ccctcttcta ggtagtgggt ccaagagaac tgctcaacaa ctggtggcta 5220 

cttttatctg tgtccccacg ctcccaagtg ccataccccc acccatgcct gtgcacccaa 5280 

catccttgac cccatataca tcaaaacaca gctatacaca gggatggccc aggatcctct 5340 

ggcttcagga ctcccctctt gcctgcaggg ccggtattac cgcaccaacc acactgtgat 5400 

gaccatgggc tcggacttcc aatatgagaa tgccaacatg tggttcaaga accttgacaa 5460 

gctcatccgg ctggtaaatg cgcaggtcag tgcgcctacc ctgtggtacc cttgtgcaca 5520 

tgtgcgcttg catccggggg ccttgggtta tgtgcatagc tctcagtgct gtctttgttt 5580 

tctattgttc tattgtggtc attctataac aaatgaccac acacttagca gctcaaaaca 5640 

acagaaatac attgtcttac agttctgtag gtaagaagtc cagcatgagg ccgggcgcaa 5700 

tggctcatgc ctgtaatccc agcatgttgg gaggctgagc cgggcagatc acgaggtcag 5760 

gaattcgaga ccagcctgac caacatggtg aaacgctgtc tctactaaaa atacagaaat 5820 

tagctgggtg tgatggtgcg tgcctgtaat cccagctact cgggagcctg aggcagggga 5880 

atctcttgaa tccgggaggc ggaggttgca gtgagcggag attgtaccac tgcactccag 5940 

cctgggccac agagaaagac tctgtctcaa aaaaaaaaaa aaaaaaaaaa aatccagcct 6000 

gagtctcacc aggctataat caaggtgttg gcaaggctgt gttccttctg cagtctctag 6060 

gagagaatat aatttccttg ctttttccaa catctagaag ttacccacat tcaaaatcta 6120 

ttcttggctc catcttcaaa gccagccaca tagcatcttt ctgaccctgt ttctgtaatc 6180 

acatcccttt ctctcattct cacctcttct gcctctctct tccacatttt atttatttac 6240 

ttagagacgg agtctcgctc tgtcgcccag gctggagtgc agtggcgtgc tctcggctca 6300 

ctgcaacctc cgcctcctgg gttcaagtga ttcttctgtc tcagcctccc aagtagctgg 6360 

gactacagtc gcgtgccacc acgcccagct aatttttgta tttttagtag acagggtttc 6420 

accatgttgg ccaggatggt ctcgatttct tgacctcgtg atccaccccg ctcggcctcc 6480 

caaggtgctg ggattacaga tgtgagccac tgtgtggcct aatttcccta ctttaagttt 6540 

ggctgatgaa caacctcaat tccatctgca accttaattc cccttttgcc atgtaatcta 6600 

atgtggtaat aggttctggg gatcaggaca tggacacttt tgggcagtta tcattttacc 6660 

caacacagat gtgttagtgt tttgcactaa gtggcctgtg gctgtggctg tgtgcacagt 6720 

cagtcactct catcagcaca gaaacttgtc tcccctctcc catctcagac accccttccc 6780 

atcaatcttg tcctcactac ggtgcaatct ccattcccac cttccactcg ctcccagcac 6840 

tgctatttca ccaaaacagc tcttttatgt catttatttt tatttatttt ttctttttaa 6900 

aattttattt atttatttat ttattgagac aagagtcttg ctttgtcacc caggctggag 6960 

tgcagtggca tgatcttgcc tcatggcaac ctctgcctcc tgggttctag tgattctcct 7020 

gcctcagcct cccaagtagc tgggactaca ggcatgtgcc accaaccctg gctaattttt 7080 

gtatttttag tagaggcagg gtttcactat gttggccagg ctggtctcaa aactcctgac 7140 
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ctcaggtgat ccgtccgcct tggcctccca aagtgttggg attgtaatct gaggtgggcg 7200 

gatcacttga agacaggagt tggagaccag cctggccaac atggtgaaac cttgtctcta 7260 

ctaaaaatac tacaaattag gtgggcgtga tggcactcat ctgtaagacc agctactcgg 7320 

caggctgagg caggagaatc gctggaacct gggaggcgga gi:ttgcagcc agctgagatc 7380 

gtgccactgc actccagctt gggcgacaga gtcagactca gtctcaaaaa aaaaaaaaaa 7440 

aaaaaaagtt tattgagcac ctactgtgta cattggggga cacagctcgt gcaaaacaaa 7500 

catccttttc ctcacatagg tcactttctt gttcctccac cttgctcagg tgaaagtgca 7560 

cctccattca ttataaaaat tgtgtctagg ccgggcacgg tggctcatgc ctgtaatccc 7620 

agcactttgg gaggctgagg cgggcggatc atgaggtcag gagatcgaga ccatcctggc 7680 

taacacggtg aaactccgtc tctactaaaa aatgcaaaaa attagccggg tgtggtggcg 7740 

ggcacctgta gtcccaggta ctcatgaggc tgaggctgga gaatggcgtg aacccaagag 7800 

gtggagcttg cagtgagctg agattgagcc actgcactac agcctgggca acaaagtaag 7860 

actccgtccc aaaaaaaaaa agttgtgttt tttggtaaca atctgacagt cctgcaaaac 7920 

gattattcct gccctataat gtcacactta gtttttccac aaaagttttt aataataaag 7980 

ttggaattat tgtaaaagtt tagtaaaatt tcaggtttat tcttgcatct ctgaaatcct 8040 

tttaaaaaag gctagtggtc atgtttgact tcagccaaaa ttcattttca cacccaacca 8100 

ttggcttgca cgccaaatat gtatttagag aaactgaacc tatgggatgc atgaatgtgc 8160 

acacatgtgt ggaagtgtgg gcccccagga aggtgcggac tcccgagggc tcactccgtc 8220 

gcctccccca gcagcaggca aaaggaagca gtgtccatgt tctctactcc acccccgctt 8280 

gttacctctg ggagctgaac aaggccaacc tcacctggta tttggggaaa ctggggagct 8340 

tgggggggtt ggcatgcccc gtgggtcatg accctgccct caatgcccct gccgctgtag 8400 

gtcagtgaaa catgacgact tcttccctta cgcggatggc ccccaccagt tctggaccgg 8460 

ttacttttcc agtcggccgg ccctcaaacg ctacgagcgc ctcagctaca acttcctgca 8520 

ggtgggtagg agccgggcta gagggggcat gcagccccga ggcccgacag gctgggcgcc 8580 

ccaacatacc cctctgcctc caggtgtgca accagctgga ggcgctggtg ggcctggcgg 8640 

ccaacgtggg accctatggc tccggagaca gtgcacccct cagtaagtgt cgggcccaag 8700 

aggggaagag gtttgcggct gaagttggaa accaccccta ggccgccccc ctcgagtttc 8760 

ttcttttttt tttttttttt tttttttttt gagacggagt ctcgatttgt ctcccaggct 8820 

ggagtgcagt ggtgcgatct cggctcactg caagctccac ctcccgtgtt cacgccattc 8880 

tcctgactca gcctcccgag tagctgggac tacaggcgcc caccaccacg cccggctaat 8940 

tttttgtatt tttagtagag acggggtttc accatgttag ccaggatggt ctcgatctga 9000 

cctcgtgatc cgcccgcctc ggtctctcaa agtgctggga ttacaggcgt gagccaccgc 9060 

ccccagccgt cctcgagttc cttcttaaag cctctaagaa tcctgcccgc cagcaccgga 9120 

cctttcgctt ccccttgggg tctcagctga gtcccacaga acctcaccgg actcattgtc 9180 

tatgagcaga tgaggcgatg gctgtgctcc agcatcacga cgccgtcagc ggcacctccc 9240 

gccagcacgt ggccaacgac tacgcgcgcc agcttgcggc aggctggggg ccttgcgagg 9300 

tgcgcggggc gagacttggg agacacgggg gtggagacag gaaggggcgg ggccagggcc 9360 

tgggaaaggg gacagagaca ggtgtgaggc gtagccgaga gccctgtggc ggggctacaa 9420 

gggctcgtgg gggcggggct tgtaggaggc ggggaaagat ■ acaggaacgg ggcggggctt 9480 

tggagggggg aaggaggcgg ggcgtgggca agaggaggcg gagacagcta tggggtatag 9540 

tcaagggcag cagggtgggg ctagaagggg ttttggggcg actcttgagg gaggcgggac 9600 

agagaccgga acggggcggg gcctgaggag aggggaggag tcaggcctgg cgtcctgaac 9660 

ccaccggtcc ctttgcgctc ttccgcaggt tcttctgagc aacgcgctgg cgcggctcag 9720 

aggcttcaaa gatcacttca ccttttgcca acagctaaac atcagcatct gcccgctcag 9780 

ccagacggcg gcgcgc 9796 

<210> 594 

<211> 3844 

<212> DNA 

<213> Homo sapiens 



<400> 594 



tcatgcctgt aatcccagca ttttgggagg ccgaggcggg cagatcacaa ggtcaggaga 
tcaagaccat cctggctaac acggtgaaac cctgtctcta tcttagtaat actttattaa 
ttagtctggg cgtgtggcgg gcgcctgtag tcccagctac ttgggaggct gaggcaggag 
actggcgtga acccgggagg tggagcttgt agtgagccga aatcacacca ctacactcca 
gcctgggcga cagagcaaga ctccgtctca aaaaaaaaaa aaaaaaaaaa aaagctctct 
gattttagct gttaggtggg agatgggttg gaggatacca aagccagctt gcaggctatg 
gggataaaga aaacacctgg atttcggacc tactttatag gtagaggttg gcagacatgc 
tgagaaggat ggatgtgggg tgtgagaaaa ggggagacag caagctgcgc tcctcatttt 
taatttttat ttatctattt ttgagacgga gtttcgctct gttgcccagg ctggagtgca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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gtggcaccat 

cagcctcctg 

tttttagtag 

ggtgatctgc 

cagccccaat 

atgcagaaag 

gaggatgtca 

cttttgctgt 

attttctcag 

ctccaactaa 

attgcgctgg 

cagatcacct 

tactaaaaat 

ggaggctgag 

cgcgccattg 

aaacaaaaaa 

ttgtgggctc 

gcctagcacg 

cctgttggcc 

tcacattctt 

cagtgggagt 

gaaggaagaa 

ccgggctcac 

tgcctggtga 

cagtggctca 

caggagatcg 

aaattagccg 

gagaatggca 

ccagcctggg 

ctcataaaaa 

atgacctgag 

cctggcgaca 

cctgtaatcc 

actaccctgg 

cgtggtggcc 

aacctgggag 

acagcgcaag 

ccatcctccc 

tgggtctcca 

gtaatctcag 

ccaacatggt 

tgtctgtaat 

agaggttgca 

aactcttgtc 

tcatgggaag 

gtaatcccag 

ccagcctgac 

ggcggtgcat 

ccggaatgtg 

agagcgaaac 

gcgccgggcg 

atcacgaggt 

attagccggg 

gaacagcgtg 

agcctgggtg 

aggg 



ctcagctcac 

agtagctggg 

agacggggtt 

cctccttggc 

ccctattttt 

atcagggata 

aggaggatgt 

tctgtcttcc 

ccttctggct 

aatcccgaac 

gcgcggtggc 

gaggtcagga 

acaagaatta 

gcaggagaat 

cactccagcc 

ggcctgatag 

cagtagggca 

gagcccacca 

gtatgcactg 

gcaggaggga 

tcagggaagg 

cagagcgtta 

tatcccatta 

ccgaaatctt 

cgcctgtaat 

agaccatcct 

ggtgtggtgg 

tgaacctggg 

cgacagagcg 

aattagctgg 

tccgggagtc 

gagtgagacc 

cagcactttg 

ctaacacggt 

ggcgcctgta 

gcggaggttg 

actctagctc 

ctcccgaggg 

gcaggtggca 

cactttggaa 

gaaatgccat 

cccagcaact 

atgagtcaag 

tcaaaaataa 

tcgagaacac 

cactttggga 

caacagggag 

gccagtagtc 

gaggttgtgg 

cctgtatcaa 

cggtgggtca 

caggagatcg 

cgtggtggtg 

aacccggaaa 

acagagcgag 



tgcaacctcc 
attatgggtg 
tcgccatgtt 
atcccaaagt 
cagtctagtg 
ggtggagagg 
ctgtctcctc 
catgtgcaga 
ttggctcaaa 
tcctccacct 
tcacacctgt 
gttcgagact 
gccgggcgtg 
tgcttgaaac 
tgggcaacaa 
cactcaccac 
gggtccacat 
tagagaatag 
ttaggagggt 
aagagacgtc 
cttcccagag 
tgcagcatgt 
gacaaaagtg 
gtccatgggt 
cctggcactt 
ggctaacacg 
cgggcacctg 
aggcagagct 
agactctgtc 
gcacgatggt 
agaggttgca 
ctatcaaaaa 
ggaggctgag 
gaaaccccgt 
gtcccagtta 
cagtgagccg 
aaacaaacaa 
gacagaacag 
ttaaaatagg 
agcccaggcg 
ctctactaaa 
cgggaggctg 
attgcaccac 
attaaataaa 
atgaaaacaa 
ggctgaggcg 
aaaccccgcc 
ccagctactc 
tgagctgaga 
aaaaaaaaaa 
cgcctgtaat 
agaccaggtg 
ggcgcctgta 
gcggagctgg 
actccgtctc 



acctcccggg 
tgcgccacca 
gtccaggctg 
gctgggatta 
gatgagtgga 
gagctgaggg 
caacaaggta 
aatggcctct 
tctcacctcc 
aaagactcaa 
aatcccagca 
agcctgacca 
atggctcatg 
tgggaggcag 
gagcgaaact 
tattgttaac 
cttggtcttg 
actacatgaa 
gtgtttgaga 
agccctgggt 
gaggtggagg 
gaaggctttg 
gctgaggaag 
gacgcttaag 
tgggaggcca 
gtgaaacccc 
tagtcccagt 
tgcagtgagc 
tcaaaaaaaa 
gcacactagt 
gtgggccgag 
atagcagcag 
gcgggcggat 
ctctactaaa 
ctggggaggc 
agatcgtgcc 
acaaacaaaa 
aaacgaatgg 
aattttggct 
ggcggtcacc 
aataaaaaat 
aggcaggaga 
cgcactccag 
taaataataa 
gtaaaggccg 
ggcggataac 
tctactaaaa 
gggaggctga 
tcgggcaatt 
aagaaaaaaa 
cccagcactt 
aaaccccgtc 
gtcccagcta 
cagtgagctg 
aaaaaaaaaa 



ttcaagtgat 
cacccggcta 
gtctctaact 
caggtgtgag 
gggtggggcc 
ggcaactttg 
ggctcattcc 
ctgcatcttc 
ttggcgtcct 
attatcctgt 
ctttgggagg 
acatagtgaa 
cctgtaatcc 
aggttgcgat 
ctgtctccaa 
tttctatctt 
ttcaccacta 
ttgtagagtg 
tacagatgac 
cccagaagag 
tgacagctgc 
gagacttgtg 
gatgaaactg 
aagtgaccct 
aggcgggcgg 
atctctacta 
tactcgggag 
cgagattgcg 
aaaaaaaaaa 
cggaatgctg 
atcgcgtcac 
gccaggcgcg 
cacgaggtca 
aatacaaaat 
tgaggcagaa 
actgcactcc 
cagcaacaac 
gcgagtgccg 
ggggacggtg 
tgagggcaga 
tagccaggcc 
atcgcttgaa 
cctgcgtaac 
aaataaaaaa 
aagcgcagtg 
ctgaggtcgg 
atacaaaatt 
ggcaggagaa 
gcactccagc 
aaggaagaaa 
tgggaggccg 
tctactaaaa 
ctcgggaggc 
agatcgcgcc 
aaaaaaaaaa 



tctcctgcct 600 

atttttgtat 660 

cctaacctca 720 

ccaccgcgcc 780 

gtttgctgag 840 

aggtccaagc 900 

agcctcacgg 960 

acatacgggg 1020 

tcacccatcc 1080 

ttaaatcctg 1140 

ccgaggcagg 1200 

accccatctc 1260 

cage tac teg 1320 

gagtggagat 1380 

aaaaaacaaa 1440 

cccaatcaga 1500 

aatccttagt 1560 

agtgaataat 1620 

acccagggtc 1680 

gccactgacc 1740 

agctataagg 1800 

agggcacgaa 1860 

tgtctaactc 1920 

cggccgggcg 1980 

gtcacgagtt 2040 

aaaacagaaa 2100 

gctgaggcag 2160 

ccactgcact 2220 

aaaagtgacc 2280 

gggtgggagg 2340 

agcactccag 2400 

gtggctcatg 2460 

ggagatcgag 2520 

attagccggg 2580 

gaatggcgtg 2640 

agectgggag 2700 

aacaacaaaa 2760 

ggccaagcag 2820 

gctcacacct 2880 

accagcctcg 2940 

tggtggtggg 3000 

ccaggggggc 3060 

aagagcgtgt 3120 

gaatcttcat 3180 

gctcacgcct 3240 

aagttcgaga 3300 

ageegggea t 3360 

tegcttgaac 3420 

ctgggcaaca 3480 

aggectaaag 3540 

aggegggega 3600 

atacaaaaaa 3660 

tgaggcagga 3720 

actgcactcc 3780 

aaagaaagaa 3840 

3844 



<210> 595 

<211> 4126 

<212> DNA 

<213> Homo sapiens 
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<400> 595 

tgcagtgcag agttccttta tttgggggca gtgcccaggc cagttggtgg aaagaggcag 60 

gcatacaacc cactgtcagg ctggggggcc cagcaggggc gatggaggag acgaggtggt 120 

tgagggattt tctcagctgc aggttccagg cccaggacag gaggagatgt caggcatcag 180 

acactgagcc tgcttggtgc ccgcaggagc caaaaactgg cggccagagt ttttcccgcc 240 

cccgcccccc gcttacccac aagggtgccc agaggccacc ctctgcatcc tcctttcccc 300 

tcaggatgga gtccaggccc aaaggggtca gattctgcag aggccaaaga ggacactcag 360 

gaaggtgaca ctgcctcagc cagccaagga ccccctgcct cgagaggagg cagcagagga 420 

agagcaaaca ctttaacaca ggctgtggcg acccgcccca cagcacacac ggcacacaga 480 

gggcctcagg cacagctccc cttctcagag caaaacccaa ctagacacaa ggcccagctc 540 

ctgcccacag ccacagaaac gcgggccaca ggcttgcctg ggctggcacc tactggttct 600 

cttccctgct ggcagcatga agggaaaaga cacaggaggg ctcagctggg ctcctagact 660 

ggcccaggct gagtcctggt cccagccaac atctgacaaa ggaaacaccc caactggagg 720 

agacaaaagg ggtgtggctc acatggagca cgctgtcacc gtcctgaggg cttgcccatg 780 

gagaagacag caccctgggt actgaagggg aacggtgggg caggaaccag gaggtgagct 840 

gggactcagg tacttgctgc tggccaggta ctcacagcgt ctggtaggtg ctgtccttgg 900 

ccttgaggaa gtgcagcaca aagaaagcca ccgcctgtag gctcaacacc agcacgacac 960 

ctccgataaa gctggccccg tcaaatccag ggctgtgggc ctcagggact ggggggctcc 1020 

ctgcaggggt gaggcagggc aggggtcata gcagtcactg tgaaccagga tcccagtcct 1080 

accctcataa ctggcacaag taacctggag gccccggtcc tcaatatggg gtgcttaaac 1140 

cctgggctcc gtttagtcca gatcccgcag gcagttggga agcgcctggt gggaggaaca 1200 

gtggcagggg agggtggggt gctgtccagc cctggctctg ctcctggctt gctctgtggc 1260 

ttttatgcca cccacattct ctctgggcct catgatcaca tggtccaatt ggtggcttgg 1320 

accagtgact gggctcccac cctccctggg cctcaccagc ccaatagggt gaagacagag 1380 

ccagaactca gaccctcccc tgctcacaat tcctttcagc cttggacctc agggttcaga 1440 

gccaagatgc agacgatata gtcccttttc aggactccta agccctaaga cctcagtgtc 1500 

acccaatgtc agagccctca gagagggaag gagcaggggg agaagtggag gccccaactg 1560 

tcaccctgct aagtgggtaa gcgaggactg gacagtgtgt gtcaggggga acatggctga 1620 

gggacaggct tggccaaagt cacctcacag gacatagcca gactcacaca gggtcccctg 1680 

accatctcct ctgcccctta gaaggtaggg ctttagggcc atggttgcca ggctatgcct 1740 

gggtggctcc cagctgtgcc acggtacctg tccactccta gcagggattt gatttcaggc 1800 

cctgggctgg gtgcccagaa cagggcaggg ctggtggatg gaagatcatt ctgggacagt 1860 

ggaaaggggc ccagcctgaa ctgggtctcc ctcccagaag gtgtaagggg tggaaggagt 1920 

ctgacaatat gaagggcaaa cagtggtccc agctcccgct cagggagccc cagagaagga 1980 

agctggggga tgaggagcag aagaggggag aaatgagcca catgaggctg atccggggct 2040 

ccagctgagg gagctgagcc atcaataatt cagcttctga gatcggtttc tggctttaat 2100 

taaccaaggg cctcttcagc catccctgcc aagtctggtg acctccagcc cctgagaggg 2160 

agtgggaagc aaagggtggc attcttgcct ctactccgcc aggaacccgc cttgtgaccc 2220 

tctctgggcc tggagggtgc agtaggcagg cactgaggat gtctcctgag caactcgagc 2280 

cacacaccct cacctccaag gacaagtccc gaggagtgac cttgagttcc cagaactggc 2340 

aatggggagg agggatggga actgtcattt cttgattgcc ttcagtgagt cagacactga 2400 

agtgggcact gtacagcttc tcataaaacc tgcagagcaa ctctatgaga aaggcatcat 2460 

ccaacccatt tcacagatgc ggagacaggc tccgagggag aaggcgcagg gccgaggtca 2520 

cagagcatgt cagtgcctga gcccagagct gtcttcccct gcacctctgc tcctcccact 2580 

gccccgtctc cagggagtac tcacctgttg tgactgtctt cggttcatag gtggggtggt 2640 

ggtgagcagc tgaggagaca ggtgatccat agcaaagcag ccttcccacc ccctgactcc 2700 

catcaagacc ccagccccca ctcaccgcta aacttccgat tcagatgcag ccctgtccct 2760 

actctagaca tccaacatac ttgcaagccc cacacagttc cctccaccag atgtgctctc 2820 

tctccatctc taccttccag gctcattact gatgccacct cctccaggaa gcactccctc 2880 

tcattccctc tgaacttcca aagtccttgc tatgccttaa actagaagaa gagctaacac 2940 

tcccatcccc tccacacacc tgtactaatt ttcagatgag gaaataggca gagagaggag 3000 

gggacttctc agtgagctaa aggcagaact gataccgggc ccaggtatgg gtgagcaaaa 3060 

tccaccgcct gagagaggct gtttcctcct ggggggcagg ctatggctgg ggagacccat 3120 

gcaacttggc ttaggtgcaa gaaccccctt acctggacat gcctctgagc ggttgtagat 3180 

ggagcaacct tccttgacca cctcagattg ggccacacag tgtcctggtg agagaaggtg 3240 

caggggggag gtttggggag aggggccagg gccttggggt tcagagctgg gggcactcaa 3300 

tctggggagg gctcagctcc atacctggct cctctggccg gcactgctcc cacatgcagc 3360 

tggagagatt gcgcgctctg tctccctcca cgcagtgctc acagacctcc agctgtttgc 3420 

aggccccttg gaccgccggc cagatattca ggcggatcag ggctcccctc ccaaagcctc 3480 

gagctccttt acctggtagt gcggtggagt gggaagggag aattcctcga agccaagctg 3540 
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tggcacccgc ccaccctccc acccaacact ccggtggccc tcgcccctct gcacacatct 3600 

gggctctcac actcacccct ggatccccaa gccccagcgc acacctggag cttccacagc 3660 

agaggtctac acacacctag agcctcccca cattcaccca gagcccctcc ctttgcatac 3720 

acccggggtt cccagattca gttggagcca cctcccccag accacctgaa gctccctagg 3780 

cccccacacc tggggcctat atttacacac agggctccac acagagctgg agccccctct 3840 

cccacccctg gaggcctgca cttggccgga aagccccctc ccctcaagcc caggctgctc 3900 

ctggggagac ggctggctga ggtgcaagct ccatccgtcc ccgtcgagcc cctagccaga 3960 

ccctgccgcg agttaccagc cacagccagc tgggcacata ggaggaggca gcaacagccg 4020 

ccacagagcg cagtccgcaa ggcgcggggt cccggagcct ccatgggctc gcggggtggg 4080 

ggtggccggg ggcggtggcc gggatcggtg gacagctgcc gggcgc 4126 

<210> 596 
<211> 341 
<212> DNA 
<213> Homo sapiens 

<400> 596 

tttagtagag atagggtttc actatgttgg tctggctggt ctcaaactcc tgacctcaag 60 

tgatccaccc gccttggcct cccaaagtgc tgggattaca ggtgtgagcc actttgcccc 120 

gcctaaattt ttattttttg tagagatggg ttctcactat actgcccagg ctggtctcaa 180 

actcctggct tcaaacaatc atcccgcctc agtctcccaa agtgctggaa ttacaggtgt 240 

gagccactat acctcctggc cagtttttta ttttttaata tatacagggt ctcactctgt 300 

cacccaggct ggagtgcagt ggcatgatct tggctcactg c 341 

<210> 597 
<211> 142 
<212> DNA 
<213> Homo sapiens 

<400> 597 

agggaggagg gatagcatta ggagatatac ctaacgtaaa tgatgagtta atgggtgcag 60 

cacaccaaca tggcacatgt atacatatgt aacaaacctg cacattgtgc acatatactc 120 

tagaacttag agtataataa aa 142 

<210> 598 

<211> 5889 

<212> DNA 

<213> Homo sapiens 

<400> 598 

ggttgctgtc ttcctcgtcc ctccggccct tcttcctact cagcgtctca cttttggcct 60 

attttctgct ggatctctgg cagcctcgct ttctccctga cgtttcaggt gagtgtttct 120 

tcattcagta agcacccatt gggtacttgc ttggtgcctg attccgctgg gtggggttaa 180 

gtggcgaggg gaacgtacag caggccacct gcctcccagg catggcccac ttcctttttg 240 

tacggaattc cctaaaatga gaattgcccc tttcccaaac taatactact gtaggtgcag 300 

tggttaagat ggtgaagtct gcagccatat tacctgggtt ccattctctg ttctgccatt 360 

taccattgct gtgaccttgt aaaagtagct caatttctgt ttgtctcaat ttccttatct 420 

gtaaaacaca gatgatagaa cttacttcat agagaggtga ggagaggatt aatgagttaa 480 

tacacatgct atacttatga tagtgtttca catataagcc ttataaatag aacgtcagtg 540 

tgtgtttttc ttgttatcca tataatttac ttagtaggtg acaggctgta tgtgagttgc 600 

ttgggagata agcatgagta aaatctagtt cttatgttgg agaagggtaa tgtctaaagg 660 

agacagacgc aaacaaaaac atgtaacagc tcccccattg gcatctgaag aaggtgctct 720 

ggggctgcag aggagggaca actatttctg ccttggggag tgtctccagg gcccctacca 780 

ctcaataatt tgcccccctc ctctctctct agcatcatcc ccagagagcc acactctgac 840 

aggtgagtac aggccacctc ttgaagacaa atgcccacat cctgtcctgc ttgctggtgc 900 

cagtgtcacc atggagcagg gcagtgcccc agcattgtga atgaactggt aacaagccat 960 

gggtttttca tggtcctagg ctcctgtgta ggtgtttgcg tgcctgctcc ttttggggca 1020 

gagggttggg tcctcctggc tttgcttcta agaccaggac agcctccttt tgagaggtgg 1080 

ggtcatttgc ttttttagac taaagatatc tttggcattc tcagagggat ttaagtgtct 1140 

aggaagctgg tatctgaggc ctccaggggt catgtcatgt ctccccagtg agggtgcggg 1200 

gtcaggcgcc cggccgcacc tgctgagtgt gcccgagttg tgcagatacc tggctgagag 1260 
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ctggctcacc ttccagattc acctgcagga gctgctgcag tacaagaggc agaatccagc 1320 

tcaggtaacc tcccctacat catacaacag ttcactacac tgaaggggaa tagaggtggc 1380 

gggggatggg agtggagtga tataaactcc ccgcttggag accctgaaag aagaggcctg 1440 

ggcgccttct tgaggcaaca ttcacagcta atccctggtg caggaggcag tctcacctca 1500 

ggaaataagc ttttgatagc ctgacacttt ctcctgccaa ggcagaccag tgtgaattac 1560 

aggtggtggg cattatgtat tgccagcttc ctgaattggt ggtcttgagc atcctcttaa 1620 

ccaaactgga ccaccccatt ccttgagtac atacccatcc ttctaccagc tgagcagtag 1680 

tttcttaccc agccgagcgg ggcgcatatc tcagcgtgat gttcttagac ttttgctctt 1740 

ctctgcagtt ctgcgttcga gtctgctctg gctgtgctgt gttggctgtg ttgggacact 1800 

atgttccagg gattatgatt tcctacattg tctgtgagta gggtctgtcc ctgccctatt 1860 

taaagccctc tcctttcttc tttccttgga ctgacccaga gggaagacta tgctctctct 1920 

gctcagtttc tgatttgtgg agate tccag ggatgettta gtattagtaa ctgtggaaga 1980 

cttacctggg gaggagggca gtggatgtgg caagaagagg ttggcacagc cttgggaaag 2040 

gaccttgcct accttcagca cttcagttta gggaccagtt gaatgggggc tcttaggaaa 2100 

ttggcgtagg gagctcttag ttcacgttct taacccatag tgttgagtat cctgctgtgg 2160 

cccctggtgg tttatcatga gctgatccag aggatgtaca ctcgcctgga gcccctgctc 2220 

atgcagctgg actacagcat gaaggcagaa gccaatgccc tgcatcacaa acacgacaag 2280 

aggagtaagg ggctgcccta agecaggagg gtgaaagagc gggagggect tggcttgggg 2340 

ttcatgagat gccctggaat tgagatcttc tccagttctt tagctgtgtt tgtctcccct 2400 

cccatcattc atgcttggtc attgetctae tactcttget tttctagagc gtcaggggaa 2460 

gaatgcaccc ccaggaggtg atgagecact ggcagagaca gagagtgaaa gcgaggcaga 2520 

gctggctggc ttctccccag tggtgaggtc cagggaaagc gggggtgtca aatagaaagc 2580 

cagaggaaaa atetttctge tttggagctg ccacctccaa aggaggtgga agecagaggt 2640 

tttgggagag gggatgetec tgataaattg gttctcttat cccctcacat gtcgtgttca 2700 

tcctggttct cctgccaggt ggatgtgaag aaaacagcat tggccttggc cattacagac 2760 

tcagagctgt cagatgagga ggcttctatc ttggagagtg gtggcttctc cgtatcccgg 2820 

gccacaactc cgcagctgac tgatgtctcc gagggtatgg ggagcccttt gctgccctgc 2880 

tccccgccac aatctttgtg ttccctacca tgggtactta ctgtgctctc tggcccactc 2940 

actctctaat tctctcagct cctaaaagac cttaacacct aaggcttggc ccagctttcc 3000 

atgtcttgag ttctcatcct tgaactcatt gtctgttgtg gctaatcaga ccagcagcag 3060 

gcccattctt aattctttgt tctctgcaca gatttggacc ageagagect gccaagtgaa 3120 

ccagaggaga ccctaagccg ggacctaggg gagggagagg agggagagct ggcccctccc 3180 

gaagacctac taggccgtcc tcaagctctg tcaaggcaag ccctggactc ggaggaagag 3240 

gaagaggatg tggcagctaa ggaaaccttg ttgeggctet catcccccct ccactttgtg 3300 

aacacgcact tcaatggggc agggtccccc cagatggagt gaaatgetec cctggaggac 3360 

cagtggagac actgagcccc gagacagtga gtggtggcct cactgctctg cccggcaccc 3420 

tgtcacctcc actttgeett gttggaagtg acccagcccc ctccccttcc attctcccac 3480 

ctgttcccca ggactcaccc cagcccctgc ctgcccctga ggaagaagag gcactcacca 3540 

ctgaggactt tgagttgctg gatcaggggg agctggagca getgaatgea gagctgggct 3600 

tggagecaga gacaccgcca aaaccccctg atgctccacc cctggggccc gacatccatt 3 660 

ctctggtaca gtcagaccaa gaagctcagg ccgtggcaga gccatgagcc ageegttgag 3720 

gaaggagctg caggcacagt agggcttcct ggctaggagt gttgctgttt cctcctttgc 3780 

ctaccactct ggggtggggc agtgtgtggg gaagctggct gtcggatggt agctattcca 3840 

ccctctgcct gcctgcctgc ctgctgtcct gggcatggtg cagtacctgt gectaggatt 3900 

ggttttaaat ttgtaaataa ttttccattt gggttagtgg atgtgaacag ggctagggaa 3960 

gtccttccca cagcctgcgc ttgcctccct gcctcatctc tattctcatt ccactatgcc 4020 

ccaagccctg gtggtctggc cctttctttt tcctcctatc ctcagggacc tgtgctgctc 4080 

tgccctcatg tcccacttgg ttgtttagtt gaggcacttt ataatttttc tcttgtcttg 4140 

tgttcctttc tgctttattt ccctgctgtg tcctgtcctt agcagctcaa ccccatcctt 4200 

tgccagctcc tcctatcccg tgggcactgg ccaagcttta gggaggctcc tggtctggga 4260 

agtaaagagt aaacctgggg cagtgggtca ggccagtagt tacactctta ggtcactgta 4320 

gtctgtgtaa ccttcactgc atccttgccc cattcagccc ggectttcat gatgeaggag 4380 

agcagggatc ccgcagtaca tggcgccagc actggagttg gtgagcatgt gctctctctt 4440 

gagattagga gcttccttac tgctcctctg ggtgatccaa gtgtagtggg accccctact 4500 

agggtcagga agtggacact aacatctgtg caggtgttga cttgaaaaat aaagtgttga 4560 

ttggctagaa ctgctgcctc cctgactgtg agctgccttc cacaccctgc actgeactgt 4620 

gttctctcct cacccttaac ctgcttcact ccagtctgtt ctggctgttt attaccttgt 4680 

tgcaaaacag ggecgaagea aggattacct tgacaaccct agcttctcct tagecatett 4740 

ccttgacagt gtgatctgtt tagtgagatt tagcatgtgt gaataaagta tatgeaggag 4800 

gaaattgett tgtcttccca ateggtagaa attegggace ataaaaattg tgttttacca 4860 

tgtggcctac aaccttaaca ctgetttett aagaagtctt cacccatcta catgetaaca 4920 
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actcactcag cctggattta tctttactgg ggaagccaaa caagcaatag aggaccttta 4980 

cctgtgttag aaatgagttg gagccaagga acactgaaga aatagtatct taacagttac 5040 

tgagtccatt gtatgtgctt ggctctgctc tgagtgattt atatgtatta agatttttcc 5100 

tcacaggtca gatatatact gttactaact tcattttata gacaggttaa gcttcctgaa 5160 

ggccacaggt cccagtaaat tgtggagcca gaacccaaac ccaagaagtt ttggcttcag 5220 

caaatgcatc agacagcccc tgtccattaa tagggcacag gtaggaagat gcacaaggat 5280 

gtgggaacta tagagaacca atctgatgcc ttggcttaac aaagagtgga catggcaagc 5340 

cttcctcttt ggggaagaaa agcccagaac tgagcagatg gcctccttta tgagttcatg 5400 

tcctccgcct tcagctggag gtaccatatg gcgatgctac ctgtctttct gctggaggta 5460 

ccatatggta atgctgcctg gctgtctgct ggaggtacca tatggtaatg ctgcctgtct 5520 

ttctgaggtt gacttttatg ccatgtcttt cctaagtgtg taagaatttt tctgtttgct 5580 

tcacatttga ctgagaatca ttctagggtt tgattgagcc cctgtcctgt gccactaaag 5640 

gaactcgaac ttttcatcac ttagagattt cagaggggaa tggaaaaaca gttctaatca 5700 

ataagcaagc aattcaagaa aaatagaatt aatcaggcaa tgactgcaac atgtcctatc 5760 

tttaatctat tttcttatta agcttggaca ttgacaatag aaccagaagc ttgtagctgg 5820 

atcaaaatat tctccatagg cctggagttt catgagggtc tattcttttg ttgttgttgt 5880 

tttggtttt 5889 

<210> 599 
<211> 191 
<212> DNA 
<213> Homo sapiens 

<400> 599 

gggccgggga tggcgctgag cctgggcctg ggtctgggtc tgagcctagg catgagtgag 60 

gccaccagtg aggcagagga ggagaggcca cggccgagcg gtgggacgcc tggccacgac 120 

gctgtggctg cgctccgcgg ctgggaggcg gtgctggcgg cggcgcagcg gttgctggtg 180 

tgggagaagc c 191 

<210> 600 
<211> 686 
<212> DNA 
<213> Homo sapiens 

<400> 600 

gggttttttt tttttttttt ttgagacgga gtcttgttct gttgcccagg ctggagtgca 60 

atggtgcagt cttggttcac tgcaacctct gcctcccagg ttcaaacaat tctcctgcct 120 

cagccgtcca agtagctggg attacaggtg catgccacga tgcctggcta ttttttgtat 180 

ttttagtaga ggtggggttt caccatgttg gccaggctgg tctcgaactc ctgacctcag 240 

gtgattcacc cacctcggcc tcccaaagtg ctgggattac aggtgtgagc cacggcgccc 300 

agcctcatga gggtctattc tttacattca ccatggtctg atggttgcta catgtttgtc 360 

tatgattttt tttttctatt atcaggtgtc ttggccggtt catgccccac gatgaaaggg 420 

ccagaggttt tcatatgagt aaaagaaaaa agcagaaatg tgaaacctac aattaggcta 480 

aacaaaaatc aactggaaaa gtacaggctg aggggagaag agttggctac atgtttatgt 540 

taggggagga gggagtacat tttagctatg tattcaaaca gctaatagtt taatgttgct 600 

gcttataaac ttaattttag gctgcattaa taaaagtgta gtctccaaaa caagaaatgt 660 

gataattcta gtgtcctgtg cacgaa 686 

<210> 601 
<211> 26166 
<212> DNA 

<213> Homo sapiens 
<400> 601 

gccacgtctg caagtcagcg tttattgctc aagcgtatta aacaaaaatg tagactgaaa 60 

gagacagttc ttttaaaccc catttttccg gattttttaa gcgctctaaa ataagaaaat 120 

aagaaagtgc aagccagcaa aaacgctcca agtgcctaat tctgactctg aaacttgagc 180 

tctctggtct gcccccaaga agacatcagc ccgccccggg tcgtccctgt ggctcccacc 240 

ccattcccag gagcagaccc cgccagcctc aaagctgcag ggaggtgggg gtggcctgca 300 

gacagggtgg ggtctgcatc cggtaccagt gacagcagcc tctcctctcc cacggtggcg 360 

cttgtttggg gctgtggcca aagtgtttgc ccggcccctg actgtgtcct tccggagctg 420 
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ccgaggactg 
cctgcatccc 
tatcttggca 
cccctggcag 
tgatcttgat 
accagtcaat 
gaaagaaaca 
gcattccagt 
cccaggcgcc 
aaagccagtg 
cctgggctgt 
cttctcagtg 
gtgagaattc 
aggagggagg 
ttacaaagcc 
gtggcactcc 
caagctgggc 
catgacagca 
ggtctaagta 
agagcagcgg 
ccgggccagg 
cctgagcctg 
agtacccggg 
gcagggaccc 
gcaggaagag 
ccttcctcgg 
gagctgactg 
taccctagga 
gtgttagaag 
ctgggcgctg 
ctctccaggc 
acaggaaagg 
ggccacaggc 
aaactaaccc 
aaacgcctgc 
tcactcggcc 
gcatgtgcct 
gcaaaaggga 
agccccagct 
agcccctggg 
ccaccctgag 
ccaccttgag 
ccaccctgag 
actctgggcc 
tgccgtccta 
ggcaggttaa 
aaacgagcca 
caaggttctg 
agcggccggt 
gctgcccagg 
gcgggcaggt 
tgagcctctg 
tgttcaggca 
aggctgggtg 
aacttcagga 
gaaagcatta 
ttgtattttt 
ctttgggagg 
atagtgaaac 
tgtagtccca 
gttgtggtga 



cagagagggc 

ccttcaatgg 

gccagagtca 

ggtctgcagg 

ggccccaggc 

acaccccacg 

aaaccattgt 

ttcacagtcg 

ctcgggacga 

ttattctcac 

accctaagac 

aggatcaaga 

ctgggctctg 

ggctcagctt 

ttggatgcag 

aagctgggca 

atgacagtgc 

cccaggacgt 

gggagacagg 

cagcctgtgt 

acagaggcgc 

gggcaggtgc 

acagacccta 

acagtctggt 

gagctggggc 

cctgggaagt 

atctcactga 

agtgaaaacc 

cagtagctca 

ccacccggtt 

tttctccttg 

tgttacttca 

agcgagcagc 

ctgggcctct 

atcccccccc 

ccttgccctg 

gacggctgag 

ggacagcgcc 

gtccaccctg 

ccccagccgt 

ccccagccgc 

ccccagccgt 

cccctgggac 

ttgctggtca 

gcagggagac 

gggaacagca 

agggcaagag 

aggccggtgg 

ccgcccagga 

tgtttgggca 

ttgcccaggg 

cctttcagat 

gtgacagcca 

ggcattgcct 

cctgtgggag 

agtccagaat 

aaaagacatt 

ccaaggcagg 

cccatctcta 

gctactcggg 

gccgtgatcg 



ctggcttgtc 

ttgaaaataa 

ctcctgttcc 

ccgccccatc 

cacagatgca 

gcggtgaaga 

gaaggaaaac 

ccccaactca 

ggagacgcag 

acccaactgt 

cccaaaacag 

caaagtactt 

acctgatctg 

cccccggtca 

ccccaccccc 

tgacagtgcc 

ccgggacgtg 

gggccggcca 

gcaggcgacc 

cgcttgcgcc 

ccagccctgc 

accttgcagg 

agcgtggagg 

caggtccaga 

cagttcaggg 

tgcggccatg 

ggcacagact 

ccacctgcag 

gggcgattag 

tccgcgtctg 

tcgaactcct 

ccagggccac 

cagcgttcag 

gtggaaagag 

gcacacacgg 

gctctggggc 

gaagaaaaca 

agccccagct 

agccccagtt 

ccaccctaag 

ccaccctgag 

ccatcctgag 

cagcagttct 

caacagaaac 

tgaccacagc 

gtagggaggg 

agcgtcctcc 

tgtgctgaac 

tggcctagag 

tcaaggtgac 

gagcaccacc 

gtgggggcag 

gaagttgttg 

tccctgcaga 

tcctcacaaa 

atttcccagg 

ttacggctgg 

cggatcatga 

ctaaaaatag 

aggctgaggc 

cgccactgca 



ccctctagga 
tgattccact 
gcagtgggaa 
cagctgcatc 
tctccgggcc 
gaaaacgttc 
acctgctgga 
ctgtggattt 
ggaaacccac 
ccctgcagcc 
cggagatgga 
gctaggaggt 
tcctgcattt 
ctggggccag 
aagaacactg 
cgggacgtgg 
ggcagcggcc 
gtgtggcggt 
cggcgtggac 
caggctctgg 
tctcactgtc 
gaagtgggta 
gaggaagccc 
gagcccactc 
gtggggctct 
ctcctgctgt 
agccaacatt 
cctggtttgc 
gggttgtgcg 
ggggtcactg 
tcagggggct 
ctgtgcggcg 
aagttttctt 
ataaaagcct 
tttctaccga 
ggggagcggg 
cttaaggcag 
gtccaccctg 
gtccaccctg 
ccccagccat 
ccccaaccgc 
ccccagccgc 
gggatctggg 
ccaggcgggt 
caggaaaaat 
tggagggtgg 
gggacttagc 
acaggagggc 
gtggggcagg 
agtcctagtg 
aggacccgta 
caagacagta 
acaccccagc 
accgcgtccc 
gccctcttct 
gatgggtctg 
gcgcggtggc 
ggtcaggagt 
aaaaattagc 
aggagaattg 
ctccagtctg 



gcagctggga 
tgtcatgaac 
acctgggagg 
tcccaggcct 
tttccagcag 
atgtcttctc 
aagtaattat 
actgccgtca 
tcctggccat 
tggcaggtgg 
gaagaccgct 
agagggcgct 
tgagtaatgg 
ggagacgtgg 
cctgtcacag 
gcagcggcca 
acgtggcact 
ggatccctct 
tggtgtggat 
aggaggggcc 
caggaagagc 
agatcccact 
ggtcagagtg 
cagcccaagc 
caagaggccg 
tacgacacgg 
ggcctatttt 
cctcacacaa 
tttccatgct 
ggccatttgc 
cctggccacc 
ggaaggtgga 
cgaatgtgaa 
tcacttcctc 
gcctgggatg 
gggaggccgc 
aggtaaccag 
agccccagtt 
agccccagcc 
ccaccctgag 
ccaccctgag 
ccaccctgag 
gagaacacct 
ctcatttcag 
ggcagcaaag 
gggcgacggg 
gggcactgga 
agggtgtgca 
actcagagcc 
gtcactgcac 
gctcggtggg 
ggcagggccc 
tttacggagc 
caccagagac 
gtgtctctaa 
ggtttaccca 
tcacgcctgt 
tcgagaccag 
tgggcttggt 
cttgaaccca 
ggtgacagac 



aggtgtcttg 480 

accatgaagg 540 

gtcctcaaac 600 

cctggttctt 660 

cccatggggg 720 

agatcagaag 780 

caaagtaaca 840 

gctgggagga 900 

ggcactaccc 960 

gcagtgaccg 1020 

gccctggggc 1080 

ggg tgaacc t 1140 

gagcaaacac 1200 

tccagccggt 1260 

cagcggccac 1320 

tgtggcactc 1380 

ccaagctggg 1440 

agaatgactg 1500 

ttcacctggg 1560 

cctgcggctc 1620 

cgctgggcag 1680 

gggctgactg 1740 

ggcaggagac 1800 

catgagagag 1860 

agccgggggc 1920 

gagccactcg 1980 

aaaattaaac 2040 

gggaaaagag 2100 

tggggccagg 2160 

accacgacgg 2220 

tgcaaccgag 2280 

cacgccactc 2340 

ggaagtgggg 2400 

agtgtcccag 2460 

gagcagccct 2520 

ccaaggttgg 2580 

atgcctccag 2640 

gtccaccctg 2700 

acccaccctg 2760 

ccccagccgt 2820 

cccaagccac 2880 

ccccagccgc 2940 

catgcgtctg 3000 

gcccagggac 3 060 

caactgagga 3120 

gtcagctgag 3180 

ggctctgcac 3240 

gggcaagagc 3300 

ccgctgagtg 3360 

ctgagggtca 3420 

gggaagacgc 3480 

gggggaacat 3540 

cccatgtgcc 3600 

cagggaagga 3660 

gagcaaatct 3720 

ggctttgtgt 3780 

aattccagca 3840 

tctaaccaac 3900 

ggcgggtgcc 3960 

ggaggcggag 4020 

tgagactcct 4080 
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tctcaaaaaa aaaaaaaaaa aagaaaagaa agacatttta cttcttcggg gtttttctgg 4140 

ggtgttttta aaaatacaaa gtggcacaat gaatcttcga gggccacgtc gtgtggccct 4200 

cggaggccct caccaggaca ggagccccca ccgcaggcct ctcacctccc cagcgggtgc 4260 

tcctgtgggc cactcacctc cccagcaccc ctgaaaccca caggcctctc acctccctgg 4320 

ctgagcaccc ctgtgggcca ctcacctccc cagtgccccc accccacagg cttctcacct 4380 

ccctggctgg gccccccatg ggccactcac ctccccagca cctgcaccgc aggccactca 4440 

cctccccggc tgggcacccc tgtgggctac tcacctcccc agcaccccct ccccctgcag 4500 

gccactcacc tccccggctg ggcacccctg tgggccactc acctccccag cttcccccca 4560 

ccccacaggc ttctcacctc cctggctggg cacccccgtg ggccactcac ctccccagtg 4620 

ccccctcccc ctgcaggcca ctcacctccc cagccaggcg ctcctgcggg ccgttctggg 4680 

tgtcccagga tgcaccctgc agccttgcac tgaccttgaa gcgcacgcac tggatggcgg 4740 

tgcccgtgtt gaggccggcc tgcagcagga gcactagcag cagcagcagc agcagcacat 4800 

cgtaggatgg ctgcaggcgg aaggaggtgt gagcagagtg gcccagccgc cctccagccc 4860 

aggagaccaa agctccctcg tgcccacggc cttcacaaag agcaggtgcc agggctcgag 4920 

tcggagcccc agtaacccaa acgcattgct atttcaacaa cctgatgata atcattctag 4980 

ctcaattaat gaggtcaatt ctgctttaaa ttccatcagc ttgtgttcaa catacaggaa 5040 

taaaagtgac cgagccactg ccaggccccg tggacagtgc tggctgctag tggtcaagtc 5100 

gctccacccc tcctgctcca acacgcgcaa ggctcagtcc cagagccaga gagaatgtcc 5160 

gcgggaggca gaagcgctga ccgttccgtg tcagcaggat cctgagcgga ggagacaggg 5220 

ggcagccgtc ctggggggat ggggtgcaca gacccccaac agcttcaccc ctgagcacca 5280 

gctgctagga aggaaggggc aggaggcagc gcggtgagct cggcagggaa gaggatggag 5340 

ggagcggccg aggatgcagg gggccagaga gtgaccacgg agcacccaaa accctctgtg 5400 

cccatggata gctttcccgt gttggctcaa gaaaaacaaa gcaactaaat gttaacacac 5460 

aaaagagcca aacatctatc caatgtcact gctacaaaaa gtaagagaag aaacagaacg 5520 

aacagctctt ctcaagttaa cataaaactt ttttttgcga cagtctccct ctgtcaccca 5580 

ggctggcgtg cagtggcgcg atctcggctc gctgcaacct ctacatccca ggttcaagcg 5640 

attctcctgc ctcaacctcg ccagtagctg ggattacagg tacccgccac cacgcccagc 5700 

taattttcgt agttttagta gagacggggt ttcaccatgt tagccaggct ggtctcctaa 5760 

cctcaggtga tctgcctgct tcggcctccc aaactgctgg gattacaggc atgtgccact 5820 

acgcccggcc tggattttaa gtaattatta gaaagtctgg tacttgtgtt cccagctgga 5880 

gtttaccctg gaggatggta aaggcatggg ccccatgtgg tctttgctct tcggttgttg 5940 

agacaccatt tattgcaaag cctatgcttt cctcagtctc aaaaaaaaaa aatccagagc 6000 

cattaaagaa aatattaatc aacttaacta tttaaaaaaa tagctttcac ataacaaaaa 6060 

tcacaacaag cagaacaaga aatagccgcc aggctgagac caccatttgt atcctccatc 6120 

acagatggag ctggtctccc ccggcaccag ggttagcgga aaatggtcag gagacatgag 6180 

cagagtccac tgaaaaagca atgcagacgt cctcatgctt tgtaagaggt cctcgtcctc 6240 

acccacattg agacggaggt gctcccaccc gtgtcaccga actccagcgt gacaccctct 6300 

gtggggggct gtggagaagt gagcaccttc acgccattgg caggagggaa ggtggacggc 6360 

ccccattgag gaggtcgccg agatttatca gcatgacagc tgcacctacc ctctacccag 6420 

cagtctctct cctagctgga tatcctaaag atgcgctgag gattcaaaag gaagatgcgt 6480 

gcaggttgtt cacggaggag ttatttatag tagcagacgg ttggaaataa ctcagtttcc t 6540 

atcactaggg actggctggg ttgaccgggc agtcaggagc gatgaaggaa ttcagaagag 6600 

ctctgatccc cttgggtatc gagaaaacac caggataaat tattaagttt aaaaaaacaa 6660 

gcaaggggct gggcacggtg gcagaagcct cctcatccca gcactttggg aggccaaggt 6720 

gggcggatcg cttgagccca ggagctcatg accagctcag gcgacagagt gagaccctgt 6780 

ctctgaaaaa taaaataagt aaatacataa ttaaaaagcg aggtacatgg ctgggcacag 6840 

tggctcacgt ctgtaatccc agcacttcgg gaggccaagg tgggcggatc gcttcagccc 6900 

aggagctcat gaccagcccg ggcaacagag tgagaccctg tctctgaaaa ataaaataaa 6960 

taaatacata actaaaaagc gaggtatatg gctgggcacg gtggctcacg tctgtaatcc 7020 

tagcactttg ggaggccgag acaggcagat cacttaaggt caggagttca agaccagcct 7080 

ggccaacata gtgaaaccct gtctctgcta aaagtacaaa aattagccgg gtgtgttggc 7140 

atatgcctgt aatcccagat gctccggagg ctgaggcagg agaatcactt gaacccagga 7200 

ggcagaggtt gcagtgagcc aagactgtgc cactgcactc cagcctggga gacagagtga 7260 

gactctatct taaaaaaaaa aaaatggcaa ggtacagaat agtgattcta gtatacaagc 7320 

tattttgtaa gaaagaaaaa aaaattatca tctggttgcc cataagagga gcaggtcagt 7380 

caggcatgag accacagctt ttctgagctc gtcttttata gtttggcttt tgaaccacat 7440 

aaatatttta tatattcaaa aaataccatt ttaaaaagaa aaaaagcacc aagccctaaa 7500 

atggaataca accagaggcc attaaactta atggggtatc gaacccagag gataacccca 7560 

cacagaaagg ggccgacttc gatggcgtgt ggccgtcccc tgctagcaga atgcatgctg 7620 

aagataaaag ggaagcctca ggctcactca ggagatggcc cctgggtggt aatattgtct 7680 

cttgtaatcg tgaacaatta tatgcatatt gtaagacaac atgaagtagg agatatatta 7740 
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aatgtcacag gaaacaaaat aattggagta ggagtgggtg aagcaaacac aagagaagtg 7800 

aagaaaaaac cctgggccgg gtgttgtagc tcactcctgt aatcccagca cctgtgggag 7860 

gccgaggacg ctatccccct tcagacagtg actccatgcc cccatcagac agtgactcca 7920 

tcccccatca gacagtgact ccattacccc attagacagt gactccatcc cccgtcagac 7980 

agtgactcca tcccccgtca gacagtgact ccatcccccg tcaggcagtg actctatccc 8040 

ctgtcaggca gtgactccat cccccgtcag gcagtgactc catcccccca tcaggcagtg 8100 

actctatccc ctgtcaggca gtgactccat ccccctgtca ggcagtgact ccatcccccc 8160 

atcaggcagt gactccatcc ccccgtcagg cagtgactcc atcccccgtc aggcagtgac 8220 

tccatccccc atcagtgact ccatccccca tcagacagtg actccatccc ccatcagaca 8280 

gtgactccat cccccatcag gcagtgaccc catcctccat cagacagtga ctccatccat 8340 

ccatcagaca gtgactccat ccatccatca gacagtgact ccatccccca tcagacagtg 8400 

actccatccc ccatcagaca gtgactccat cccccatcag acagtgactc catcccccca 8460 

tcaggcagtg actccattcc ccatcagcac acgcatgttg acctggagca aacagcttaa 8520 

tacaggtctt ctaatgattt gggtgcaaaa ccacttatac tggaaaaaaa aaaacaaaaa 8580 

aacaaaaaaa cactaggtat ctaagaaata catctagcta cacgtgcctg agcctcagaa 8640 

gtcgtgttgg agataagctc ctactgggtc agctcaaagt aacaggactt tgaattctcc 8700 

ttaaattcac ctgttaaact caatgcagtt ctgatgcaaa tccgaaaagc atgtctcata 8760 

aacccaggac agattctaaa atgggctcag ggacagagtg ggtcccaggt ccctgtgaag 8820 

caggtggtgg ccccagatac agggcggaat cgaggccccg aggccctggg gccaggctgg 8880 

cattggggag cacggtcacc gctgcccagg aacagcggag gaggtgcctc ctggagctgt 8940 

ccctggagcc agcgccccca gagtcccagg caaagccagc aagggccagg ccaacacgca 9000 

gagcaccaca tgtctggggg gctctgaaat aaaatacaac tcacttttat ggctggcggg 9060 

taatccttaa acatctccac gattctcatg atgctgaatg acagatatcc agaggctgtg 9120 

aacagcagcg gagacgtgag gctgcttatg gcaatcagga cctccacctg gaaaaaaagc 9180 

gcgggttgca ggaccacggc cggagcctgg gagcccagga cagcgggcgt ggccagggct 9240 

gggggggcgg gcatggccgg gcactcatgg gcatagtccc aagctggttc cctgcatgac 9300 

tcctgctgaa acctctgcct tttctttttt ctttttcttt tttttttttt ttttttttga 9360 

gacaagtctc gctgtgtcgc caggctggaa tgcagtggca tgatcttggc tcactgtaac 9420 

ctccgcctcc cgggttcaag tgattctcct gcctcagcct cccgagtagc tgggattaca 9480 

ggcgcccgcc accacaccca gctaattttt gtatttttag tagagaccgt gttttgccac 9540 

gttggtcagg ctggtcttga actcctgacc tcaggtgatc cacccacctc ggcctcccaa 9600 

agtgctgtga ttacaggcat gagccaccgc gcccggcctg cctttctttc ttttttgaga- 9660 

ggcagcctag ctgtgtagcc caggcaggag tgcggtggca caatctctga tcactgcaac 9720 

ctctgcctcc tgggttcaag caattctccc gcctcggcct cttgagtagc tgggattaca 9780 

gacatgcgcc accacacctg gctaattgtt ttgtattttt agtagagaca gggtttcacc 9840 

atattgtcag gctggtcttc aactcctgac cttgtgatcc acccaccttg gcctcccaaa 9900 

gtgctgagat tacaggcatg agcagcctgc cttttcattc acagcacagt gccacaagtt 9960 

ttgaaatact taccatctag gcaaggtggc tcacatctgt aatccaagca ctttgggagg 10020 

ccgaggaggg cggatcacct gaggtcacga gttcgagacc agcctggcca acatggtgaa 10080 

acccccatct ctactaaaga tacaaaaact aagcgggtgt ggtggcgcat gcctgtagtc 10140 

ccagctactt gggaggctga ggcaggagaa tcgcttgaac ccgggaggca gaggttacaa 10200 

tgagctgaga tcgtaccact gcactccagc ctgggtgaca gagcaagact ctgtctcaaa 10260 

caaacaaaac caagaaacaa aaaacattta cctgcaattc tccactcttg aaaaaatagc 10320 

ccttccgcat gggtcacctc cttcactgct ctgaagtctg tccttttaac ccaccccaaa 10380 

tcttcctgtt tgacccatca agtctcagag acagccttca ctcatgccaa gcatgggcag 10440 

tcaccttgca aagcactcag ggtccacatt ttggcagcag tcattaacga tcgaccagtt 10500 

caattgtttg cggtcagttt gtggcagttt cggccgggcg tggtggctca caccggtaat 10560 

cccagcactt tgggaggttg aggtgagtgg atcacctgag gtcaggagtt cgagaccagc 10620 

ctggccaaca tggtgaaacc ccgtctctac taaaaataca aaaattagcc gggcatggtg 10680 

gcaggcgcct ataatcccag ctactcagga ggctgaggca agagaatcac ttgaacccgg 10740 

gaggcggagg ttgcagtgag ccaagatcgt gccattgcac tccagcctgg gggacaagag 10800 

cgagactgca tctcaaaaaa gaaaaaaaaa caatcgacca gccttaggag gtgcagatga 10860 

tgctcagaca ctggtctcct tccaagggtt aggcacagag agcgcttcca gtgtctacac 10920 

caccttgttc aggtctgcct gtccctgaga acagtccctg ctgagcgggc cagcctccac 10980 

ctcccacaac cccaccttcc tcattgtgca ggccctgcag cccagtcact gcgctctctt 11040 

gggctggagc ctggtcacat ggcaccaagg gagggctagg ccaggccctg gccccgctcg 11100 

gtgggtccct ggcacccagg gctgaggggt tcacccacca gacacttgct gggacactct 11160 

gctgccacat gactggcaat ggcacttgga aagcactaag agaaaagaaa aaggaaagat 11220 

tttagaggaa atagtcgaag aagtcgaatt ccaaacatgt gccctcccac cagtgacccc 11280 

tccatgcagg ctgcctggct ggcttgctgt ggggggcagg ggggatggtg cagtgcccaa 11340 

aggcgcagct ggggcaggga cgggtccgca gggaagcagg ggtagcctct cagaggtttt 11400 
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gtgtgctctc 

agcagtggag 

gctctctcca 

cagggcacac 

cgatgtggtc 

tttaataatt 

cccaggctgg 

agcctcctga 

tttagtagag 

tgatccgccc 

gccctggtca 

gccattgagc 

tcacctccgt 

gtgtgggaca 

ccccagggtg 

tctgtaggtt 

tcccacttat 

tgctcttcat 

gctggggagg 

ggctccgagc 

tgcaaaggcc 

tcaatacagg 

cagtgagggc 

catgggaggg 

gggccaggaa 

ggtccaggcc 

tgcagggcct 

ctcaccacgc 

gttgggagtg 

ctctccggga 

gaggggcctc 

ttcattctca 

ccctgcgttt 

cagacccgct 

accccttccg 

tgagcgcctc 

tactcgcccc 

ggcgtgggcc 

cttctccgca 

tccacctggc 

actgctgccc 

ataagagagc 

cgtcaacacc 

gtccggaggc 

gtggaggggt 

tgggggccga 

ggcaaaaccc 

tgtaatccca 

gaggttgtag 

ccatctcaaa 

gcatgtgtca 

agaaatatgc 

attatataga 

taaaaataat 

aaaagcagtt 

ggccagcgag 

cccagccacg 

tactcacggc 

catccacgtt 

cgacctggag 

cagaatgcac 



tgttcctcaa 
ggggcatctg 
gctctcccag 
tcttgcacac 
catgcccaga 
tttctttctt 
agtacggtgg 
gtagctggga 
acggggtttc 
gcctcggcct 
tgttatttct 
tcgtagtctc 
tacctggtcg 
ggatttcctc 
tggactgccc 
cctctgttgg 
tgtgaggagc 
cctcttcatc 
gggctgcatc 
aggggagcct 
ccaagccagc 
ggtctctcct 
acaggaaggg 
cgggagtttc 
gctcagccat 
tgcagcccag 
gggcccctct 
atgcctcagc 
tgggtggatt 
agatggagag 
tgtgatgagg 
actttttgga 
cagaccccag 
tgggtgggca 

ggggtgtggc 

agcagcccac 
cacagggagc 
ccttctcggc 
gtccacccgg 
catactgctc 
aggaggagcc 
aggggcaaaa 
ggtgtgcagg 
aggtgggtgg 
tgctctagac 
ggcaggcaga 
ggtctctact 
gctacttggg 
tgaggtgata 
aaaataaata 
ggtgctgtgt 
tgcattgtat 
ataataaaca 
tggtattttt 
aatactttga 
gaccctctgt 
caggatctga 
cactaggatc 
cagggcaatg 
gggacaggac 
tgtgcagaca 



ccattggttt 

aggccccaaa 

atagcctgat 

acccaaagca 

ttcttccagt 

cttctccttt 

cgcgatcttg 

ttataggcac 

gccttgttgg 

cccaaagtgc 

agagctgttt 

cttgtgggtc 

cttctggggg 

atgatgagcc 

cttcctgatg 

atgggcacct 

ctgccttagt 

tccccgaagg 

agccaggggc 

gggcagcagg 

tgctgaggtg 

gggcttggtg 

atggtggctt 

cccgttcacc 

aggggctggg 

cccccaactc 

gccgcaccca 

gctctcctct 

taggggcgcc 

agcagagcca 

gctgactgca 

ggtatgtggt 

ggtcagttgt 

cccatgtgag 

tctatcccag 

taggtcgggc 

tgaaacacag 

agtccacccg 

ccataccatt 

ccaggcaggc 

tgtagctgat 

tctaaaagtc 

ccactacgcg 

tggctctggg 

tcactgttct 

tcacttgagg 

aaaaatacaa 

aggctgaggt 

ttgagccact 

aataaaataa 

gacacatctg 

gaaacacata 

tctacagcct 

ttcaggaaaa 

ctcatttaat 

ggtgactctc 

gcagtgtggc 

caaaagaacg 

atccccccga 

agcatcggct 

tggaccccca 



aaaattttaa 

atcaccagag 

gcctgaacac 

ggaccacgct 

tgtcctcatc 

ttttttttaa 

actcactgca 

ccaccactac 

ccaggctggt 

tgggattaca 

gtttgtttgc 

cctgttaatc 

gtgcagccca 

ttggggagcc 

acgctgactt 

cttccttttt 

tccgcacaat 

tgtccaggtg 

tgtggtgaga 

gagaagactg 

ggtgtggccc 

cttctgaaag 

cagggtggcc 

cagcatggcc 

gcagctcaat 

cagccccgag 

ggccagactc 

cctcctgggg 

cagggggcag 

cccccgaggc 

gggacaggga 

ctctgtcctc 

acacacctgc 

ttctctgtct 

ctcggtcctg 

actggtgccc 

ctgctgccaa 

gccacaccgt 

cccgggcagg 

atgggcctcg 

gctgggtgct 

cacacagcat 

gcaaggtgct 

tgggctacag 

gagggaacac 

tcaggagttc 

aaaaattagc 

gggcaaatta 

gcactccagc 

ataaaacaat 

ctgcagtaac 

gtgtaatata 

gacattcata 

ataatataaa 

ccttaactcc 

tgtctgaatg 

ccagagcggc 

tcactgagag 

ggacggcaga 

cataaggaag 

ggagcagggt 



gcctgtagaa 
tccttgacat 
ctgagagtgc 
cgagagactc 
atgttctttc 
gatggagtct 
acctccacct 
accaggctaa 
ctcgaactcc 
ggcatgagcc 
tcctgatccg 

ggggagtccc 

tcagctgcag 
tcggaggaga 
gactcactgg 
ataaccaaga 
gtaggcagag 
ccagcacccc 
gggtgggcag 
gggagctgag 
tggccaggga 
ggccttctag 
tattcagggc 
actccggccc 
caagggccct 
cacctgcctc 
aggcaggtgg 
tctcctgccc 
cacctgatgg 
aggaggggtc 
agggggtagg 
cctcagcttc 
aggcgtcatt 
cagtagctgg 
cctgggtctc 
tgggggcatt 
tgggatgtgt 
tcccgggcag 
cgtgggcccc 
gcagcttcag 
caccattcct 
gagacaaacc 
gagaacctcg 
gctgtgaggg 
acctgtaatc 
aagaccagcc 
tgggcatggt 
cttgaacccg 
ctgggtggca 
gcccatgtat 
atttaatgta 
aatgcattat 
tgttaaaaag 
ataatcacac 
ccagttcttc 
caccaagact 
ggcgggcggg 
gtgtgggcct 
gatgcctgcg 
tggagacacc 
ccagtgggct 



aagctgcaag 

tcatatctgt 

tggctgagcc 

cacgtggggc 

tttttaattt 

tgctctgtca 

cccaggttca 

tgtttgtatt 

tgacctcagg 

accgcacctg 

gattcggtct 

tcagcctctg 

tcgggcctcc 

ctctgcagtg 

gttaggggga 

aatgcccttt 

ggaggatgca 

agcccggcct 

gacagagatg 

gggctgccag 

ggctgaggtg 

ctgtaagcag 

cccagtgagc 

ctgccctaga 

gccctggaag 

cagcccatgg 

agctgggtcc 

tgaagagagg 

gcagggggtg 

tgcgggatgg 

gcaagtgcct 

gggatgtaat 

ctctctgtca 

gcctggtcct 

tttccctctc 

gctctgtgcc 

ccacctggca 

gcgtgggccc 

ttctcggcag 

gaagagcccc 

gctcatgtac 

aggggctggc 

tggggtctgc 

gatctggggt 

ccagcacttt 

tggccaacat 

ggtccacgcc 

gggaggggca 

gagtgagact 

tgagcatgtg 

aagtaataac 

atattagaaa 

attagaataa 

aaactaatat 

tgaagttggg 

gagctcgggg 

ccagaggggt 

gaaactgagt 

attacctcga 

cccagattcc 

ggccccagat 



11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 
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ctaaccgtgg 

gccctcagct 

ccactgctgc 

aaccaagacc 

cctggacccc 

gacacaccca 

ccctttccgt 

gcataatctg 

gtctctgagc 

aaacaaagct 

agaaccctgc 

aaccagcacc 

ccgcaaccag 

gggctgggga 

aaacacagag 

ggggccggaa 

cacaaataaa 

tgggaggccg 

gtaaaccccg 

aatcccagct 

gcagtgaacc 

aaaaaaaagg 

tgctaaataa 

gaggctgtcc 

acaaggccaa 

attagaatat 

gggaaaagga 

gcaaaatcag 

aataactctc 

aagaacaaaa 

agttgaaagt 

ctttggaagg 

acatgccgaa 

cctgtagtcc 

aggtggcggt 

cgtctcaaaa 

aacaaactcc 

ttaccaatga 

gtgtagacag 

gtcacccccg 

atttcaggac 

agccctacga 

cgctggcatc 

cgccttggag 

catatgaaaa 

gagtcaggct 

gccgccaacc 

gagaggggtc 

gagacccacc 

agctgcaccc 

cgtggccgcc 

aggggcacac 

accggaccac 

tccagggcca 

gggtctttta 

tggatttggc 

aagccttcac 

ggttttttgg 

ttatatgtgt 

caattttttt 

tggcgcaatc 



aggagcccac 

gcggcttttc 

ttttgtccac 

ctcagactcc 

atcgaaggga 

cccgtgccat 

ggcaatgatc 

ccccttaatt 

tgctgctcga 

gctgtctgtc 

tggcctaagc 

gccagactca 

caccgccaga 

agaaacacac 

ctgtggccag 

gggacctggg 

gctgaattcc 

aggcagacgg 

tctctactaa 

actcgggagg 

aagatcacac 

aaaaaaaaaa 

ggggaagtga 

cattgttgga 

ggcggaggga 

gaagggcaac 

aggacggagc 

ggaaaatgga 

agaggaaatg 

tacagttacg 

aagaaggaga 

cctgggtggg 

accctgcctc 

cagctcctca 

gage t gaga t 

aaaaaaaaaa 

ccctaaaaat 

taccaataac 

ccaactcttc 

acagagtatg 

ccgagcagga 

agaaacagaa 

cggccgccgt 

gcgcccgtgg 

ttagcagatg 

tagggggagg 

cctcacccag 

tcatgggaat 

tcgctcaccc 

accttctttt 

atgagcagct 

acgcttcagc 

ctcacgggca 

gctgtgaggc 

gtttttaaaa 

cttgggctct 

ggaatgttta 

agggtaatga 

gttaatattc 

tttttttttg 

tcggctcacc 



ctggatggac 

acgtccctcc 

agaaactgag 

cttccctgtc 

ctegggaaaa 

gtgggtttgc 

caaccacctg 

agcatatact 

ggcacctgcc 

acctccagct 

cctggttgtg 

gggaggggee 

ctcagggagg 

gcccgaaaac 

caggaaagee 

cagccaggcc 

cagccaggca 

atcgtgaggt 

aaataacaaa 

ctgaggcggg 

cactgcactc 

aagaagcaga 

aggcactggg 

gtaaaaaggc 

agggggacag 

ccccaaaaga 

ctctgccaat 

aaactcaaaa 

ggctacattc 

tgetgettat 

aagaggccag 

tggatcactt 

tactaaaaat 

ggaggctgag 

agcaccactg 

aattcctegg 

tacaggacac 

agactagagt 

gccaactcgg 

agagaaaaga 

aatggcactt 

ctgtcacccc 

gggcagtggg 

atctttgggc 

tggtgtctcc 

ggagggtaca 

ctccagagcc 

gccctggagc 

tggggtgatg 

tettgeagtc 

ccagcaggag 

accgggcccg 

ggctgcgcag 

actgtgggct 

ccattcttag 

ggtctaccga 

gagtagcttc 

aacgactgea 

atagaagcat 

agaeggaggt 

gcaacctctg 



ccccagcctc 

tggagtaaaa 

ggtattgacc 

tcacagtgtg 

caggcctggg 

cattgccacg 

gaegttagea 

taaaagtggg 

ggttggagag 

cgcccttgaa 

gactcatctg 

gcaaccagca 

ggccgcaacc 

ctgctcccgt 

aggcataagg 

ttgcgtccag 

cggcggctca 

caggagtttg 

aattagctgg 

agaatcgett 

tagectggge 

aeggaaatgg 

aaacacacac 

aaaatgtcac 

accccgcatg 

atactgaaaa 

tcaaaagaag 

gatggtaaaa 

aaaaagctag 

aacaattata 

gtgcggtggc 

gaggtcagga 

aaaataatta 

gcaggagaat 

caccccagcc 

ggacaacagt 

cagcagcacc 

ctccacagat 

aatcgaaacg 

gaaagaaggg 

cgtccagaat 

gggtgcacgg 

tcaggtcagc 

ttgaggaaat 

tggtgatccc 

gcatctgtct 

cctcctggcc 

agcccagatc 

cctctgcacc 

ctcctcgctg 

cagcaggatg 

cctttctcac 

gactctcagc 

gcacactctc 

tgeaaagget 

ccccattctg 

atgeatgatt 

ggccctgacc 

acattttaaa 

tegctcttgt 

cctcctgggt 



acctggccca 
tagecaaaga 
tgacagtggt 
geaggeggeg 
gcagaagtag 
gcaacactgg 
ccccttttct 
tataaatagg 
ccctgctata 
ttctttcctg 
cccggcacca 
ccgccagact 
agcaccgcca 
gaaggatctc 
agtgggccag 
aaggcacaaa 
tgcctgtaat 
agtacagect 
gcgtggtggc 
caactcagga 
aacagcagga 
aacctaagag 
agacacagtc 
taaatgtcac 
gtctgtttat 
atttcccatc 
gcagggaagg 
ataagtagaa 
ggagagagaa 
ccaaagtaca 
tcacacctgt 
gttcgagacc 
geegggegtg 
tacttgaacc 
tggctgatgg 
gtcaactcac 
tttcgegata 
gaggegcacg 
tgacgtttaa 
aagttttctt 
gttggcgctg 
gcacagggac 
ctgccgtgta 
gttcacacac 
ctggggacga 
tcacctccct 

ggggtttctg 

ggccctccga 
ccctgccctg 
gaccgtgcag 
aggttgaagt 
gccaccgcgc 
cacggtcccc 
ctgcctcaca 
gtacaaacac 
aacgctcctt 
gtcaagcact 
gtattggcag 
aagtcagtgt 
tgcccaggct 
tcaaatgatt 



gaccccctct 

ggtgtttccc 

gagggcagaa 

gtgetcagae 

cttcccatag 

acgtcaccac 

agaaatttct 

aggcagagct 

ctgccactgc 

gaeaaageca 

tcagggccgc 

cagggagggg 

gactcaggga 

tgtgagtcgc 

caagggcttg 

actccagatg 

cctagcactt 

ggccaacatg 

gggcacctgt 

ggcagaggtt 

ctccatctca 

aggactgaaa 

aagcacacag 

aagctagaca 

caaggagegt 

caaaattaga 

agggaaaaag 

atttatcagt 

tgacaagata 

aggcaagaaa 

aatcccagca 

agcctggcca 

gtggcatgtg 

tgggaggcag 

agtgagactc 

etaeggcagt 

aggaaggtta 

ccgccatgcg 

tccagcctca 

actgagtaag 

tcagacatgg 

tcagcactgc 

gatgegagae 

attgtccttg 

gcactggact 

gtagccagcg 

agacccagga 

gggtgacgct 

eggggtcaga 

cgatgatcac 

tgatctgeca 

ttcagcgtcc 

acttttccca 

tgggcctttg 

aggecaggge 

gttcccacaa 

gagaacataa 

tgacatgaat 

agctgtctga 

ggagagcaaa 

gtcctgcctc 



15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
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agcctcccga 
tttagtagag 
gttccgccca 
cctgacaatt 
acagagcctc 
agaggtgggg 
cggctcacgc 
aggagttcca 
atcagctggg 
gaattgcttg 
agtctgggcg 
aaaagcataa 
tctgtatttc 
cagggccagc 
tgacccaggg 
cgcggtggct 
aggtcaggag 
caaaaattag 
caggagaatt 
actccagcct 
aggcagtgtg 
gagtggccag 
atgtcacaca 
gggatttttg 
aggctggagt 
tttttgttta 
tgtcactgtt 
gcagaatcac 
tttagtcaaa 
ttgctgacac 
tgaactcacg 
cgtagtggtc 
aaggcaggag 
acttcaggcc 
gctcagcccg 

ggggggcaac 

agcacacatg 
gcaagctaga 
ggggatgcac 
actctgaaat 
tgttgccagt 
agttttaatt 
ggctcaaatg 
tgcgcccaac 
atgtgctaaa 
gaatgtaact 
aagtatcagc 
gggaggatct 
gagtgagacc 
gaaagaaaga 
tgaggcgggt 
atctctacta 
gctactcagg 
gcagagacgg 
aaagaaagaa 
ctgcatgtga 
ccacagggcc 
acagggaggt 
gctcctgcag 
cacagcagtg 
gctagggcca 



gtaagctggg 

aggggtttct 

cctcggcctc 

tttaaaatac 

gaaatatatg 

aaggacttga 

ctgtgatccc 

gaccagcctg 

tacggtggcg 

aacccgagag 

acagagcgag 

ggagaactct 

tggtgctttg 

cgacagctac 

accacacccc 

catacctgta 

tacaaggcca 

ccgagtgtgg 

gcttgaaccc 

gggcgacaga 

ggtgttccca 

gacatactca 

aagcaggagg 

tctttttttt 

gcagtggcgc 

gaggtgcatt 

aatttttaag 

gagacttcaa 

ctcacgagct 

cattcgtggg 

tttattgctg 

acagcaaacg 

gggttttcct 

attcactaaa 

ctctcccctc 

ctcgggggcc 

ggcacactgt 

ctcaccacat 

tggaatgaaa 

actctaacat 

ccatgcaaca 

ttttcttgag 

atctgttcac 

ccccaggtac 

aagatatttg 

cacgcaggca 

cgggtgcagt 

cttgagccca 

ttgtctcaaa 

ggaggtctgg 

ggatcagagt 

aaaatacaaa 

aggctgaggc 

cgccattgca 

aaaaagaaag 

gggaggcaga 

acaccaaggc 

ggatggagtc 

ccatgggacc 

ctgggtgagg 

ggacagccag 



gttacaggca 

ccatgttggt 

ccaaagtgct 

atttaaaaaa 

atgtcataac 

aacacttttc 

agcactttgg 

accaacatgg 

cacgcctgta 

.gcggaggttg 

actgtctcaa 

aaaattcaga 

acctctggct 

agtgagtaac 

atctgctctc 

atcccagcac 

gcctggccaa 

tggcacacgc 

gggaggtaga 

gc gaga t tec 

ctgggtcctc 

agcctcctcc 

ccctccttgc 

tttttttttt 

catctcggct 

gaacaactgt 

gactgttgga 

taaagacaat 

gtgaataaca 

agtggggctg 

atgggttcag 

tggaaacaaa 

tctttgaata 

aattatcctg 

tgtgcggcaa 

cctctcggtg 

ctgtcagccc 

tggcgttcct 

ccggaatccc 

ttgcgcttcc 

cagtgggttg 

atggggtctc 

ctcagcttcc 

ttcaaaataa 

ttaatgaagg 

caagacctgg 

ggctcacacc 

ggagttcgag 

aaaagaaaga 

tgcagtggct 

caggagtttg 

aaaaattagc 

aggagaattg 

ctccagcctg 

aaagaaagag 

aaggagtccc 

taggagtcca 

gggccagccc 

ccgggccctg 

aggcagaaga 

gctcagctca 



cctaccagca 

caggctggtc 

gagattacag 

gaaagagtgc 

agagecacag 

ttagaaactg 

gggaccgagg 

tgaaacccct 

atcccagcta 

ccgtgagccg 

aaaaaaaaaa 

aaagtgtgac 

ccttcctgat 

ttgcctctga 

cgtcttaata 

tatgggaggc 

catggtgaaa 

ctgtaatccc 

ggttgcagtg 

ttctcaaaaa 

ctgtgtctgc 

agctttcccc 

agtatctegg 

tttttgagac 

cccacaacct 

cttacacaga 

aaagacattg 

tattttcttt 

tttaccacct 

tgggtgtcag 

gactagtttg 

caatatctga 

ataaaagaaa 

attaggacat 

aggctgggct 

ccacaacgtc 

ctccggcttt 

cctggagacg 

atgagcctgc 

agaaggatct 

gcttgactag 

actctgttgc 

caaagccctg 

tttattctga 

ctcttctagc 

tetttgetga 

tctagtccca 

gctgcagtga 

aagaaagaaa 

cacacctgta 

agaccagcct 

tgggcgtggt 

cttgaaccca 

tgtgacagag 

gaaggaagga 

acagccttct 

gggccaggga 

aggcctgtgg 

agtcgggggt 

agccccgtcg 

gcacccactc 



cgcccggcta 

tcgaactcct 

gcgtgagcca 

agaaccacat 

gaagatgtag 

acagatcagg 

cggttggatc 

tgtctactaa 

ctegggagge 

agatggagee 

aaatgacaga 

attaaccttg 

cctgaagaga 

gcagttcaga 

aagaagcgtc 

cgaggtggat 

ccttgtctgt 

agctactcca 

agecgagatg 

caaaacaaaa 

tcatttttcc 

aatcatcaag 

atgaggtcct 

ggagtcttgc 

ctgcccccta 

ttggatgaaa 

tttaccaata 

ggatgcataa 

atttccagct 

gcgtctgcgc 

catccaaacc 

aagttgggaa 

aaaggtaaca 

aatggtgggg 

gggcaggagc 

tgtgcgtggg 

ctccctggca 

gtgaagctca 

cgctcacctg 

gaatcatatt 

aaggccccca 

ccaagctgat 

cgattacagg 

aagaagtttt 

tggataagct 

aaattaccca 

actactccgg 

gctgtgaata 

gaaagaaaga 

atcccagtac 

ggecaacata 

ggcgggtgcc 

ggaggtggag 

caagactccg 

gcaaaggaag 

tgtcctcagc 

gggaggttgg 

cccttgccgt 

gcaggccctg 

gggectgage 

aggtgccacg 



attttgtatt 

gacctcaggt 

ccgcgcccgg 

tcaactaaaa 

atatatcagt 

cagagggagg 

acctgaggtc 

aaatacaaaa 

tgagacagga 

actgcactcc 

tcaaaaagac 

tcttttattt 

cagctcctcc 

tacaagccac 

acaggceggg 

ggatcacctg 

actaaaaata 

gaggctgagg 

gcgccactac 

caaaacaaaa 

tgcccaacga 

tgcattccac 

ggtttccttt 

tctgtcttcc 

caatttcaca 

catagtatta 

aatctgtccc 

atctctcaac 

caacaaacat 

gaagctcgtg 

aaattaaaca 

tctgaaaaac 

gataaaccac 

agtcctgcgt 

tttactgtct 

cacacacaca 

gaggctgect 

caattgeega 

agcaactgag 

cacaaaacag 

ggtacttctc 

cttaactttg 

catgagccgc 

tatttttctg 

ttgaagecta 

aaaaaggaaa 

aggctgagct 

cteggtgata 

aagaaagaaa 

tttgggaggc 

gtgaaacccc 

tgtagtccca 

gttgcagtga 

tcaaagaaag 

tagcaagact 

cttcttgtcc 

gaegtcaggg 

geagggcett 

aggggctgag 

gaggggcaga 

agggeccagg 
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gtagagaaga 

ttgcccatat 

agggcggctc 

tgggacgggt 

cggggcgggg 

caacagtgtc 

ggaggggaga 

cagaggggca 

gccagagggg 

gctctagctg 

gcggttgtct 

gtgtaagttt 

cccctgaggg 

gcaattccct 

aaaggtgatg 

ttgttgttgt 

atctgggctc 

tgagtagctg 

gagatggggt 

cccacctcca 

tgaagaggtt 

agagcacgtg 

ctgcagcaca 

accccgaggt 

gccccgtccc 

gactcaccca 

tcctcctgta 

ccatgtcgga 

caatatttag 

tgcgtgcagt 

catccccctc 

ggctccaggc 

ctctctcacg 

acccccgggg 

cagaggaagc 

aagtccaggg 

ccgcctctgt 

ggaggtgggc 

gaggggagaa 

cctgcataga 

tggctcctcc 

aatccctcgg 

ccctcccagc 

tgagggccga 

gaccctgggc 

gactgtccct 

agacacccat 

tccaagagtt 

ctccagggcg 

agtgtggagc 

gggggcagtc 

gggtcttgcc 
ggctcctggg 
accagttctt 
taagttgttt 
aaatttgaag 
aaaaacggac 
atccatctgg 
acacatcacg 
ttgcaaaaca 
tgccgttggg 



aaggccgggg 

ccggcagccg 

ccatggtgag 

cacctgcccc 

catccacttt 

cctagagaaa 

gctcgggacc 

tggggtccgc 

catggggtcc 

ccgggacgct 

gtcctccctt 

ctaaaaacac 

cagatgctgc 

gcacaggcgt 

aactctacgt 

tttgagacga 

actgcaacct 

ggagtacagg 

ttgtccatgt 

cccctcaaag 

tgaatgaggt 

ccggcgagct 

gcgcaagtaa 

caccaggagg 

caggctggac 

cgtcccccag 

ggacagacgc 

gcatggaccc 

agcaaggaat 

cttccagatc 

cgtctgcaga 

ccaggggctc 

cacagcactg 

tgagtctgag 

gaggtctggg 

caccaggtgg 

ccccagctcc 

aggaccactg 

atcacagcct 

ggcctgtcct 

aatcatctct 

acatctggcg 

cggcccctga 

cctgaaggcc 

ccccagagcc 

accctggtgg 

aaattaataa 

ctgcagcagc 

ctgcagtccg 

tactcacccc 

tcttcgacag 

ggccccgctc 

tcatggcact 

tatctggaat 

ccaagaaata 

aagaaaatga 

ctgaatagtc 

gccctccagg 

ccggggactc 

tgcctctact 

agcactggtc 



gcagggccga 

ctgaggctga 

gataaaggcc 

aaagggcttt 

gtttttaaga 

gggcaggggc 

agctgaagca 

ggcttgcggg 

atggcttgcg 

ggggaggcag 

gctctgagga 

aacatgaagc 

cagctctggc 

ctgcagagca 

gaaactaaag 

agtttcactc 

ccgcctccca 

cacccaccac 

tggtcagact 

tgctggcatg 

acaggtgaca 

tgggcactgg 

tccatctcag 

cagctctgca 

cctgaccctt 

catcaaccac 

agccccgccg 

cccacgctgt 

ggcacctctc 

ccacacccag 

acctaatttc 

ctgagcaggg 

caggggtgcg 

gggagtcagt 

gcagagggga 

aggagggcag 

gaggcagacg 

tgggtgctgc 

ggatctcatt 

gcagcatggc 

gtgcccacga 

ctgcccgtcc 

ggccctccct 

tgagacgcct 

gagggtcatg 

gactaagcca 

aaataattgt 

agggttagtc 

tcaccagagg 

catcagcaca 

ctgcaggtcg 

cagcgtgggc 

tggggacacc 

aaaattatca 

ttttttaaca 

gcacttgaat 

tgggttcagg 

tgcaacacct 

aagcacgtta 

caacggctta 

tctgggtgag 



gtggggccaa 

aggggcacag 

acggagaagc 

gttctaaggg 

catttcgggg 

cagagaccca 

ggaggggcgt 

ccagaggggc 

ggccagagcc 

tggccggggt 

tcagtgggtg 

tttgggagga 

aggacgcagc 

aggttcaggc 

tcatatgaag 

tcattgccca 

ggttcaagcg 

cgcgcccagc 

ggtctcaaac 

acagccatga 

gaaacctgca 

cagaggcgtg 

ccgggaacac 

agacagcgac 

caacttctgc 

acccgcacct 

cagcccagga 

tatctggagg 

cccacaaaat 

gtccagacac 

cagatctgga 

aagaccctag 

gtggggaggg 

atcaaaggtc 

tctggcctgg 

agcccggaga 

cctgtcttgg 

aggggctgcc 

ctcagcccaa 

tctgagcagc 

ctgtgcaggg 

accccagacc 

aggaggagga 

cctcccacct 

gagactcctg 

caacctggag 

gaaaacacaa 

tgagagttgc 

gaccagatgc 

gagaagaccc 

ctcggcttcg 

atccggtcat 

acagctgtgc 

tcggctttgg 

aaagaaatat 

ctaaaagtga 

gtgagattct 

gacaacatcc 

aactgctcca 

atgtctgagc 

ggacttccag 



ggctgggcgg 
ctcacagctc 
ttccagagag 
agcagagacc 
gcgctgctgg 
gcaggcatgt 

ggggtccgcg 

atggggtgcg 

caggggtcag 

tcacagagga 

cctccaccgg 

tgtggggcac 

cgcaggctca 

ctgcttcatc 

agttttttgt 

ggctggagtg 

attctcctgc 

taatttttgt 

acctgacctc 

gccactgtgc 

catctcagaa 

gaggaagctg 

gttccccagg 

agaatcccag 

ttcactgtct 

gggacccgcc 

catggaccag 

agagaagaaa 

actcctgccc 

gagactggaa 

agaagatccc 

caggaggcag 

ggctctctct 

actgggaccc 

gcgggggagg 

tggccaccac 

tggggcatgc 

ggcggaggcc 

gctacacgga 

tgcacacact 

aatggccagg 

tgttcctttg 

ggtccctgag 

ttgcaggtga 

gggcccattc 

cagctcaagg 

gcaaaacaag 

tctctctgtc 

tcgtggccct 

acgtcttgtg 

cgtctggggc 

aggccagctc 

tgaatgtaca 

ccactctgag 

tgttcatact 

aaaagaaata 

acactttaaa 

atcggcaact 

attggtagct 

acttacctcc 

caagaagtat 



gggtgggcac 
cactgggaca 
ggcagcagag 
caggacagca 
ggacagtcac 
ggaagcgaga 
gcttgcgggc 
cggcttgcgg 
ccagcatggt 
ccaggggaca 
ccttggcact 
aggtcccagt 
gaatcaacac 
agtaattctc 
ttttttcttg 
cagtggcaca 
ctcagcctcc 
atttttagta 
aggtgatcca 
ccggcccata 
caaagaaacc 
cttacagagc 
tacagcgaaa 
gttacaacgc 
gtgtattggt 
catcctggac 
cctcccccag 

ggggatctgc 

caacatgttg 
aacaactgcc 
tccaggcagg 
ggagggtgct 
cacgcacagc 
agggtcctgg 
ggctgcgacc 
cacccgggag 
tcaggccacg 
agggcacagg 
accgggggtc 
caggcccaag 
accccaggga 
ttccccagct 
gttctctctg 
cccctcagag 
atggccctgg 
ctggcacaac 
ctcgtcggca 
ccacctgggg 
ccaagggctt 
gctgaagcag 
atagctggcg 
ctctctgaat 
gccacgtgtc 
gaaacttcaa 
gaagtgctct 
actttcctag 
tgcctcattg 
tcgtccatga 
aaacgagaga 
cccgctgctc 
tcacaaatga 
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aacaaaagct cagcaaactt gagctgaagg caggggaagg agagctgctt cctgaatatc 
aatgagggga ggaatcgggt ggatcgtaga aatgtttcgt gttggttgtg taaaccactg 
cctcgg 

<210> 602 
<211> 7029 
<212> DNA 

<213> Homo sapiens 



26100 
26160 
26166 



<400> 602 

ccaccaccgg 

gctcctccac 

tcgtcatttc 

gcagcgtcaa 

ggggctgggc 

gtggcacctg 
ggctcccaga 
cagacaagat 
gtcaccggat 
gagaatgccc 
agaggaaagg 
ttgactttaa 
gagttctaag 
atgaaatgaa 
cctatgtgac 
ggagagttcc 
atcaagcacc 
gagacggggg 
agctctccag 
aggagccaca 
ttcctggaac 
accttagtta 
aaatgtacaa 
gtgtagttct 
agggaccaca 
gaactcagca 
agccaggcgc 
aaggcccacg 
agaggccact 
atggcagcgc 
gtagtgaacg 
agccatgtga 
aggctgtggc 
tggacgacac 
cggggaggct 
caccacaacc 
ttccagaaca 
caaacaacaa 
ccacaccaca 
ccacacatgt 
agagccatga 
tttatacatt 
tgggcagatg 
tatgcagctg 
cagtcatgtc 
tgagtccatt 
caggcgattg 
aagccccgca 
atgtttatag 
aagtatatct 
ttcaagaccc 



ccacctgagt 
cagctccacc 
cggactctct 
catcgtgatg 
agggctcacc 
ccttgcgttc 
gtgacaaagg 
acagagcaga 
acagaggttc 
tccacccatt 
acacctgtca 
agctgctgag 
gacttcaccc 
tacttgagct 
ctggtagaca 
cccactcctt 
agagttgcag 
gctgcacctg 
ggaagtacct 
ggattgcagc 
ccccaggctg 
gataactaaa 
agtatcaccc 
gggattctgt 
tgcccccaga 
ttcatgacag 
acaaatgccc 
tgcagagccc 
gtgctggccc 
tagcctttag 
tttgctgagc 
gccccctgat 
ggacactctc 
ggcctcgcag 
ggctccccca 
aaccaatacc 
cagaactgat 
caacaaaaaa 
cctccctggc 
tccagaatct 
aacaaatgcc 
actgttacca 
cattaaggtt 
ttaagcgtaa 
tccagtattc 
cggtgcatgg 
gtgggccgcc 
acccttactg 
ttattgctgc 
tttatggtga 
caggaacctg 



gtccgtgagc 
ctgagcttcc 
gctgcggagg 
ggcccctcag 
aggacaggct 
ccacccttct 
acagtgatta 
tatcctgtaa 
tgcagaactg 
cacacagcag 
atgctggagt 
atggcccact 
aaataagctg 
tcttcagtga 
gggaaaatcg 
ttgagtgttc 
tggcttggcc 
aacatttcta 
tcctgccagg 
catgggtgcc 
tcctggccaa 
agaatacatt 
ttctccatgc 
caaattctcc 
atacaatggc 
gggctaagtg 
cttcccccca 
ctgtgtctca 
tctatcattt 
ctcccacagc 
tgttcactgt 
gccaccgagc 
ggggttgtct 
caaagtgtgt 
cacctcccac 
gcgatccatg 
ctcaggtttt 
acctcacatg 
caggcacagc 
gaaatctcaa 
ctgatgaaaa 
tgaaaaagaa 
aaatctaagt 
catggtggca 
tgtgaaaagc 
aaatccaccg 
agcaccctgc 
tgcacacaac 
gtccagtgtt 
gaacgacatt 
tatccttgct 



gggccgaggc 

tcttcggcaa 

ggggcaatac 

ccagggctgc 

gggatggtgg 

gcctgcagaa 

gacacgaagt 

acgataatgc 

tggccatctg 

gactcttctc 

tagagcttca 

gcttttaggt 

ttacttgtcc 

gaaaaaagta 

atggtgtcaa 

aggttttcct 

tctggttctg 

gtgtcaccct 

ggaagccaag 

acctttcatg 

gaggaaagag 

tcccctccct 

cccaatctgt 

ttcctgctct 

ctctgtgttc 

tgcatgaagt 

gggccgctct 

gatgctgctt 

gacctgactt 

acggatgggg 

ttctctcttc 

aggggcagct 

gcaggagagc 

cagacgagca 

ctgcattgct 

agggactcct 

tgaaaaaaaa 

atggggaaac 

gctgcccaga 

aagccattca 

taagacaact 

atgaacagca 

gtcaagaaaa 

gcttcacaac 

caggatacgt 

cggagtcaca 

gggcttcaca 

atcgcacaca 

ttgagttata 

tgggtaaatt 

ggggtagagg 





+* S ft i~Tft 

uccagcc tyy 


ou 


gayyagc t uu 


Lccagcgcyc 


ion 


cay cyacacc 


CaytCaLCCa 


ion 
loU 


cagccaggcc 


acxcgggtaa 


Z4U 


cacgggcxcc 


cggcctgagc 




ccccaucccc 


ccctccacgg 


"3 a 


ggcctagccg 


ccctcgaaag 


4ZU 


ccaggcaggc 


accgaaagga 


/ton 

4oU 


ccccacaccg 


gggcatgacg 




acacgactcx 


ggcctgaggg 


can 


ccgctxcxca 


gccaaccgac 


£T c n 
bbu 


4^ 4" ^™ ^ ^ ^ ^ 

at U taaaLaC 


uagacaagga 


f zu 


agaatcccaa 


accagctgag 


no a 

7o0 


aacaaacacc 


cagcagcgct 


q/i n 
o4tU 


ggc aaaaa eg 


ggtcaggtct 


q rt r\ 


taccatggct 


catgctttcc 


C\ c c\ 


ggtgaggtta 


tttgcaggtg 


1020 


ccccctcctt 


catgggaaac 




4# <M _TT_ 

getgggcegg 


ccgccctaca 


n 1 >i ft 

1140 


gaaggggaga 


L.uuacgggct 


1 O ft ft 


gtggttactt 


caggagtttg 


1 O C ft 

1260 


tttctttatt 


tcctcaataa 


1320 


gttaaagtca 


caatctatgg 


1380 


ccaaaatgga 


caattgtcgt 


1440 


tactggggtc 


aagectgeta 


1500 


gacactgact 


acagctagaa 


1560 


ttccagcgca 


gtcatccaga 


1 C O ft 


cagttgcccg 


tcctgtcctc 


T O ft 

1680 


tagaacctga 


cctcaaggat 


1740 


gtgatgccag 


ttagaagtgg 


1800 


tctttggaag 


cacctctccg 


1860 


tcatgaccga 


tgtctggctg 


1920 


aagecaggag 


gaeatgggee 


1980 


gtgacgggcg 


tgcggccggg 


2040 


ctccctcgtg 


ctccccaaat 


2100 


cctgtggaaa 


aggagagctg 


2160 


aaaaaaaaaa 


aaaaaaaaat 


2220 


aaaaaaaagc 


aacaaagaag 


2280 


gcggcacacc 


geggacagea 


2340 


ctaacttaaa 


acaactcaac 


2400 


gcatgttaca 


tcatcggcac 


2460 


geaegtctga 


taccatcagc 


2520 


ccctattfcct 


ttafcaatatt 


2580 


aagaaccgtg 


ateatgeatg 


2640 


taggatgtga 


gtttagaaac 


2700 


caatgctcct 


cactgtagtc 


2760 


gggacgcgtg 


cttgttctcc 


2820 


cctttgtttt 


tgaaattaaa 


2880 


ctgaagcttc 


tcatttctac 


2940 


ctgagacaac 


ttttcccatc 


3000 


tcccaaaatg 


cagtccatag 


3060 
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agtatcactg gcctggtgaa ccatcccttt tcaaaagaca atagttagaa ggaaatactc 3120 

aattttaaac agtcacacaa atttcttcaa agtatttagg atttggaaat ttgaggagag 3180 

ggttggggga agggtattgt agagaatctc ttgccccttg gaaagtaaat ggtaatgctt 3240 

ttctaatttt tttttttgta aactaagaat cagcaattaa taaaagcaag tgtactctat 3300 

aatcacagac agaaggttct tattttcagg tcagcactcc ctagaggtgc aaactttatt 3360 

taatttaaat gcagttgttt gtctaatggg aagctgatag ggccttgctg cagttctcta 3420 

aggtggccaa attaaacttg gctggcagct cctgtaatcc tcttgggcaa ctttacagcc 3480 

aagccccaag ggctaaatgg caggcctgga gctgtgagtg tggcctcccc acggagggtg 3540 

actccttgag ttcagtgagg aaaaccacgg tgcctggcag tggtcagcca ggagtatgac 3600 

tggccactcc ttaggcatcc agaagtagga aacacagaaa gcattccaca aaacggggaa 3660 

tctccccaaa gagctagaaa cccaggctgg aaggaaatgg cagcgcaggc aagagagcaa 3720 

gagcaccctg ggtgctcctc catcccgttt ctctggggag gttcatgagg tgcttcttca 3780 

acagttattc tgttcattgc ttatcacagt tcacgagttc tcgtggcatc ttttgtgggg 3840 

aggagggagg atgaaagtga cacagtgtat ctacaacact aaggtcacat taatcagata 3900 

ggaatgcagt gacgctctgc accgtctatg agagcgggca tctcgagtgt gcataccgag 3960 

gcacagatgt gcacacacat ccgcaggcaa ctgcacccac agagaaaagg tggcagtatt 4020 

ggagtcaccc caggattctt gggggctgtt tgaccagcca tcaacagtgc agccatactt 4080 

ctttttcttt tttatcgtgg tacacaactt ttatatttac aactcatgaa cttgttaatc 4140 

tggtccttca tctagtcagc tatgtatcag aaaggacttc ctgcaaaatg ttatattcta 4200 

agcccccatt tagaggaggc atccctaaac tgacatttct tgctagagtt gttcaaatat 4260 

tctgccagta tatagctaaa gagttatttt ccagttcaag agttatttgg gggctgggca 4320 

cggtggctca cgcctgtaat cccagcactt ttgtaggctg aggcaggcaa atcatgaggt 4380 

caggagttcg agaccagctt ggccaacatg gtgaaacccc gtctctacta aaaatacaaa 4440 

aaattagctg ggcgtggtgg cgggtgcctg taatcccagc tactcaggag gttgaggcag 4500 

gagaatcgct tgaacccagg aggcggaggt tacagtgagc cgagatagca ccactgcact 4560 

ccagctcaag cgacagagtg agactctgtc tcaaaaaaaa aaaaaaaaaa aagagttatt 4620 

tggacacaga tcaaaatggt ttctcctcta aagataatat taaaaccata gcacttggtt 4680 

ttatataaat catgtggcca atttggcatt attgattcta aattttgcag tgtgttgaaa 4740 

tcatctggca agtaatttgt taaaatcatg cgatgcttca acttctttgc ccacaaaatg 4800 

tgcaatgcaa tgcctgttta cttagcccac ccgtctccag cctcatctac catgctgttt 4860 

tgtcccaagt cactatgcta cacaagcatc cgccatcagc acacacacag gtgtgatcca 4920 

cagacacaac cttctatgaa tacggtaatt ggcaggaaaa gaagaggagc aggtctgttc 4980 

cacattttcc agcccaccag cgttcccacc cccttgataa ctggagggct gggaactgag 5040 

tgcctgaatt tgcagcttta ctaggatctc tccctgtcaa ggctgatcat cagagatgac 5100 

actatccatt taaaagactc acccatctcc ctgcatgtgc ttaccctcac tgagaggtaa 5160 

gaagaaacaa tttattaaac tattacgtgt taagcagctg' gcggtatgat ctgctaagga 5220 

ggaaaaggaa agaatgattt agtaaatcgt gcttcaaatc agtcctgatt cctcccatcc 5280 

gaccttgtta ttctagctga gattcttcat ataagcaata catgttaatt ctgtgctttt 5340 

caaattgaat tcctccctga gactcatatc caaccatctg ttagatattt tgctttgttt 5400 

gaataggagg cattccaaac tcaggtataa gagaatccct gacttcaatc tgccctgctg 5460 

cccccacccc accatccggc aagagctcct cacctcagcc gtggcaccga cagggaggtg 5520 

gcgctcagct ctgctcctct catcctggct gcgtgcaggg aggggacggt cacgtggagt 5580 

tctgccacgg cacctcttct tcctttaaat ctcagcatga cactgttgct acaggctgtg 5640 

tgcgtttggc cctgagttct agttatctgt gaattcatct tatccttcat taaatgtcat 5700 

tgccttacaa tacagggatt gcatctgaca tctccataca tgctccatga tcattctgtt 5760 

tgactaatac aacaatcatt tcaggacctc accacaagag acactaaacg ttctttggat 5820 

ttacagcagt ttatcactgc taaatgtcaa ctctgaggtc tgggggcaca aaacacacta 5880 

aaaatgctaa agcctcaaga aggacgcagc agggagtgat gaaagagtga actttggagc 5940 

cacgcagacg tgggttcaaa tctcctttgt caccgctatc actgtgctcc tgaacgaacc 6000 

acttaaacac tccaagcttg tctctttacc gttaaaggga aagtgtgcag aagctcccgc 6060 

agccccaagt gctaggggcc accctgaatc tgttccacac aagtgcatac tttcaagtag 6120 

gcagtgacac tcccacccca cataagcaca actggcttca gtgataagcc cctcagacta 6180 

gctatttaac tttgaaataa ccttcaccag ctttctttct catttaagac aaaatcaaag 6240 

tctacaaaac agacacggct gaaacaccaa gctcaaaagg acaagcacca gaaggaacat 6300 

ggcccttggt gtttcagatg catgaagggc ggaaacgcaa aacggcctcc cggaggtgcc 6360 

aggaaaggct ccatcttgat ggggtcccac tccagctggg gctgaccagc tccaccacgc 6420 

ctgggtgcac attgatgaca tttccccagc ctgcctccct atctcttggt tctcctgagg 6480 

gctgtggtac tttcagattt acggccagga tcaggttccc aggcctttgt gggaagcctc 6540 

tacccaactc agaggccgtg cctctcctgg cccactgagg catgcagccc acaccggtgt 6600 

cctggtgatg gtggctggaa gtcacagcca gcctgccaga catcctgagc tcggaactca 6660 

gtctccctga acagccccgc caggcaggga ctgtggaagg agatgcccac ttcctcctcc 6720 



407 



WO 02/076488 



PCT/US02/08276 



tgggcaggta gcccttttga gggcttagtt cataatctat taggcctccc tgttgtggtc 6780 

cctccaaggc taggcaagtc caagtcaagc ttcacaaagt tacctcggaa gaaaagcgct 6840 

gaggaaagag acactgatgg aggaaccaga cctgggctgc gtgccatttt ggttttgatg 6900 

tgagcccctc ccgccccatt cactgtgccc cagtgcccag cccagggctc ctcacccgcc 6960 

acatgcccga cttgtccaca gttagagaga aggcacactg ctgcttatac cagtgggttt 7020 

gtagccaat 7029 

<210> 603 
<211> 28215 
<212> DNA 

<213> Homo sapiens 
<400> 603 

tcctgtctgt aaccatcaga tttattttta tatttattaa gcagtaagca aacagagttg 60 

ggggagatga cctaaaatgg ttttcttttg gaataatatt ttttaaaaaa tcaaataact 120 

gttatcactt taagttgcat gaggctgaag ctgcacctgg agaggaattt aattttcatg 180 

acatgcaaaa ctgattaggc agtgaactct ttcctcccgc ttcccctgaa cacatttata 240 

aacttcaacg ggggtgaatt ccatctaata ctctgaaaac catttctcat ctcacagttt 300 

acccaaatca tgtgcatgac tgcaagagga aataaacaag aaatctttaa gcatagataa 360 

gccacaatgg atagttaatc ttgtacagac aatccagcat taaggctaat ttgatctttt 420 

aaacatttta tgggaaacag ctgttgagtt ttctctagaa aagcccacaa ggttccataa 480 

ccccataggc atacagcctc gacaggcagg gctccatcag gctcctcttg aactgggcac 540 

tccttcacgg aaggcaggaa tgcacgtgtc ttcacttcct gctgctctgc acgcacgtca 600 

cgatatctgg cagaaggtgg tttctgtttt ccttggtgac ctgtgggaag aagtgagaga 660 

acaggcttca gagctccttt cctcttcttc aatcttgtgc taggtaactc tctaaaagaa 720 

acgcctggaa ggctgggaga gggtgaggta tgttttgtta cccaggcagg aggcccagct 780 

cttcaggagc actcagaaac aggacaggaa tcaccgagtc ccagcatcag tttctgccaa 840 

gtgacagctc aaaaagtcaa agtgggtttg gagcccagac agatttcagc caaccatggg 900 

gagagctgtg aagtgagtcc gacgaggcac accgggacgg atgatgggca cacaggaccc 960 

ctgagcacga cctgtcctga atggaaggaa ctcaactgtt ggagtgaagc gaggaggaga 1020 

gcctgggagc cagcggggcc caggagggca cccagcagag cagcagtgaa gcgcccagga 1080 

ttctgggatg ctcagtcatc tacccgcctc tccacaaaca tcaagtgctc atttgaaaca 1140 

tggagaagtg ggagagccag agacgtgagg gtgcctccgt acagactgtt actcaggcct 1200 

cctgactgca gtgtgttctc caacacttac accattacaa aatcaaacaa aagcaacaaa 1260 

gagttcacca gagctttacc cgatgatctg tttatttttc tttcactacc acaccatccc 1320 

gagtgcccca gggtaaagga agggccatct ctgctcccat cacagtgcct tcctacttgg 1380 

cccatataat atccatccac tcaacacacg ctgatggggt ggaagggcaa acaggagcca 1440 

ggcacagaga tgcgcataga gaagcaccac aggtaaggac cctgacctca tggagctcgc 1500 

agaccaggac aggacaggca tgaacaagga caatgaagga gtctgggggg caggcaggaa 1560 

caggggcatg tgtggggtca ggggcaggaa aggcttccag gggtgcttaa gcctggatgt 1620 

aatttcgaca ggtgaaccat ggaaaagggg ctgagccaag gttcccggta agagcaggga 1680 

gtgcatgtgg aagccttctg aggttcaggg tggttgtatc tgaagagggg gcctggcata 1740 

caataggtgc caatacatgt ctgaagagga gatgcagagc ggacctcgtc aattctggtt 1800 

cctggcacct gctcaccctt cttggaagag tgctctggtt gtctttagga gactgtacgg 1860 

tctaggaaac ttgcccccag ccaacaagtg ggcaagccag gccctggcaa cttctctgag 1920 

atctgtaccc caagaggagt gactgagatc tgcaccccaa gaggagtgag ctcacgcagc 1980 

aagtgccaga agctgatccc atttctactg gtcctggctg cgtcaatccc gtgatcccat 2040 

ttctactggt cctggctgcg tcagtcccgt gatcccattt ctaccggtcc tggctgcgtc 2100 

aatcctgtga tcccatttct actggtcctg gctgcgtcaa tcctgggcgg ctgtggctcc 2160 

acttgaaatg aggacagtca gcttttcctt tggcttcatg ggacagctcc aaagcctata 2220 

aggctttttg atgctgtttt cacttaagtt agccaaagtc cattcctgtt gtttgcattc 2280 

aaggaatcca cctgaggtag aaaggaggcg acacagaggt gggcaccaca ggggcccttt 2340 

tcttccactc tgagttcagc agtgattgca gtagcttctg cctccagcca ggctctgtga 2400 

gagtggactg tgatacacag ctggagaatt tgctctggtc ttctatgaca cagtacggtg 2460 

actacagagt aaataacaag gtagtgtata tttcaagata gctagaagag aagattttga 2520 

atgtcatcac cacaaaaatg ataatttaaa gtgacagata tggaaattac cctgatttga 2580 

tcattataca atgtatacat gcactgaaac atcacactgt accccataaa tatgtacatt 2640 

atatcaatta caaattaaaa attaatttga aaattgagca gaatgcaaac actccaatat 2700 

taatattttg atactccctt tgatttccct aagattattt ctgaaaatta tgcaataaaa 2760 

tgttgatgtg caactctctg cagtattcac ttatgatatt ccctgttgag aaaagcacac 2820 

tcccaaatgc aagaacaagg atcacagcat ctgccaacct caggagtctt tgccttatgt 2880 
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gaaattctag tctacttttt attcaacagc cagaattttg gttgcagaaa gcctttcctg 2940 

agaagctaat tagaagaggg aactgggagc tgggagcagg aaaaggaaat aaagaagact 3000 

gcagaaagaa gaaagtaatc ccaaccattc tttttttctg cactggagcc ttaagagaaa 3060 

gaaaggaagc tgagtgaacc tgaacccttg cacctgccca gtggctgtag gggggccctt 3120 

ggtaagaatt tatcctgggc acagtgaaga taagactgtg agcaccatgg ggtcagaggc 3180 

cacatctcat tcatcctgtg acctccagat ccaatccctt gcttattagg aatgttgggg 3240 

gcccttccca cagtttcaag gaaatgcatg tggttttgtt ttgctttctt gcacatagtt 3300 

gtatactaag aggcatttct tcccctgcct gctccatcca tgttttcagg tctggggttg 3360 

ttaaagtcag aacataagca ggacccagta tctggtactg catctgtgcc cagcagatga 3420 

gtcaagtact gtctctcgcc gtacctaggc caaccagatg agaagacaaa tttgtgatat 3480 

caccacctca acttcttatt taaacaatca agaagcggga aggggagact gagcagggca 3540 

tacagacact gatcatatta aacaagttag aaggcagtga ggtagcatgc aagcagcctg 3600 

gaagatcaga taatatccac tcactgctgc caagaacatg tgcaccccaa ggtctaaaca 3660 

agaggcttag tcatactatg aagctgccag cttgaaaaag aactaaagtt gttgataaca 3720 

ttttcaatgt tatgacatta actagaaggc agcaaaatca atcaatgaca cactcattgg 3780 

gatacaatgg gataaaactc aatgggatac aatggtaata acctcttaga gttctgaaaa 3840 

acaaattatt gaagaaccca taagtcacct gctatgattt gactgtgtcc cccacatttc 3900 

atgtgttgga aacttaaccc ccatgcaaat agaggtggga cctttaagag gtgattaggt 3960 

catgggggca gccctcatga atggattaat gccattatag tgggagggag tgagttatgg 4020 

caggactggg ttactgataa aatgatgtct ggttctccct ctctattgtt cactgctaaa 4080 

actccagtgg atagaacagt acctgacata cagtgcgtgc tcaatattta ccaaatagat 4140 

gaataaaaat actccatcac caatttcttc tactgatgct ttactaggta ataggaggtg 4200 

ccaaagattg tatcagtgag attttatttt ccttctcagg tactgttctt tcttttcttt 4260 

tttcagctat tttttttaac acatcagaga atgttatcgt tttgcctcaa ctgtcaaaca 4320 

taattcttca aaagtcaaca gaggaagaaa agcccactac actcactcag agttttactc 4380 

ttcccattgt tctttcttcc ttcctgatgt tccaggtctc cttttaccat ttcctttctg 4440 

ataagagaac ttcctttggc cattctttta gagcagttct attatgacaa attctcttag 45C0 

ttttccttca tctgtgaatg tcttatagaa ttatgagtcg acagttgttt caacacttgg 4560 

aaaatgtacc acctccttct ggccttcatg gtttctcctg agaaatctgc tgtcactcat 4620 

attgttttta ccctataggt aaggtgctgc tttcctttca ctgctttcaa gactttaatt 4680 

ttcataagtt tgactatggt atgtcttggc atgaatttga gtttatcctc ttggggttca 4740 

ctgaaaatgt aggtttatgt cttttgccaa atatgacaaa aatgcagcca ttacttcttc 4800 

aaacactttt tcaaccccac actcatctcc tgttactaca atgatgtgaa tgtcacatct 4860 

tttgttatag tcccacaggt cccagaaact ctgttcattt tgttcattac tttttctttt 4920 

aagtctgctc tttgtcaggt aatttctatt gttctacctt caagttcact gattctttct 4980 

cctgtcatct ctattctgct gatgagccca ttcagtgact tttttattgg ctattgtata 5040 

gttctaaaat tttcatttga ttcttctatc ttctaggttt ttgctgaaac tttccagttc 5100 

ttgctgagac tatttttcat ttgtttcatg tgtatctgta attttatgaa gcatttttat 5160 

gatagctgtt ttcaaatcct tgccctataa ttccactatc tgagtcacct tgctgttgag 5220 

atctgctgat ggcctttcct cactcaggtt taggtattcc tgattcttgg catgacaaat 5280 

gattttcagt tgtgtctgtt gtgttgttcc ttaggtcctt aggttcctag ccagtctgtc 5340 

tttttctctt cacttttcag gtcttcttat gcttgtatta taaataacat ccttagcagg 5400 

aaaaataaac acaagtgtgt ctactctatc ttgtctaaaa ctggaagagt tctcgggtgt 5460 

ttttggaaag tactctgttc acccaacgtt atgaagaata agtcaagata atattactaa 5520 

tttgtgtggt atatttcccc catatcatga gctcagtctc atcttcacac tatgcatatt 5580 

cattggaaag tactctgttc acccaacatt gtgaagaata agtctagata atattactaa 5640 

ttcgtgtggt acatttcccc catatcacgg gctcagtctc atcttcacac tacgcatgtt 5700 

.cagctggtag caaggggacg ctgactctga ccccttttca tagctcattt tcttacctgg 5760 

aatgttctcc cttctttttc ccatttcctc cttcaagagc caactcaaga accccttcct 5820 

ttggacacct ctccactacc actgctcaca tagatctctc ccttccctgc ccctagattt 5880 

tgattgcccc atggtctgtt tttcttgagg gcagagatca tgcctcacag gtgcctaaac 5940 

gatggactaa gtactgaatg tgaagggcac tctcaaaaat ggattcggat actttattgc 6000 

atgtgagcaa tggaaagaac ccaaaaatat atctaatcca acacctaagt ttgcagataa 6060 

agaaaatgag gttcttggcc aggcacagtg gctcatgcct ataattccaa cactttgaga 6120 

ggctgaggca ggcagattgt ttcagctctt caattcgaga tcaaccccgg gcaacatggt 6180 

gaaaccccat ctctacaaaa aatgcacaaa tgagtcaggt gtggtggcac atgcctgtag 6240 

tcccagcttt ttgggaggct gaggtggagg atggcttcag cctaggaggc agaggatgca 6300 

gtgagccaag agcatgccac tgcactccag catgggccac agagactaac cttgtctcag 6360 

aaaaaaagaa aatgaggttc ttgatttata aaactggaaa aggccaggca cagtggctca 6420 

cacctgtaat cccagcacag attactctgg gcaacagagc aagactctgt ccctccccct 6480 

caaaaaaata agttaaataa aaaataaaat aaaactgaaa cagaaggacc acacaaaccc 6540 
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caaagcaaca acaacaaaaa aaaacagggt atgtcagccc ctatctctta cttaattaac 6600 

tcaaagctaa ttgttttatt tacagatcat tagaagcaat gattatcttg cctggagaat 6660 

aattttgagt ctcagagagc ggaagaaatg tggtatgatc ctaagacaca aagtctgttt 6720 

ctgggcatca cagtcagctc ctttcaatcc ttccttagga ctgttagatt agattagttt 6780 

gtggaactag ccaatccctg gagtctaaca ctagcaattt cccggtcttc cagcctagaa 6840 

gctatctgac ctcaaaacta attttctttc atgatctttc actattcatt gccttgattt 6900 

ccatggaaac tctataatca caaccaaaaa tgtaaaataa gagaagacga catgaaactc 6960 

tgaagaaccc cacagacttt tcagggctct caaatgtcac ctccttgaag agaccttccc 7020 

tgatcgccct tgctactcct gcccccaact accacattgc ctggtttctt tatttttgcc 7080 

tttatcagta tctgaaattg cttaacttat ctgfctcacat ttttggtatt cagctcctcc 7140 

attagaatgt aagctccatg aaagcagcgc cttggtcagt ctcgttcacc acagtatccc 7200 

aagacttaga atgacaccta gcatacagta ggtgctcaat cactattgac caaatgaagg 7260 

aataaataaa tcaataaaga agcgcataca tttttccaag tctggctggt ctaaccctaa 7320 

tggctgttta ccaccacact ccaggattct tggtggcaac atgtggatac tccaggattc 7380 

tttggtggca acatgtggat actgccagtg ggccaaggtg acatttcact cagtgattaa 7440 

aatgatcatg aggaaggtgc tcacgggtac tcactctcca ctccccctga taggagtggc 7500 

cctgcctgcc tgacttgttc acttggcact tgcagggcca gggagctttt aaactggggg 7560 

tggggagtga ggtgtgtgag cagagctcag ccaagagcac agctaccact gactcaaaga 7620 

aactaagggt aggggttggg gagaagcaat atctgactga gaggaaaaca gagggacaac 7680 

agggaaagaa aggcaagaat ttagacaaag tgccagcaat gccatgagtc ccttccgact 7740 

ggctcagacg ctgccactgg tcaacaagcc ctagcccctc atctctcaca tgtggtctac 7800 

agcagtgctt ctcaaagtgc gggggaccta aaaaccacat gagaacttgc cagaacagat 7860 

catggggcca tcccccagag tttgactcct tggaccaggg caaggcccac tcccaggtga 7920 

tgcccatgct gccattctgc aagcctcaca ctgagtggca catcctacag atgcactagt 7980 

gaatgtagca gccacacgag accacagagc acctcagatg tggccagtgt gactgagaat 8040 

ccaaatgttt aattcaattt aaatttgatt atttatgtga ttaaataaat aattcaattt 8100 

aaatatgatt aaatttatta aacataaata gcgagatgtg gctagtcgct actgttctga 8160 

gcaatgcaga gagcacattt ccctcactac tgggaagttc aactggacgg tgctgctctt 8220 

cagcacctgg aagctgtgtc tcccaacaca tccacagcct cttctcccca caagcacaag 8280 

tgggtcagct ttaaaaggga aatctttaaa acaaaacaaa aaatatttgc ctctgtatcc 8340 

caacctggcg cctttcaatc tgagaagaga taagcttaga acaagcacct tagctcaggc 8400 

aattctgatg ccagagaggt cctagtgtct ggtgaggaga tggaagaaaa aagcagccag 8460 

aacaactggg accgtcagga ggccccccca taagcgagtt tgttttttca tcaagtagag 8520 

gtataattta agttctatac atttctagaa aattagactc acctacattt cagcatggaa 8580 

gggaaaaaat gttaactcaa aaccaaagag atattttaga gccatctgct attttagatc 8640 

atctgctcca tttccactga ttatggagga ctggctgtac tcacattaaa caccttcaca 8700 

tcctttctgt ttctgatttc ttctacaaga gccagtggct ttcctttagg aactgggatt 8760 

gtgccaagga ttatagtccc tggggtagac agcgtctgac gaacagcttg aatgaaaagc 8820 

tgactgaaga gctccatctt cccaatctca tcgatgacgc acactctttg ccctgggcca 8880 

ctgctgcagt cggcctgaga atgacaaaga ctttcactgg gacatcagag tcacattaat 8940 

caaagggccc atgggtccca ctccccaatc actcaccaat gaggtaaaaa agacagaaac 9000 

ctttttactg gatttcaaaa agtgacagca tcccacagac caccgctggt actgtgtggg 9060 

aggggactgc acaagggtga gtcccaggag ctgggaatca ttaggactct tggaagctgg 9120 

cagctatgcc tgtcaccagt gggatgttca aggaagtaac ggacataaat tggaataaag 9180 

tcatgagaaa gagaatctga gatctaaagg atgcaattgt acttactttt ttttcccctt 9240 

tagccaagac ttgtttcctt ttaattttca cttttttcca acaacgctta tggaccagat 9300 

catgaaatac acaaacatct acctagagca gttatctccc ccccagcacg tagggaatct 9360 

catatggctc aaaatggaaa aaagaggcca ttattatcaa caaaggctcc catgcttgct 9420 

gtctcacttt ttttaagcct gaggtatttg ggtgccactt attttctggt tgtatataaa 9480 

tataaagctt gatctctgtc ttggatcccc tctctcgaat cactcactct gggggaagag 9540 

aagtgctatg ctgtgaggat actcaggcag ctgacggtga ggtccacgtg gtgaggcact 9600 

gaggcctcat gccaacagcc aagaggaaat gaacctgcca gcaaccacat gagcaagctt 9660 

agaggcaggg ccctcccagg caagcctcag atgaccacag ccccagccaa cagcttgact 9720 

gcaacctcag aaccccagct aagctgtacc tggattcctg atcctcagat actgtgtaca 9780 

ataatgtgtt tgttgtttta agctgctaaa ttttgagata acttgttcca tagcaacata 9840 

taactaacac acatgtcctc tatagaaagc cttcctgggg aaggctgaca ggactttggt 9900 

tgattcttcc ctgggaggga caaaacagaa catgtaataa acacttcaat gaatggtgaa 9960 

tgacaaatac accttatata tgtttgtata catgattatt ttgaaaactg tatgtaaagc 10020 

tgatttacaa agtattcttc actacagttc tgattcctta aatgatgaca gcaatatagt 10080 

cctagagggc atgtctgagt taagctagat tttgttcaaa ttccaaaagc caaagcagca 10140 

tatggtacag tgtggctggt ggtagggatc agtggcaacc aggggcagcc agtgacagac 10200 
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ggggctgcgg gcccagatcc taaaggcctt cgaaggcaca cacagaggtg ggaatcttct 10260 

cccacagccg aaagagtcta atccagggag gatcacaagt tgtgctagaa agactgctgg 10320 

ctgtggagtc agaaacaaat gttagactcc ttttggaata gacaagataa gatagtgggt 10380 

gtctaaaccc cagttgtgat tgcacggaag cagtacagca gtgtttttca gacttttaag 10440 

taaacagtac tctattttcc tactgcatat tgtacactta tgaaatattt ttccattcat 10500 

tttattagct aaaagatcaa taaggagtcc actgtaatca cccaagagag gtctgaacaa 10560 

tctccgcact tctgtgtttc tgacatgctt catttttgcc tctctattaa gtttatctgt 10620 

acacacgcct ggtctcccca cttagggcag gtacatggtt gtcctcgtcc gcatccttcc 10680 

tggggaactc accgatgtcc cagagctgag aagtggggag ctgaaggagc aatgtcaccc 10740 

tccactggtg gggtcgggga gaagggcaga gcatctatga agtaagtcac tacttattcc 10800 

aagtttttcc tccaaaacga aagccatatt tgggagaaat aaaacagcta tcaaccacct 10860 

gtctatgata ataaaaacag acaactgtac aagaaggttt ctgtcttcct gggggcatgc 10920 

cactccaact gacccgactg ttcccaagtg ctgtggacat tttgctgccc ccagagacga 10980 

agagtaattg tggatgccat tcctctggag gctgtcatct aaaagccaaa ataaaactta 11040 

ttctggcacc tcaaaatgag ttttggaagg tgccaaatca gacaagaagt gatatgaaag 11100 

aatgtgtgta cacatgcaca tgtatttatt tgactatata catgagggta gtaaaatgca 11160 

cgtggccatg tgtaaataca cacgtacagg tagaaagtgt gtggctcttc aacaaccagc 11220 

cttcctgcaa tcctctagaa cattcaccag tgaaagctca tcaacattta tttaagtttc 11280 

agcacaaaag aatttggaca cacatgtata cattactttc taccaattgg aaatcctcat 11340 

tttaaaacct tagcttacat aaaatcctca atctgtccag aattggggaa ttttataatc 11400 

caagaacttc aggaatctca aaaataaatc tgtgattttt aattaaaaat gcaagttatt 11460 

catattcagt tgagagaaag atgtcagctg ctgagccact ctgaagcatg tagacactca 11520 

gcctgctttt ctaaatagta caagatcata cactcaaggt cttaaaaaaa gaaccactat 11580 

aacgtgtaac atggttttat ttctatataa attcaagcaa ctttagaggg aaacaaagaa 11640 

ataagtaagt aaacaaacat tcaccccctt tacattgttc taaattgata gagacacaga 11700 

cttagaacaa taggggccca ggtttgaaga tggtcaggaa cccctcccca acccccagtg 11760 

tgaggagagt cctaatgagc tcccagagaa actccaccta cagctgctgg agtaaggagt 11820 

gtgtaaaaac tgtaggtggc tctcagccac gtggtcagaa tcacgatgta cacaccatct 11880 

ggggtgtaca caccaactgg ggagcaccct ttcctcccct cacagcctct cctcagagaa 11940 

caacccaggc agcattgctt cctagtgcca cacctcagtc ctatgtaaca cactagcctt 12000 

cctggccaca cccctcacag gcttccactt cagctgatta taaatgttca gccatgaagc 12060 

gcacttttag tttccctgtt ccgggagaga agatatttcc ttagatagag tcctaaaatt 12120 

gaccagccag ggataagaga agctacgaat aattttataa ctttggttac actcacctag 12180 

tttgctttcc agaaagatta gaacaacaaa tatcttcgta ccagttatat cagagacctg 12240 

aaaatagtgt gatttttttg ggtcctttat ttttatatgt acttttaaaa atacaaagaa 12300 

catataaata taaaataatt tcatttaacc aaaatatcaa actgagagca agctaatcca 12360 

agtggtcttt gccttattct gaattatttg cataataaat gtttccagtg gtattaatgt 12420 

taattttagg aggactgttt ggattgtttt attttaatat cagacacaca gactagctcc 12480 

tgagcttacc cacctgggac agcaagtatg gaaagtcatg aaagtgaata aggagtgtct 12540 

ctcagaagtt aggcagcaaa actgcagccc ttcgccaaac agcagcaagt tctagggaag 12600 

cagaaaggca aatggtgcta acagcagcca tctctgagcc agcccacaat tcctcaccca 12660 

ctgaagttat tagctaaaaa tgcagggctc cggtcatatt ttatgtgtgt ttattccaag 12720 

ccttcaagtc tcataatccc tttgctgtct ggtggcatat ggactgtgag tctccagggc 12780 

atagccagga caaggtgtct agactgcccg atggcgcctg gtaacccggc ccttccccag 12840 

tcaagaatac tccgggcaga agctgctccc ccggacatgc cagagatcct caccccggca 12900 

cacagacaga ctagctctga ggctcttcct ggatgtccta aatggcctct cgggactcac 12960 

aggggctcac ccacatggtc acccgactga agagcgcttg aatggtagcc atggggtttt 13020 

ccaagctatc ttgggactga aacaacacct cctggctgac agtcagtcca gtgagacgga 13080 

gagcccagta ctcctcgtcc tcctcatgct ctgagcactt gggactggcc tgcacatttg 13140 

ggctcctggg actccatgac agattactat cccactccgg accccagctc cctggcaaac 13200 

acaaaccaaa ttgtgaaggg aggctcctct caaggtactg ccatccacca cctcaattaa 13260 

cttctcaaga aaataatcac tgctattgct ctgctgagca cctcctagga gccaggggcc 13320 

aaggcacttt acaagcatta cattcagtcc tcccagcaac ttcacatagc aggtaacagc 13380 

atctgtttca cagatgagga aaggaaatta ggcattaagg gatttactca aaactcactg 13440 

agagcacagt gacagagctg gagtcaaacc agagatcacc tgacttcaaa gctctcttcc 13500 

acaccctgtg tgcaatacgc ctccacactt tgattctgaa ataaagaata gcatctagga 13560 

aactgaccta accacgaaga aactgctttc ctctccaata agaacgaata gcactctggt 13620 

gtttagggat ctatggaggc ataggcacaa aacctcaatt tatttggagt aatcatattg 13680 

ctctaaatta ctcgaaaata actcatattt tgggtctgag aataatatca ccaaaattat 13740 

ttcatcaagt atggatagcg aatggcccac tgcagtaggc agatatgata aagtaaaagg 13800 

acttctctcc tctaccagga gcacaccccg agaagacggt ggaagagtta tgaccaggta 13860 
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tcctgatatg 

tgcttagtaa 

cctgataccg 

taaaatggtg 

cttagagtgg 

gtaattttat 

agttaaatct 

taataaagtt 

ggtctcacac 

acagtggaga 

acacacctcc 

tccagcttta 

aaattggaac 

aatgttccaa 

aagaaatgtt 

gtattttttc 

tccttatttt 

cgtattctca 

acacacacac 

aacataggtg 

taactttctc 

aaacacacac 

atgaacttcc 

ggaaatgctg 

aactcaaccc 

tttcttgaga 

ctgcttagca 

tctgctcaat 

tcaacctcat 

tactgacagt 

gatcctgcat 

agaatcaaat 

tgtcaatctt 

ccaagtggcc 

aaaatacttt 

aaagcagagc 

aagacttatt 

gtgttctgat 

cctggaaaac 

ctccattttg 

catcaaattt 

gcattatctt 

accagaaaat 

ggagaaagag 

ccctttatgt 

atagtaaagt 

cctgggcaag 

gtgatagtgt 

agtttactat 

agaaaaataa 

acagatcaat 

ttaatataat 

tagtaaactg 

caacataatc 

tatataggta 

gaaaataatc 

gctatctgct 

atgaacaact 

taactgaaaa 

ctggcacagc 

cagcaccatc 



tccaccttcc 

aaattttcat 

ttttcattag 

atggcttttc 

aaaaaagctt 

gtttgttaat 

gcttgccaat 

tagataaaat 

tatctcccca 

aataggacaa 

agatgagata 

tttggtcatg 

tatattcttt 

attaacggag 

acaaagaata 

aatgtaagat 

taggaaaaat 

aatagtcacc 

acacacacac 

actctggata 

taagagtaaa 

gtatgtactt 

tcctaatgga 

gaaatgggac 

cagtccagct 

gtggactgca 

tctaaatcca 

agaaattatg 

gctgaatttt 

ttaaaaactg 

tacatacttg 

gccacatttg 

tcaagaatgg 

tgagagtgac 

tccctagaca 

ctatttcaga 

ttaatcaaaa 

tttagcttac 

agggctttga 

tgtgagtagg 

taataaagca 

atcagccttg 

aggagagccc 

aaacatgttt 

caaaataact 

cttgattcta 

taactttcca 

tctgaggatt 

tacaagtgcc 

aaacgattac 

ggaagagaac 

taattaagca 

gctaaatatc 

caaatgaatt 

aaattttata 

accaaagtga 

ttataaaaca 

aaaattaagg 

atgcaaatca 

gcagcaagac 

aagaccttga 



tgccagtgtt 

caatatttaa 

aaactgcctg 

taatggggag 

aaccattatg 

tactgctcac 

taaggcctca 

tagaattggc 

ctcactgctc 

gtccttgatc 

ctcgagaagg 

aggaaacatc 

acaaatgtca 

gctatggaga 

ttgtcaactg 

ttacagaggt 

atactgaaca 

cccttccagc 

acacacagag 

atgagagtat 

attttatcca 

tctcagtacc 

gtgtggtttt 

aaagtgagag 

taagggacgt 

agaagaaaga 

cagaatctcg 

aagatttgcc 

cacctcattt 

attttctgtt 

taatacatga 

ctaacagcaa 

atcctgacta 

ctaaggaaca 

ggagtcctct 

accaaaagta 

caggaaaata 

tcttcctttc 

gtagatgtcc 

agatatgttg 

ttgagggtag 

atcattagaa 

tatgaataac 

tgggctgtgg 

atagtcccaa 

tagttcactc 

ttctatgcct 

aagtgagttc 

accattacta 

catggtaatt 

agataactct 

taaaaaacta 

tggaagaaaa 

gaaaacataa 

taaatttgga 

agtgctaaac 

tctatttaag 

aaaacttaat 

aaacaaagca 

aagcacactc 

aggttcacac 



tacaaaaacc 

gcttttggta 

aggagaaaga 

gtaaccggat 

gcattttttt 

cttgctagag 

agagaaaatt 

taggggaaat 

attagtgtca 

aggtaatcaa 

atgcatcatc 

agaaaaactc 

atgtcctgga 

tgtgacaact 

atgaaactag 

tgataactgt 

tttaggagta 

ttatatattt 

agagagaata 

acaggtgttc 

aatgaagtta 

agactcaaga 

agactgatac 

ggaaaaagaa 

gtgacttaag 

catgaaggtg 

ccaacaatca 

atgacaaatc 

tcttatgctg 

tgcgccagat 

ggtagagaaa 

tgacagaaat 

gaacttggtt 

cttccaggtc 

caccttcaaa 

gatcctgctc 

caaggttcaa 

acagatcttt 

tttgttcttt 

aaaatcaata 

ctatcaaaaa 

aacagaaaaa 

ctcaccaata 

agaattaaga 

tcactccata 

cagctctacc 

cagtttcctc 

atgcctggca 

cagtttctga 

aaaatactat 

gaaacactat 

attagaaatg 

tcagctgaga 

atgtttaaaa 

atagggacag 

tgagaaaaac 

cccaaagaaa 

gaacaactga 

gtgtgttgaa 

acaccctggc 

cccaccatct 



aaattaaaaa 

aatgtttagt 

actgattgtt 

acaaatagag 

tagtgcagct 

aaacaatttc 

atttttatct 

ctgatgagga 

aagaaaaaaa 

aattaatatc 

acttatgact 

caaaagatga 

agacaaagaa 

aagtgtaaca 

aacactaacg 

attgattatg 

aagtaccttg 

atacacacac 

agcaaacaag 

ttgcactatt 

aaaaaattgt 

aaacaaccaa 

gcacgggagg 

aaacggcaac 

gagcatgcaa 

tcataaccca 

gaaataataa 

ttgcaaaatg 

attcactgca 

agcttttaat 

cagaagctga 

tctagcaatg 

tttctacatt 

gagtctaaaa 

ggaagtttct 

tcaaaaacaa 

actatgttgg 

gtaaaaaatg 

aaaacattat 

aaatgttaca 

ggccattatg 

acaatcctaa 

aaaattttaa 

ttatctgaaa 

taagacattt 

acttacacaa 

atctgtaaaa 

cgtagttcgc 

acaaggccat 

ggtactgatg 

acatacaagc 

ggttatttaa 

atctcatagc 

aaagcatgca 

acttttctct 

agaatcaata 

ataaatggga 

aaaacattta 

aataagtaga 

agtgggacac 

cagacttcac 



aaatttcact 

atctctgata 

tttcctataa 

agaagcacag 

tcaatgtata 

aaaagagagt 

taagatgatc 

ggatgtgcat 

tagtaactat 

actaaacatc 

tatggagtat 

acgttctttt 

tgactgtgga 

tctggaggaa 

atagattaaa 

taaaagcata 

atgattgcaa 

acaaacacac 

gcaaatgata 

cttacactta 

caaaaaaaga 

ctcaccccgt 

cgagacagaa 

agtttataca 

cttttctgtg 

gggcaaaagc 

agtctcagtc 

aactgcagag 

ggaatgtcct 

ttccctggct 

gagagaacaa 

ctgaggagct 

ctttctaaag 

aaacaaaaac 

aagattttcc 

atatacttta 

ttttccctgg 

acactgttca 

aatctattgt 

tcttacattg 

aaaaaggcca 

ttccagtagc 

agaaacagat 

aattaagaca 

tttttaatgg 

ttagtgtgtc 

taggaataga 

actcaataat 

tttttgaaga 

caaggatcaa 

acacatagtt 

caaacagttt 

atattacact 

gaagaagaaa 

accttgaagg 

atataaaaag 

agagcaatgg 

accccattag 

gtgctcaggg 

aaagtcaacg 

cttttagacc 
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ctccccaggg 

caaggttact 

aaatcaaaga 

tgaagattag 

ccaagcctat 

caggtataca 

tatgagttat 

catgaatatc 

gttaactcat 

tcctgagtct 

acgatgctca 

cgcctgtaat 

agaccagcct 

gtgcggtggc 

gaaccctgga 

gacagagcaa 

accagagcag 

agctgctctt 

ttgctctgtc 

tcctgggttc 

ccaccatacc 

gccagcctgg 

ctggggttac 

gcaaataggc 

agtattaaaa 

attgaaattt 

cccagcacta 

cttggttgtt 

ctaattattt 

ttccttctcc 

gaaaagaaat 

tataactacc 

gcattgttgg 

gatattcttc 

cactggatct 

atgatccaat 

gagatttggg 

acatggaatg 

atttctttat 

cttatataat 

tctactctgt 

aaccagaatc 

gtagagagga 

cccgcaaaac 

ggaaggaatt 

acttggtggg 

cc tctgagat 

aaccaccact 

atgaagacag 

gtgggaagtc 

aaggccagat 

ttcttggtgc 

catttaattt 

cttaacatgg 

ttcacctgct 

tattctcccc 

cccaatgccc 

ggtcccttgg 

ctttcacagc 

agactaacag 

acttgataaa 



aaacaatcat 

tacaatggtg 

tgaaaactca 

catgttgtga 

gaaactatac 

aaaggggcag 

tcacattgtg 

tagaacttga 

ggaatgaaaa 

agaacactta 

ggctccattc 

cccagcactt 

gaccatgatg 

aggcacctgt 

ggtggaagtt 

gactctgtct 

ttttgcaagg 

cagctcccgg 

acccaggctg 

aagcgattct 

cagctgattt 

tctcaaactc 

agacatgagc 

atagccctgc 

cacacacaca 

cttacacaca 

tgaactaata 

ttcattgcca 

taattaatat 

tttcttcatt 

ccatgaccct 

tgagactgga 

ggaggcctca 

acgtggcggc 

catgagaact 

cacctccctt 

tggggacaca 

tacatgtctg 

tttttaatta 

cttgcatcta 

ctgagcaggg 

ttcacttgga 

atgtatgtga 

acagtggcca 

ggggaagaag 

ttccaagaag 

tagaggatat 

acatacttca 

cgtgctcacc 

tgattgaacc 

atcacagggc 

catagaaaac 

tcacaaatca 

tccacagagc 

cctgctccag 

agacctcccc 

tctcacctgt 

tctaaagtca 

attaatgacc 

ttttgaaaga 

tatctgataa 



ggctagagaa 

atacggaacc 

taccatgaaa 

gaccattccc 

ataacatttt 

aaacaaaata 

ttttttattt 

atactcagga 

aaattaacag 

ccacacacac 

ctgcctaaag 

tgggaggcca 

gtgaaacccc 

aatcccagct 

gcagtgagct 

caaaaaaaaa 

tgaacctggg 

ccactttttt 

gaatgcaatg 

cctgcttcag 

ttgtttttgt 

ctggcctgaa 

cgctgcaacc 

ttggcctaat 

cacatacata 

atggatattt 

ctgtgtccct 

gtatcaaaaa 

tcttttcatt 

cccataaaat 

gactgggtga 

taatttataa 

ggaaacttac 

gggagagaga 

ccctcactat 

caggtctctc 

gagccaaatc 

catttggcaa 

aaaaacagat 

gacactaaac 

taagtaaaaa 

atttcatttg 

cgggaaaaca 

cgagagtccc 

gaggtgggag 

cacaaccccc 

cctcagaggc 

gctgagctct 

aaaaaaaatg 

agtgtggctg 

tacagggcct 

tctggatagt 

aagtagtaag 

cctgcatctc 

ggctctgagg 

accctgacac 

tcctttctgt 

gttcttcctg 

atgacaagta 

gtagaaacat 

atagatacat 



aaagattcat- 

tgaatgctca 

taggatgaaa 

atgtccctca 

ccctgatcct 

agctaacatt 

ttattttaca 

aaataacaaa 

tcaaccctac 

tctatttttc 

aaacgtaacc 

aggtgggtgg 

gtctctacta 

actcgggagg 

gagatggcgc 

aaaggaaaag 

ttgtgcctgg 

tttttttttt 

gtgccatctt 

cctcctgagt 

ttttagtaga 

gttatccact 

agtcccagcc 

tagagaagta 

caaccctcaa 

cttattatcg 

ttataaaggg 

tggtgtgttt 

tttttcttta 

atgtatgatg 

gaatcctata 

acaaaagagg 

aatcatggtg 

gagtgcaggg 

cgggagaaca 

cctccacaca 

atatcaaatc 

tcagaataca 

ttatgctctg 

acctgtgcaa 

attctttgta 

aagtatttca 

gctaaaaaga 

cagcacagga 

gaggaaacac 

tgctgaaaca 

tatagctaaa 

gaatgtcaag 

gcaagtgcca 

aagaatggca 

tctaggccac 

ttatagttgg 

tctgtaatag 

atggcccccc 

tctgaatgta 

atacaaatac 

ttaaatgtca 

tccctccctt 

taaatttact 

gtctgttttg 

aaaactacat 



gtataaaggg 

acactgaggg 

ccactaaaat 

atcatattaa 

gaatgactga 

ggctaactta 

ttttttataa 

tgttgacaaa 

tagaggcttt 

tttatttctc 

aggctaggca 

atcacgaggt 

aaaatacaaa 

ctgaggcaca 

cactgcactc 

aaaaagaaac 

ccctcctgag 

tttttttttg 

agctcactgc 

acctgggaca 

tactgggttt 

cacctcagac 

acatccttac 

cttttttctt 

actatctttc 

gaatttgtgt 

tttctaacac 

atactcccca 

gtataaagac 

ggagaaaaat 

ttagtccatt 

tgtaattgac 

gaaggcagaa 

gaaactgcca 

gcatggggga 

tggggattac 

ccctgccagg 

aaatgttggg 

atctggtcag 

actctcctcc 

ttgacccagt 

cttgactttt 

aaagtaccat 

actgagggta 

agcgtaggaa 

gaccataaga 

atcttcagca 

gagcagagag 

agggcctcag 

agatgaggtc 

tggaaggact 

tttttgagac 

attaaatgac 

acctcaccaa 

tctgttcctg 

cccaatacta 

tcttcccaga 

tcatagcacc 

tagtgtctac 

ttcatgactg 

aatcaataaa 



tgctttgttg 

agtaatagtt 

ctcccagctt 

gggaaacaca 

ataaataaac 

gtgataagat 

tttttttaac 

taaggaaaga 

ccaaaaacct 

acgagcgacc 

cagtggttca 

caggagttcg 

aattagccag 

agaatcgctt 

tagcctggct 

aagaaacata 

cctctcctgg 

agatggagtc 

cacctctgct 

cgggcatgtg 

caccatattg 

tcccaaagag 

ttccccacct 

atttttttgt 

actcatacat 

tttcatagat 

ttccagttta 

gttttgttgc 

aaaacactag 

ttctgctcca 

ttcacactgc 

tcacagttcc 

gggaagcaag 

cttttgaaac 

aaccaccccc 

aattcaagat 

aactttttct 

agaagtctgt 

catatgatct 

caggatttgg 

cagttcctaa 

gaacatttaa 

gttctctgct 

agttccaaga 

gaaaaggaga 

taccaccagc 

gcctttgggc 

gcaaagatct 

cctggcatat 

agaggttagt 

ttggacttta 

aggtctgtat 

actcacattc 

acttaccttc 

ccatttagaa 

actctctctc 

actctttcct 

ctgttctttt 

ctttcccact 

tatatgcagt 

acaaaagctg 
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gttctttgaa 
gatttaaata 
gcattatggg 
ggacaaatta 
acagacctat 
gcccacatgg 
ttcacaaact 
tattaacctg 
tcttttgtga 
aataagacac 
taggaataag 
tagccagggt 
ccgtaagatg 
gttaactgct 
tgacattttc 
agaagacaga 
caaaataaaa 
gcaaattatg 
ggggccttta 
agagcagatg 
caactgctca 
aggtggaggc 
tgatttctgg 
cttaaaaatg 
tgaacagagc 
cagggctatc 
agaaacagga 
gtaaaatatc 
gagaaatccc 
tgcccaggga 
agtatctgct 
ggagcttaga 
cccccgggtg 
ttcttcggta 
atggatcaat 
gcccattagg 
aattatggtg 
ttttgattgc 
cacgaatgat 
cacaatccca 
gtgaaatggt 
caggggctgg 
agtgggaaaa 
ctcttaaata 
gcctatcagg 
cacctttatc 
cccaccttga 
ggtttgtaag 
gccagacact 
ctctgaggca 
ggttaggtat 
tttacctgtc 
gtggcatcat 
gagctatttt 
ttaagcctcc 
cttcacatta 
ctggctacag 
tgacattcta 
tctgtttcat 
gcaatttcaa 
aaacatcaac 



aagatcatct 

actaaaataa 

tgaatctatg 

ctaaaaggta 

aaggagagat 

cttctctagt 

cttccaaaaa 

ataccaaaac 

ataaagatgc 

agttaacatc 

ataaggatgt 

aattaggcaa 

tcaataaagt 

tacagtactt 

tcttcctaaa 

ggaaagactt 

tatttatatg 

ttgtttatcc 

agagcaaaag 

ggttcaaaaa 

aaagaagtca 

tctaacctga 

tttttagctg 

accaacacaa 

taccctgttc 

tcttaaggtc 

ctaagaatta 

actttcttat 

taaaggccta 

gacactctca 

tttggtgaat 

taatagaaca 

ccggacaacg 

taaaatccat 

gttgtttttc 

aaacattccc 

ggtagagaaa 

tccaggtgct 

atcttgaggt 

gaatcaactg 

ttacagttcc 

gaatttaaga 

tcactttgtt 

tcattagcat 

ttagcaactc 

ccccacacct 

tccttcctgt 

caccaacagg 

gttctagcac 

attaccatcc 

tttacccaag 

accatgactt 

gactacccaa 

gagatactac 

ttgacataaa 

aaataatggc 

taagccacag 

atccagccct 

gcaagtaatg 

tgcgcctgcc 

aatacaaatg 



aaattgacaa 

agaaagaagt 

aacaactata 

caaattacca 

ttaattggta 

gaattctatc 

ttagaaaaga 

taaacaacaa 

agaaatcctc 

atacataatg 

ctactctcac 

gcaaatgaaa 

ttttttcttt 

ggtaggcatt 

tgtttgttgt 

atctttggga 

tactgaattt 

ttattaagac 

gcattctgtt 

gcagaaatca 

ggtcgaccac 

aggaaaagac 

ctccactgag 

aatgcggtgg 

aaagtgtcca 

caagcagcta 

taaagtgaga 

acttttttca 

agattcttga 

acctaaaggg 

ttagctcttg 

ccacagaaat 

tgacgacatc 

caacaggcac 

caactcctaa 

ttactagagc 

tgggctgtaa 

aaaaatgctt 

ttatttttaa 

agaggttcct 

catttttctg 

tttcagcttt 

cctcctttga 

ccctgttaat 

cttccaacat 

tccccagttt 

ccatccagcc 

tgctacctaa 

actttggaag 

ttatgcatat 

gctatacccc 

tttaaaatat 

caatggaaag 

agagttgcaa 

atatgatcca 

acctggcttt 

tgagcatgca 

ggccatcagt 

gcattgccga 

acagcaccct 

tgtgaaagag 



acttttagac 

gacaatatca 

tgccaataac 

caactgattc 

atttaaaagc 

aattattcaa 

aggaaacact 

tatcccaaga 

aacaaaatac 

gtgaaagact 

cacttttatt 

taaaaggcat 

tttaaaaaag 

caataacaac 

gctgtgaaac 

atgttggtac 

ataggaattt 

tcaatctttt 

tccatggcta 

cgtactcaca 

atactgccca 

attatgttct 

cctctggcat 

tctgtaaagc 

cacctccaac 

ctgcatcaac 

ggagtttatg 

tgtgatggtt 

ggatagatga 

aaagcagatg 

ttgaaaagct 

gaaatatcag 

gaatcctatt 

accagaggat 

aataaaaaaa 

acacacagca 

atggatttat 

aatttcaagt 

attgtgaaac 

gatgatattc 

cccaggaagt 

caagagatta 

cttacattgt 

ggtcactttc 

catcttctcc 

gaacctatgt 

tggtcagctg 

tatagtaata 

cattttcttg 

tttacagatg 

tcataaggtc 

gagccttgtt 

agagagaaaa 

atcatgctat 

ttattcttat 

ctggaacatg 

gctggtagaa 

aacaactgct 

atgtgtaagc 

gtagttcatg 

gacattcact 



tgaccaagga 

ccaatcttac 

ttagatactt 

aacaaaaaaa 

ttctcaccaa 

agaaaaatta 

tcctaactct 

aaagaaagct 

gcctgagtaa 

aaatactttc 

caatgttgga 

tcaaagtaga 

gtaatataat 

tggctattat 

caggttaaga 

ctgggcatca 

atctgtagct 

caattgtaaa 

gcaattccca 

ttcctcaaga 

actcggcatt 

ttgttgaaaa 

tccacatatg 

tgatcaggtt 

aatgggcctt 

agtctttgaa 

acttggtgaa 

tcattctgtt 

caacatttgg 

ggcatccaag 

cccaaagcac 

tacccaactc 

cttctccctc 

tttaaaacct 

aaaaaaaaaa 

cttgtggact 

gaatatgaac 

agaaacaagc 

attgttaatt 

tcccatgcca 

aacactggtc 

caaacggacc 

tgtaactcct 

aaactttttc 

agggtctcta 

acctctgccc 

actactctga 

ctcatcactg 

gttaagtcct 

gagaaactga 

catacctttc 

cccaatatgg 

ctcctagcat 

ttctaagact 

ttaatgtatc 

aacagcaaca 

ctgtgtagaa 

t taggaaagt 

tgctacttct 

actaggaatt 

gcaacattct 



aaaaagagaa 

aaaataaaaa 

cagatgaaat 

tacaaaatca 

gaaaaaccta 

atacgaatca 

attagaccag 

acagaccata 

actctataca 

tttctaagat 

ctggaggttc 

aagaaagaag 

aatataatac 

gttcatataa 

catgactgtg 

agagcctttc 

cagagtgata 

atgcaaaact 

caatacacta 

cgggtagtgc 

cacgttttcc 

actcctgtat 

ctaaatcatc 

ttctctgctt 

gcagttatgg 

aagtggcacc 

caagttttaa 

ttctttctct 

ctcaaccctg 

gacccaacag 

agatggaagg 

tcgataaagg 

cctgtctgac 

cactggcttt 

aagaagaaaa 

ctgaggcagc 

ttcaatttaa 

ataataaaaa 

ttgtcacgta 

tttgcatcct 

acacttgagt 

tacacttaac 

gaaaatgtgt 

ataacagcaa 

agcatttctt 

cctcagcctc 

tgaactcact 

acgtcatgga 

cacaaccatc 

ggtacatggt 

aacaggggac 

atttaagtaa 

gtctgcaagt 

acccactctt 

tatcttcaca 

caacaggatt 

cataaaatga 

cactaaagaa 

tttctgaagg 

catcctcagg 

ttataatagc 
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aaaacactga aaacaaccta aatgttgatt gtaagggaat gttaactcat ggcaccatgt 24900 

atataatagc acataaatac acagaaatac catgcaacca ttaaaaaata tactgcttat 24960 

atgtatttat tagcatggaa aatagaccac aatgtatgaa aataatatat ggaaaaagca 25020 

ggttaatagc agaaaccatt ccattatgtt ttttttaaag aatggtcaag tacgtgtcta 25080 

cagaaaaaaa aaatgtcaag aagaatacag aacaaagatt tacaatgttg atttctgatg 25140 

ttcaaattcc ctgtgattgt cactttcttc cttaaacttt tcttgttttt tttaattctt 25200 

atactgaatg ggtcttactc ttatcctaac aaaaattgcc gtttatattt tgaaaaagat 25260 

aaaatggagc aacggtaact attcctgttc acaaagggta tactgtgggg actgtatgtt 25320 

gtgtagaaca tagtagagca ataaatagga agcatttggg caaagtccct tttggcttct 25380 

tcagctctag gaatcttctt tcagttaatt ttttcttcat tagttaagct gctaccccaa 25440 

tgctgtctgg ataagaaatg acctctgctt caggagacct cgagatgata gctgtatacc 25500 

aggcacaaac agatcagaaa gctctggaat tattttggtt ctggttctcc cctaagcaga 25560 

tcctcagata aagattcaag tgcaggtagt ttaacctggg cagtgacaac atgatggact 25620 

ggcagggaag aaattcagaa agggaaggga aggacaccag catgaggtgc actgtcaagc 25680 

aagttccact gggggcagct ggagcttcaa ccccctgggg aatcctaagg gacaagtcag 25740 

ggcaagtggc tcagttatcc tgtctgagga gtgagaaagt agggaattta tcaactaact 25800 

ttcaccaatc acaggttgaa ggctaatccc agggcacagg aattccctgg ctcttctggc 25860 

ctgctctgtt tgcaggccaa gtgggctcca gggaccagag aaagccctcc aagatgagtc 25920 

atgggtacag acagttggaa atcggaaaaa atgctaagtg ccaggggcac taagaggata 25980 

tgcaacaaaa agtgacctct ttgagaggag actttacctg atggtctgaa catattgaga 26040 

aaacagaaag aaactaggca aataaaaaca cctgacaata actcctaatg gggaaaaaag 26100 

ttatgtgtaa aaggaaaaaa aaaaaattag agggcatggt ggtgcatgcc tgtagtacta 26160 

gctactaggg aggctgaggt gggaagatgg tttgagcctg gaaggtggag gttgcagtga 26220 

gccaagattg tgccactgca ctccaacctg ggcaacagag agaggccctg tctcaataag 26280 

aagagggaaa aaaaagatgg gtaaggtggc tcacacttat aatcccagta ttttgggagg 26340 

cccagatggg aggatcactt gaggctagga gttggagatc agcctgggca acataacaag 26400 

agacaggtct ctacaaaaaa aaaaaaatac ccgggtgtgg tggcatgcgc ttgtagatcc 26460 

agctattcag gaggttgaag tgggaggatc acttaagccc aggagtttga gaatgcagtg 26520 

agctataact gtttcactgc actccagcct gggtgacaaa gcaagatcct gtctctaaaa 26580 

taaataaata aatggaaaag aaatcataga ttactatgtg gctctgttgt gattgtcttt 26640 

acatggccat aatactgtaa acattgacta atgaaaatta cgctgtaact acattaagaa 26700 

gagaaggaga aaagacaaag gtgcgtgggt ttgaggacag gtttaagaga gctacactgt 26760 

agaattattt gagtctttca actacgtgta ggtaagactt tgataaagca gaaacaagaa 26820 

ccaccatcaa tacaatcacg gtaacaggac acctccatta ataaaacaca cagttgtctt 26880 

ggggtcagcc tgctgagtat caaactgcat tccaagtttc aaataaagga ttgacaaacc 26940 

aactttcaga aaacagttct ttgcaatact taggagtgac tttagatagc aggcatttct 27000 

caaaccaccc tcctcctcct caccccctgc tccccaactc ccccatccaa tgcctgacaa 27060 

aatcccaaag tctgcaatat tagctgtagc ctttgctact tacaactaaa cataatcatt 27120 

aaaataaagt gaaagagaat tctagatcca gggttacagc ctttgatatc tgaacccaca 27180 

ctaaacttca aaattagatt tgccttgcaa aggagctgaa gaaatcactt tctccactct 27240 

tagcccatct atcttttttt agcctatctg atgcagccta cctgtccttt ttagtaatgc 27300 

ttgatacatg ggtagcgcct tggtttaaaa gcacctcctc ctacacctct catctgacct 27360 

caacgctatc ctgcttctca gatgagaact ccacaggcca cagggttgag ggatctgaca 27420 

gacatcagga ggggacagcg acaggaacaa gcaagagtgt ctgagaaggg ctcctgtagg 27480 

taccaaagct tctccacctc tggcctcagc tgtggaggga attcgtgggg atggagatgg 27540 

ggatgggggt ggggggatga acttagatgg gaaagaaagg ttacaacttc agtttcgctg 27600 

tcctccaact aaaatttacc attttttttt tgcacgatca atgtagcaaa gacccacagt 27660 

aaggaaatac ctgtgactat caacagaaat tagacatctc cacttcacat cgcggttgcc 27720 

gcagatctct tgaaatacca tttgtgcaga ccacaacgtc aacactacag taggagttac 27780 

cgctgccgct ggcccctgcg gtttctgccc ttcatcaaga agccagtcta ggaccccctc 27840 

acaagtgtgt tctaatcctt taggccctgt gtcttttaaa tcacgttttc cgggaaggag 27900 

tacaaactct tcagactacc aagcccgcca tttaaattgg gaacttcaga ggccggagcc 27960 

ctggagccgg agaccctccc cgacagcaca aggtcgcccg cagcccgcgc gcccccaccc 28020 

cctcgacctc ttggaggtgg ggatcccctc agggttacct gggggccccg ttaggaacac 28080 

gtgccgggcc atactggtgg gcgggggtag gggttgcggt caggggagca gtccaggttg 28140 

gggtcgcaat tcaggtcagg tccggaccag ggtcgcgact caggacccgc ccaccgcgtg 28200 

tcgcaggccc cgccc 28215 

<210> 604 
<211> 15689 
<212> DNA 
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<213> Homo sapiens 
<400> 604 

agagcccaga gagctgaacc tgcatcccgg acctgcggcg accgtcgtac accatgggcc 60 

tccacctccg cccctaccgt gtggggctgc tcccggatgg cctcctgttc ctcttgctgc 120 

tgctaatgct gctcgcggac ccagcgctcc cggccggacg tcacccccca gtggtgctgg 180 

gtgaggcacg ggtctcgtgg tggatctgtc ggtcgggcgg gacgggccgc gggcggggct 240 

gccttcccgg tctgcttctg ttccagtcac gaaatggggg agcgtattgg tatctgtctt 300 

gtcacttagg tgatctagat ctgccggctc acctcccgcc atgctgggcg cgggtgtggt 360 

cagctcccca cctcctgtcc tggtagaaaa tgggggtgag tgggagagtt accatctggt 420 

tattcatagt tccacctgtc cgcatcattc ccaagccttg cctcccctta tcccattttt 480 

aagctggaaa gacaggtgct tcagccccac tcctgaggga tggtgggggt gtgtagagtg 540 

gctcctgagt tgacttctgc atcgacctgg gcctggttag gaggctgtca ccctcgaccc 600 

ctccaatgcc acatatgcca gaaggcattg ctgaaaggaa cctccttagg gaaaggattg 660 

ctgtcctagc tggccaccaa ctcccacaac ttatcagatg tgtggcattg cctgagggaa 720 

aaggcctaag ggaaactgaa gacaactgaa gtcaggggaa agggttggga caatggccat 780 

gaacctgtgt caggcaacat gcttgtactc ttaccctgcc acacatgctt ggactagact 840 

atgtggaggc tcggtgacct cccctgctca ccccgctcct tgatcacaac cctggcaact 900 

ctacaacggc cactgtattt ccttgcaggg atgggcaggg atgagacctc tgtctggggg 960 

gccttgggtg cctcattgag gccgtcactc ccatccagag ctggtgtgtc cgtggccagc 1020 

caggattcct gaaagcagag gcatctgagg catgaattac acgtggagtc tagtgcagca 1080 

tgatctccat gctctttctg ctgagagtac tttaaatggg gacgatgcaa atcacagaag 1140 

ggtagagttg gcaggtaggg tggtgccagg caggccccac caggagttcc cagccaggaa 1200 

aagctccagg gggatactag gcccaggttc accatgggat ttgggcagac ttggccctgc 1260 

ctccccaaag cttgcaggat aggaataaca gtgatcccca gagaggaagt ctgtgtgggg 1320 

caagcacaga gaatccacca acccagcctg gatgggtggg tgtcaagtaa ggcttcctgg 1380 

agtatatagc tggagctaag gccaggagag agtaatgggt cggggagaga cgtgagcatc 1440 

tcaggctgcc tgggatcgag tcacaggagg agtggactgc acaggacttg tgactggttg 1500 

ggtcatctga actttatctt gagggcctca gggagtcctg ggttggcctg cagagttcta 1560 

agtctgaggg gcgtggcatt aggaagcatg tggtctgtgg cacacaaaat ggcacatagc 1620 

cattttgcct ggattgatgc taaccgcact catcaattca gtcagtctgt ttatccttgg 1680 

tcacagcaga agagcctcag accccaccca gaatcacaga gcagcaggag tcctaacatg 1740 

cccagcttct cagggagtga ggctgctggc cttagccctg ggagacctca cggtgggcag 1800 

ttggataagc agctcctggg cacagagcag ggatctgacc ttgactgtca gaagatcaga 1860 

atcctctagg ggtggatcct gaaatggtgg tggaggaaag agagaccagc gatgtgatac 1920 

agaagaccct gggcaagggg aagggaccat aggcaaggcc atttccacct cttgcagcct 1980 

cagtctcccc atctgttaag tggaaatagt aaggctgtcc cctcccagtg ttacttgagg 2040 

tgaaaggcat gtggatggct cttttcctag ttccaggcgc agatgggctc aggaagtggc 2100 

cactgctatt tttaggggcc acacagcctg accagggtaa ggagcatctg agcaaagcag 2160 

atctggtgac ttcctctttt accatttccc ctttggtaga gacagtgacc agggtctctg 2220 

ccctgccctg cccagtccac aggcatgtca tccagaggct gtcatgacag ggactccctc 2280 

cctctgcacc cataccatga agggaggtga aggcatgtgc cgcccctgcc tctcatggca 2340 

ccctgaagac actgggatgg tgcccaggga gggcacgccc tgttcagggt ggatttacgg 2400 

tcagttccca gtgaatgtcc aggagagcca agtgctggtt tttcactcaa tgatcatgga 2460 

ccctaaacgg ggatgtgttt gggaagtggc gagagcatgg catgtggggg caggtgaggc 2520 

ccgaaaggtc aatgggagta acgagactgt gaagggcttt gacgcagtgg ggagggattt 2580 

gggtgtcacc agggccccag agttctgagt acactgaata ggatggctgg gatggagaga 2640 

agtcacccta gggcctggac tgtggtgata atggtgctgg ttcagggtgg ctgtttccaa 2700 

tacgggctcc cagccatgct agggccctcc agatgatctg ctctctggca ccttactgga 2760 

ctctggcctt tagtaaggcc tgggcccttg gtaggctggc gggtgccaat gtaagcttct 2820 

ctcagctgtg tttctgttca tttccctccc accagcctgt cctgcatact ttgtctggtt 2880 

gaattttagc tgagtggccc agggctctgt tctgtttcct tcttcctctc tcagggcgaa 2940 

caggaccctg tcttctafcgt agcctgggat agctatgggt ggaagacttt tgggttcaag 3000 

cctccagggt atatatcttt tttttttttt gagaccgggt cttgctctgt cacccaggct 3060 

ggagggcaat ggcacgattt tggctcactg caacctctgc ctcccgggtt caagcaattc 3120 

tcctgtctta ggctcctgag tagctgggat tacaggtgtg tgccaccaca ccaaggtaat 3180 

tttttgtatt tttagtagag atgggctttc accatattgg ccaagctggt cttgaactcc 3240 

tgaccgcaag tgatccactc acctcggcct cccaaagtgc tgggattaca ggcatgagcc 3300 

cctgcgccct gcctcgaagg tatatatcta atgacaaggg ccttggccaa actttggagg 3360 

ttctggaatg caaaggggtc tccaagcccc tcctcccagg cagagctccc atcggggtcc 3420 

attaaccata gaaaccatgt gtagaggtgt gtctgtgttt gctgggggtg attcagacca 3480 
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agatctcatt taacagatac ggaaactgag gcctagtgtg gggagtgact tcaccctgaa 3540 

cccccttcca ataatctgaa tggcccagga ggatgttcca gtgaaaatcc aggagactta .3600 

ctttatctgg agttggagac aatgttaggg cccttcccca aacatgtcag gcctctccca 3660 

agccccccac aggcatctct gcaccacaca gcagtagcag cttttctgca caaagatggt 3720 

tccctggcct ttgggtcaag gtccctgctc tctgaatttc ctgccaccag cactgtgctt 3780 

taatttgaga tgtggaactg cacaggtctg ccatattatc tgtggctctt cagggggtct 3840 

gctgcagaca tcaaagttct tgggagtctt gaacctgccg ggctgagggc tcacacatgt 3900 

cctgtgcccc ctgcagtccc tggtgatttg ggtaaccaac tggaagccaa gctggacaag 3960 

ccgacagtgg tgcactacct ctgctccaag aagaccgaaa gctacttcac aatctggctg 4020 

aacctggaac tgctgctgcc tgtcatcatt gactgctgga ttgacaatat caggtggggg 4080 

ctggggcaca cagagggggg tgctgctcac caacagtgcc tgagaggcct ccagagtctg 4140 

ttccttctat cctcatctcg aggtcactga cctctctcct tccccaggtc ctcggtctgg 4200 

tctggtctgg tctggttggc accacctctg cactcccagg cagaactaac tttcccttcc 4260 

tgctgccctt gttcctctac tctccagtcc tggcctctca tgcctacaca caattttttt 4320 

tttcttttat tttgggatgg agtcttgctg tgtcactcag gctggcgcca tctcagctca 4380 

ctgcaacctc cgtctcccag gttcaagcga ttcttgtgcc tcagcctccc aagtagctgg 4440 

gattacaggt gcccgccacc atgcctggct aattttttgt atttttagtg gagatggtgt 4500 

ttcaccatgt tggccatgct ggtcttgaac tcctgagctc aagtgttctg cccaccttgg 4560 

actcccaaag tgctgggatt acaggcatga gccactacgc ctgcctacac aaaatttctg 4620 

tgtcccagcc cctattcaga gcctcggcat ggcccccacc cctttccttg tctgtgcctc 4680 

ggcctcccct acctctccag ctgcttattc ttcttgggac atgccatgtc tttgtacctg 4740 

ctgttctttt ggcctggaat ctatttctac accctttgcc tgataaatgg cttccattca 4800 

cttttttttt tttttttttt tttggagacg gagttttgct cttattgccc aggctggagt 4860 

gcaatggcgc gatctcggct cacaactctt tttttttttt ttaactttta ttttttgaga 4920 

cagagtcttg ctctgtcgcc caggctgggt gcagtggcgc tatctcggct cactgcaagc 4980 

tccgcctccc gggttcacgc cattcttctg cctcagcctc ccaagtagct gggaccacag 5040 

gcgcccgcca cagtgcccgg ctaatttttt gtatttttta gtagagacag ggtttcaccg 5100 

tgttaaccag gatggtctcc atctactgac cttgtgatcc gcctgcctca gcctcccaaa 5160 

gtgctgggat tacaggcgtg agccaccaca cccagcccca ttcactcttt aaaacatgac 5220 

cctaggccag gcatggtggc tctgctataa tcccatcact ttgggaggct gaggcagtca 5280 

gatcacctga gctcaggagt tgaagaccag cctgggcaac ttggtgaaac cccatctcta 5340 

ccaaaaatac aaaaaaatta gctgggcatg gtggcatgca cctgtagtcc cagctactta 5400 

gggggctgag gcaggagaat tgcttgagcc agggaagcag aggttgcagt gagcagacca 5460 

ctgtacccag cctgggttgc agagcaagac cctgtctcaa aacaacaaaa acctgaccct 5520 

ggtccctttg tgacatgtcc cttttgactc cccagagcgg tttgcctctc ctcccaggcc 5580 

ccatgcttgc ttcctctaaa catcatcatg tgcccgaatt tgtctgcagg catgtttgcc 5640 

caaattgcac aggcactgta cattttatat actttctttc agttagtctt acagcaaccc 5700 

cgttttgaaa taggtgctgt tttctttttt tttttttcac cagtgttact tgccaacagg 5760 

tggtttttaa acctgcttta caactgcgaa atctggaggc tcaatgaggt tacaaaattg 5820 

actgtgctca ctcagctaat aagtggtaga atcagaattc aaatccagaa tgtgtggcgt 5880 

tgagctgagg ttccctccca ggctgagttc cgtgaaggga gggagtgtgt gtctgtttta 5940 

tccttatagc agtagaacca gaaacacagt tggcgcctga taaatgtttg agctgaatgg 6000 

agtctcccaa attaaaagtt tttcctggtc gggtgcggtg gctcatgcct gtaatcccag 6060 

cactttggga ggctgaggtg gacagatcat gaggtcagga gttcgagact agcccggcca 6120 

acacagtgaa accctgtctc tactaaaaat gcaaaaatta aggccgggca tggtggctca 6180 

cgcctgtaat cccagctagt caggagtctg aggcaggaga aacgcttgaa cctgggtggt 6240 

ggaagttgca gtgagctgag atcgtgccat tgtactccag cccaggtgac agtacaagac 6300 

tccatctcaa aaaaaagaaa atacaaaaat acaaaaatga gccgggcgtg gtggcacacg 6360 

cccatgatcc cagctactcg ggaggctgag acaggagaat cacttgaacc tgggaggcag 6420 

aggttgcagt gagcagagat gggggcactg cactctagcc tgggtgacag agcgagactg 6480 

tctcaaaaaa aaaaaaaaaa aaaagttttt cccagagacc cgctagctgc tcctccaggt 6540 

gccctggcct gccggctgtt tgtgatgccc attttgatga ctccttcctc ctctgtgtgt 6600 

tgtgtgactg gggttatttc tgggctgttg atgtgcattt ctatttttat actggggaca 6660 

cagcctcagg tgctgccagt tggaggtgcc agctgatacc aagaggggcc tgtctgcctt 6720 

cactgcttct ctgacccgtt tagattttca cccagcgcag aactgctttg gctttccaga 6780 

acagttttct gcattttgct gtttgataat ttatttttag tagcaacaag ctactttgct 6840 

atgtggtcat tgggataatc tacaaattgc cctggagaga atgggcctcc aggacctgtt 6900 

atctgggtca gttgtgggga tcctgtcagc ttcagtggct gatatctaag tgggcaagtg 6960 

ggactcagga ccaccagaaa ctccaggttc ctgtgttttg ggggcacatt tgcttcttca 7020 

gtgtgtggcg aggccagagc aaggttccca gcctcgcaca gaggcacaag gcttttctcc 7080 

tggtataagg tgtctcccct ggcccatcac cctgttccca tcatttcttc aggccttgac 7140 
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aaccctcttc ccacaggaga tggtgcagca gctgtgtact gagccctgag gcggggtctg 7200 

tcctgtctgg ttctgtttgg cactgcagct acagaaacta tgagatcacg tgtgtggtgt 7260 

tagaagctgc tgagtctgtg gtaatttatt acacagttct agaaaactaa taggctgggt 7320 

gcagtggctc acacctataa tcccagcact ttgggaggct aaggcaggag gatcgcttga 7380 

gcccaggagt ttgagaccag catgagcaca tagtgtggcc ctgtccctac aaaataaata 7440 

aacaaataaa taaataaatt agccagacct ggtggtgcct gcctagctgt tcaggagggt 7500 

gagqtaagag gatcacttga gcccaggagg ttatggctac agtgagctat gatctcctca 7560 

ctgtgctgca gcctgggcaa cagagtgaga ccctgtctca aaaaagaaga agaaagaaaa 7620 

gtaagaaagg gctgttaggg atcttaagtc agtagctggt ggtccatgtc ctatgtcatg 7680 

gcgcttagga atttgctggg caagccctga gccttgtggc tgcagtctga cagttgatac 7740 

agccctgggg tactgggagg tcataggagc atggttcatt cccttcctcc atggaggatg 7800 

tgtttggagg ttggctggct gctgtgtcca atgcgcagag gaagcgggac ccaattatac 7860 

ctacctgggg ccacacaagg ccttcctcct gtctctcccc accagctccc atcatttctc 7920 

tgggccttgc tccttctcct acctgcctac ccacccatac agcagccctg aggcatggtg 7980 

cctttctagg gtttgttttt gtttgtttgt ttgttttttt gagacagagt ttcgctctta 8040 

ttgcccaggc tggagtgcag tggcgcaatc tcggctcact acacctctgc ctcccaggct 8100 

caagcacttc tcctgcctca gactccggat aactgggatt acaggtacac gtcaccatgc 8160 

ctggttaatt tttgtgtttt tagtagagat ggggtttcac cctgttggcc aggctggtct 8220 

cgaactccta acctcaagtg atccacgtgc ctcagccttc caaagtgctg agattatagg 8280 

cgtgagcccc tgtgcctggc ctagtgtgtt ttgttttgtt tttttttttt ttttgagaca 8340 

gggtcttact ttgctgccca ggctggtgtg cagtggcatg atcacggctc actgcagcct 8400 

caatatccca ggtgcaagtc atcctcctgc ctcagccttc caagtagctg ggaccacagg 8460 

cctgcaccac catgcctggc taatttttta tttttagtag caacaaggcc tcactgtatt 8520 

gcccaggctg gccttgaact cctggcctca agcattcctc ccaccttggc cttgcaaatt 8580 

gttggaacta caggtgtcag ccactgcacc cggcccctgt ctggttctgt tgtgtactgc 8640 

actcaccacc tccttcacct gggatacgtt gggctggcag gagctggatg caaggagatg 8700 

gagcgatgga aagctgaaag agggtagact gcctcttggg gtccagtgtg aagttcacgg 8760 

cagaatcagg actggtactt ccttctccca accaggtcca agaccctcat tctagagtag 8820 

ggtcacgggg gagaggggaa tgttgctttg acaggggtcc tggggtcagt atttggagta 8880 

agtatgcctt ggtggggtcc tcatttggat gtaattactc tgatctgtgc tttttatttt 8940 

tatttattta tttatttatt tatttattta tttatttatt tttttgagac agagtcttgc 9000 

tctgtcgccc aggctggagt gcagtggtgt gatctcagct cactacaacc tccgtgtcac 9060 

aggttcaagc gattctcctg cctcagcctc ctgagtagct gggattacag tcgcctccca 9120 

cgacacccgg ctgattttgt atctttagta gagatggggt ttcgccatgt ttgtcaggct 9180 

ggtctcaaac tcctcactca ggtgacctgc cagcctcagc ctcccaaagt gctgggatta 9240 

acaggcgtga gccactgcac ctgggcaatt tttgtatttt tagtagagac agggtttcac 9300 

catattggcc aggctggttt caaactcctg acctcaagtg atcctcccac ctcagcctcc 9360 

ctaagtgctg ggattacagg tgatctgtgc tttatcaacc tgctttgtta cttttcggtc 9420 

tctcctatcc atctgaccaa acctgggcat gagaagggag tatgactcag gacatgatgc 9480 

actgttggga cacaccaagc tcagtgtccc ctccgggtca ctggctcaat atcctctcac 9540 

aacaggctgg tttacaacaa aacatccagg gccacccagt ttcctgatgg tgtggatgta 9600 

cgtgtccctg gctttgggaa gaccttctca ctggagttcc .tggaccccag caaaagcagc 9660 

gtgggtatgt agcccttact caaggcctcc gggagctggg atggggtttc tgccggactg 9720 

gagctggagc tggaggaact ctgctggttt gtagggacag cctgtgagct gtctctgatc 9780 

agcgtgggca cagagccctg tagcattctt ccaaggacct gctagctgtc acagtctcca 9840 

tgctgggcat agtgtaggtg gccggcacta gctgtatctt ttcttatcct ctgtatttct 9900 

gtctacaggt tcctatttcc acaccatggt ggagagcctt gtgggctggg gctacacacg 9960 

gggtgaggat gtccgagggg ctccctatga ctggcgccga gccccaagta agcaggcact 10020 

ctcattccct ccctgacgtc tcgggaggta ggggtgaggt gatcatgggc accacagacc 10080 

ttgggctctc cccttgtcct tggctgtctc ctgtccctgg gcctctggca tccagtctag 10140 

tggtcacagc caccaccttt ggtcagtctt atcctgtcct ccatttccca ccctgggacc 10200 

tctgggcctg tgagccctgg ggagaaatat aaggcttcct cccttcatgg aaggcggggg 10260 

gacccagacc gctctgtttg aatgtgagca ccctcccctc cccctctcgt cttgtgtctg 10320 

gcctgagaaa agctcagtgg ttccggctcc aggacccttc ccacctgacc cctgcctggc 10380 

tctggcctgc agatgaaaac gggccctact tcctggccct ccgcgagatg atcgaggaga 10440 

tgtaccagct gtatgggggc cgcgtggtgc tggttgccca cagtatgggc aacatgtaca 10500 

cgctctactt tctgcagcgg cagccgcagg cctggaagga caagtatatc cgggccttcg 10560 

tgtcactggg tgcgccctgg gggggcgtgg ccaagaccct gcgcgtcctg gcttcaggta 10620 

agaccctacc tggcccagcg tggggggctg ttgccaggaa ttctgccctc tcttcccttc 10680 

ctaagtgtcc tcctgggcca gcatgcctcg tgtctgtccc acggtgtgtg gggtctatgc 10740 

aaatctaccc ctaaaagtcc aaagaagaaa gaggctgaca aatctagttt ctcagagaaa 10800 
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agcatttaat 

agatggtgga 

ggcgactccg 

tatggttaag 

gatctggggt 

tctccacaat 

gacagagtct 

acctctgcct 

gtgcccgcca 

tggccaggct 

tgctgggatt 

acttcttcat 

tggaggcagg 

tcatgcaacc 

gggactacag 

tttcactatg 

gcctcagcct 

tttgtatttt 

acctcaaatg 

tgcacctggc 

cacggagcct 

gcctggcaat 

gcagctcacc 

ctctgctagg 

gtgccaagca 

attgtgattg 

ggactcatta 

ccctgagttt 

gctttgtttc 

gtggtgcggg 

ccagccaggc 

tttcagatac 

ttcccaagtt 

tcacattcct 

cagcacgcct 

gctgaacgct 

gggaaagcag 

agtgcccaca 

ataaatcacc 

aacatagctt 

aaggcaagca 

tagcccaggc 

gatcacttga 

gcctgggtga 

atgaggcatg 

tggttgagcc 

t^ggcaacag 

agtcacagtg 

tggctggggt 

agcacagact 

gcccctgaag 

caactacaca 

gcgggactac 

ggacacagaa 
tggtactggc 
taaaatctgc 
ggcctggcag 
catcgagatg 
ctgactcctg 
gctgtcatgg 
ggactgtgaa 



agggacatac 

tccccacact 

atgcgatgtg 

tacgtgcaca 

taatcagaag 

gactattctc 

ggttctgtca 

cccgggttca 

ccatgcctgg 

ggtcttgaac 

acaagtgtga 

ccccaccttc 

gtctttctca 

tccaccttcc 

gcgcgtacca 

ttggccgggc 

cccaaagtgc 

tagtagagat 

atctgcccac 

ctctagacta 

ctgtaggcaa 

tgggaaagcc 

agctgcagct 

acttggggga 

tctcatgagc 

ggtcaggccc 

tagccaccac 

agacaggtag 

aaaggctggg 

gaggtggaag 

cacagcaggc 

agggtggggt 

gcagctttta 

gtgtataggg 

ggctgcccag 

ggtctgagct 

aacatgtcat 

ggcctgatca 

ctgtgggtat 

ggatgtgtgg 

aactgggcaa 

gtggtgggac 

acccaggagg 

cagaatgaga 

gtggcgcata 

tgagaggttg 

agtgagaccc 

attacaatga 

ctggcagggg 

gaccagagcc 

atccgggagc 

tggtcacctg 

cgcaagttct 

gggctggtgg 

gtccccacac 

tttggtgacg 

agccgccagg 

ctggccaacg 

tgccacagga 

cccacgcgtt 

gcatctgcca 



gaatagaagc 
attagccccc 
ttgaacattg 
agaaacagta 
ccaacatggt 
ttgggtcagc 
cccagactgg 
agcgattctg 
ctaagttttg 
tcctgacctc 
gccactgtgc 
accgcagagg 
ctctgccatc 
aggttcaagt 
ccacacccgg 
tagtctctaa 
tgggattaca 

ggggtttcac 

cttgacctcc 
cagtttttta 
cacagggagg 
acccctttta 
gtagagtaga 
ggcttatgga 
tcaagtgcct 
agtgactctg 
ttcggtctag 
ccccggcccc 
aagctgttct 
tccgtgcacc 
agagctgagg 
gatgggtgac 
cttcctggag 
actccgcctc 
gtcagcgcac 
taggaaaaga 
ctccgagttg 
gcacttggag 
gtgccgcctt 
ttcccagatg 
catagcaaga 
gtgcttgtag 
tggaggttgt 
ccctacttca 
cctgtagtcc 
aggctgctgt 
tgtctcaaaa 
tgataaccgg 
cctggtggtg 
ttctccctgc 
agcagcggtc 
agaaggtgtt 
tccaggacat 
aagccacgat 
cagactcctt 
gcgatggtac 
agcaccaagt 
ccaccaccct 
ctcctgtggc 
ttgcaaagtt 
tggggaagtg 



cacatctgtg 

agacccagag 

aaggatgata 

gataaagtgg 

ggattagtat 

cctgtttttt 

aatgcagtgg 

cctcagcctc 

tgttttagta 

agatgttcca 

ctggccatcc 

aggaaggcta 

caggttggag 

gattctcctg 

ctaatttttg 

ctcctgacct 

agcatgaacc 

cattttggtc 

caaagttctg 

tttatgagtt 

aagcaggtgc 

tggctcctcc 

gcttcttcct 

ccctcacttt 

gccactgctt 

agttggctgg 

cctgggcctg 

tttcagagtt 

gtcctgcctt 

cctaggggct 

ccctacagtc 

catggatcct 

gcctccctga 

catctgcagc 

atgcaagtgg 

aaaactggta 

ccaacctgct 

gatggtagag 

actccctggt 

tagcaaaaat 

ccccatctgt 

ttccagctat 

agtgagccaa 

aaaaaaaaaa 

cagctactcc 

gagccgtgat 

agaaaaaaaa 

gtagagatgc 

aacatgctgc 

aggagacaac 

agctgtctcc 

cgtgcagaca 

cggctttgaa 

gccacctggc 

ctactatgag 

tgtgaacttg 

gttgctgcag 

ggcctatctg 

tcggccgtgg 

tgtgactcac 

ctgtttgtta 



tctcaggtgg 

cttatattct 

acacagaggc 

aaatctcagt 

ccaaaatgga 

ttttaatttt 

tgcaatcttg 

ctgagtagct 

gagatgaggt 

ctcgcttcgg 

ctccctctta 

gagcattttt 

tgcagtagcg 

cctcagcctc 

tatttttagt 

caagtcagga 

accgcacccg 

aggctggtct 

ggattacaag 

gcgcctccat 

agatctggct 

cccgtctttt 

ctagaaggga 

gggagctatg 

gacctgcttg 

aagacactca 

gagggacgga 

tctctgtgag 

ggcagtccag 

ttccttgcct 

agtctgagag 

gaacaggcca 

ctgccgggtg 

ctcccctcgt 

gtggaatgag 

cacagttcag 

gactcacttt 

tttacaggca 

gaggaaagca 

aaagagaaac 

ataaaaagtg 

gcaggaggct 

gactgtacca 

aaaggaaaaa 

aaaggctgag 

catgccatta 

agagccaagg 

gggactgtgg 

ccaaccagct 

aaccggatcc 

accagctggc 

cccacaatca 

gatggctggc 

gtgcagctgc 

agcttccctg 

aagagtgccc 

gagctgccag 

aaacgtgtgc 

acctgctgtt 

cattcaaggc 

tcctttctct 



gggcaagaca 

gtagggaaag 

tgttttgatt 

ggccttcctg 

gttgctttgg 

tattttttga 

tctcactgca 

gggattacag 

ttcatcatgt 

cctcccaaag 

ccccatcctt 

tttttttttt 

tgattttggt 

ccgggtaact 

agagactggg 

ggatctgcct 

gccagatttt 

tgaactcctg 

gatgagccac 

ctgtgggcag 

gggcctgctt 

catgactgtt 

gctttcctcc 

agccagggct 

gtctactggc 

gatttagtga 

tgtgggcata 

tccccttaga 

gactgggtgt 

caggggtccc 

gcctccctgc 

gaggcccctg 

tttgtgttgt 

gccctgtctc 

gccacggctg 

tgtcataaaa 

agactaggtc 

aggaagcatc 

aaaaaaacaa 

aataaaagcc 

gaaacatttt 

gacataggag 

ctgcactcca 

tttgccaggc 

gcgggaggat 

tactccagcc 

cagggatggg 

actgggcccc 

ggcattccta 

cagtcatcgg 

tgctgcccta 

actacacact 

tcatgcggca 

actgcctcta 

accgtgaccc 

tgcagtgcca 

gcagcgagca 

tccttgggcc 

ggcctctggg 

cccgagtctt 

gtggcagtga 



10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 
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agaaggaaga 
gtggaactgc 
agtccctgcc 
gggccctggc 
aggtcccagg 
aggcctgcca 
ccagcctgac 
cgttcttcgt 
aggagcattc 
caggggccct 
ccctgggact 
tatggagaaa 
ctcttgctaa 
ggtcagcaca 
tgggactagc 
cctaggggct 
gagagtcagg 
aaaagggtac 
ctcccacagg 
tccccttgtt 
aataaaagta 



aatgagagtc 
tgtgacctta 
tggggccatg 
cccgcagcct 
gatcggctcc 
ctggtcatgg 
tggcttcctg 
ggttcccagg 
aagctctgga 
gatttcctcg 
gtggttccaa 
gggaatccaa 
ccccaccatc 
gggctgagga 
ccagaaactt 
ttctgtctgc 
gctgccttca 
agcctctagg 
gtttctgtgc 
cctgagtggc 
ctctggatgc 



tagactcaag 
ggactggctc 
tgtcccccct 
tcctatgagg 
tggcccctcg 
gtagctagag 
ggcgagccta 
ccctgggaca 
ttgggcagca 
gatgtgctat 
ggatgagagc 
ggaagcagcc 
acactgccac 
tggggctcct 
gaatgggacc 
cccagggtgc 
tggcagtagg 
tggggttccc 
agctggattt 
cccacatggg 
tgtaaggtg 



ggacactgga 
cacagggtgg 
attcctgtgg 
gatgttactg 
ggtgaccctt 
ctgctggctt 
gtagctcctg 
tctcactcca 
gatgtgcccc 
tggccccagg 
aggggttgga 
aaggctgctc 
cctgccctag 
atccaccctg 
ctgagagagc 
tccatggatc 
ctctaagtgg 
aaagacgcct 
tctctgttgc 
gctctgagca 



tggcaagaat 
actggctggg 
gcttttcata 
ggctgtggtc 
cccacacacc 
ccctgtggct 
caggcagggg 
ctcctacctc 
cagtcccgca 
actgaagctg 
gccatggcct 
gcagcttccc 
ggtctcacta 
gccagcaccc 
caggggtccc 
tccctgtggc 
gtgactggcc 
tcacgctgga 
atacatgcct 
ggctgtatct 



gctgctgatg 
ccctggtccc 
cttgcctact 
ctgtacccag 
agccacagat 
tagctggtgg 
cagtttgttg 
ccttaccacc 
ggctgtgttc 
cctcccttca 
tctgggaacc 
tgagctgcac 
gtaccaagtg 
agcttagtgc 
ctgaggcccc 
agcaggcatg 
acaggccgag 
ctgagctgct 
ggcatctgtc 
ggattctggc 



14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15689 



<210> 605 
<211> 384 
<212> DNA 

<213> Homo sapiens 



<40O> 605 

tatggtgctg 

tggcgcaggg 

cgagctcccg 

tcgagactca 

tgcactctgc 

cctggcctct 

tggccgcacc 



gcatataacc 
agtccatgtc 
atgcctctcc 
agcccactcc 
gcccctaaaa 
cccggttccc 
caggtgccct 



agctgtcagg 
ctctctggtt 
tgctctgcag 
caaccccgcg 
cgaacagatt 
cccgcatctg 
cggg 



tctttgccca 
ccccaggttt 
gggaacttgc 
cccgaactgc 
aacccctctc 
taaccccggg 



ctctgttcgc 
gcgagagtgg 
agatggccca 
ccggactggc 
ttgggaactg 
gcagagttac 



ccctgcttcc 
agggggacca 
tggcgcaggg 

ggggtgacgc 

aacatgctga 
aggggctgac 



<210> 606 
<211> 3871 
<212> DNA 

<213> Homo sapiens 



<400> 606 

gaggcaggac 

agaagtgttt 

gtcattgttg 

agttatgggc 

tcacaggtct 

ggaaccttct 

ccatcaagcc 

gcccacaagt 

cccagcccac 

gaaagtaggg 

ttttgtttag 

tggacctaag 

aaagatctga 

tatctgtcta 

atctctcttg 

ggcatgtgac 

tagggatgtt 

gtactgcatg 

tcagcaaagc 



cttgtcctat 
cagggttctt 
ttataatctg 
cctctccttg 
acagaatcct 
agcagtcctt 
ctcctcagcc 
gcaggggccc 
tcagaaggat 
catggtcatt 
ggcgctgctg 
acttgcttcc 
attcccatca 
tctatctagt 
cccccgatcc 
tgttgagaat 
cgtgcctatt 
atgcaaacat 
agt cat teat 



tcattaatct 
ggcccttgtt 
gggttacctt 
gaaactgeae 
ggaaactctc 
tccagctcct 
cataaggtgg 
aaagacatcc 
geattttgee 
gttgaaaatg 
cttctggctt 
catcacctta 
aatctctget 
accttttgtg 
tggaatggac 
cactcaccct 
tgctaatcat 
aacatcatat 
ttactcagtc 



tgcccctcaa 
tgaccagtcg 
ttggaaggtc 
acacaccatg 
atgtccggtt 
ccccagctcc 
gccaggtggc 
cagagcaaac 
ccctctgttt 
tgagaaatgc 
aagatcctaa 
cataaaagaa 
attactagct 
aatatgagtg 
caaatacctt 
ettagageca 
gagtgacaat 
ttgtagggtt 
aaatactgat 



cagttatttt 
tcctaaccct 
atggggtacc 
cagcttacaa 
ctactcattg 
tcagctctgc 
ctgtggggat 
ccaagaatcc 
atttgtttgt 
agagaagtta 
atcaaagcag 
agagcactgg 
tttatctcta 
tttctcaccg 
gtttactgaa 
tggtttcctc 
gacataaggt 
gttgetaata 
gcactggtac 



cagaggggca 
catgtcttgg 
ettttgeaaa 
ttcagggagt 
tagcttcagt 
ttccctccgc 
aaatcagagt 
ctctacaagc 
ttttaattat 
aaatgatget 
ctgccagatc 
actaggaatc 
tttatttatt 
aggccctccc 
ggatacagat 
ttctataaaa 
acataaagct 
atactatcca 
attcagtttc 



60 
120 
180 
240 
300 
360 
384 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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ctcttttgta aaatggggat aaaaatagga cttagctcat agggtattta agattcagtg 1200 

agttaatata tataaaatac ttagagcagt ccttggaaca tattaagaac tcaatacata 1260 

ttagctagtt gagcaggctg tagtatttgt tccagccaag aaaagactgt tctctaacag 1320 

cacaggaaat aaagatgggg ttaggcacga cacggcagca ggacttacct tctgtctaat 1380 

tcagctggca gtcaaagaaa gaattattag aagcctatga gcttgttctc ccaaagatct 1440 

actgagtaca gggggatatt taaagaataa aaatccctag acccacttac agtacaggag 1500 

agacaagaag ctgttcacac aataacaagt gctaaattcc ttgattttta tactgacagc 1560 

tgaagtttta gagaagagaa ggatcaataa agaccggaat attaaagcag acaggcctaa 1620 

aagaggattt tagatttgat aaagaattcc tccagttctc agagcaggga ctttggaggg 1680 

taacaacttg gatttcagtc cttgcctgcc acttactgat tgtatgacct tggaaaagtt 1740 

acttcacttc tgtgagcaat gattctctca tctggaaaac aaaacaaaac aaaaaaacta 1800 

acaagggtga taataatacc tacctcccta cctcataggg ctgatgagaa gattaaaaag 1860 

tacctacata aagcccttct ctgcgtgcct ggagcatggc aagggctcca tgtgaaccac 1920 

tatttttttt tttttttttt aatgaaaagt catgggcagc accaaagctc agaattttgg 1980 

cagtaaggaa ttatgattct acattgaaat ttgccagaag gggagctgac tgcctcatga 2040 

gacatttttg aatgaggcca aaaaaggaaa caagtgtatc ctgggatttt acgagatgct 2100 

aggtatgtcc ttagcactta atcctcatga ctaccctatg atgtaagtac tatctgttgt 2160 

ccctatttta cagttggcca aggtcacata gctgaaacgg acttctgtgt gtcttctaac 2220 

ttttgctttc aggtctggaa aaatgcaatg taaacctaga ccctctttga aatactgaag 2280 

atggtaatct tatcccttcc tcactttctg ttttctaaaa taacagtcct tgttccttac 2340 

aatgtctgtt ttccagattc ttagaagact ttttgcttat tttccataac tctttacttg 2400 . 

tgatccctga atgacaccgg gggtatagca gagaatgtcc atttcctcaa agttcaaagg 2460 

tcctacaaaa aatagttgct agcctggcat gatggtgtgt gcctgtagcc tcagctacct 2520 

gggaggctga ggcaagagga ttgcttgagc tgaggagttt gaggctgcag cgagccatga 2580 

tcacgccact gctctccagc ctggatgaca aagcaagatc acatcttaaa ataaaataaa 2640 

ataaatttta aaaatatata aaaaataaaa agtggtgcca ctaagtgtcc tggacagcat 2700 

tagagaaagc aagagataga agagtgcaaa cacctgaaag gaaagagatg cgggtcttcc 2760 

ctctcccttg agtagccacc ggctatccct ggagcatagg agtaggattt tatcccccag 2820 

cttcggcctc cccaggcagc acttcctttc tgtgctttga ctccaatttg gatggtgctc 2880 

aggcgggaag taggctgggg tgggaggagt ttaggggaat atttgtcttc tctcctgttt 2940 

tgccctagag attctggcca ggaagacaaa tggctagtac cacttggtcc ttttcttttc 3000 

ttcttttgag acagggtctt gttctgttgc ccaggctgga gtgcagtggc gtgatcgcgg 3060 

ctcaccgcag cctcccaggc tcaagcattc ctctcacctc agcctcctga gtagctggga 3120 

ccacagctcc actaattttg aagttttttt ggtagacatg aagtctccct gtgttgcccg 3180 

ggctggtctc aaactcctga cctcaagcag tcctcctgtc ttggcctctg gaaatgctgg 3240 

gattacaggc gtgagccact gtgctggcct cttttttctt tttctttttt tttaaggttt 3300 

ttatttgtta aatgggaagt ctgtgccatc aactgagcat tgtattttct ccttagtaag 3360 

agcctgggtg ggccactggg agagaactat acattaaatg taagtagcct ctgggtagag 3420 

agcccctggc tggtttcctt tcctttctct ccttttctct actttggtgt ctggaggcat 3480 

ttcccagact ccagtttctt accaccctca cggattttgc tattgtatta tcacctcctt 3540 

tatcattccc aaaattgact ttatggagac tcattaaaag aaaaaatcat cggccgggag 3600 

cggtggctca cgccacgaag gcgggcgaat cacctgaggt gcggagttcg tgaccagcct 3660 

gaccaaaaca gagaaacccc atctctacta aacaatacaa aattagctgg gcgtggtggt 3720 

gcacgcctgt aatcccagct actggggagg ctgagacaag agaatcactt gaacccggga 3780 

ggcagaggtt gcagtgacca aagatcgcac tattgcactc cagcctgggc aacaagagca 3840 

aaactctatc tcaaaaaaaa aaaaaaaaaa a 3871 

<210> 607 

<211> 3872 

<212> DNA 

<213> Homo sapiens 



<400> 607 

gaggcaggac 

agaagtgttt 

gtcattgttg 

agttatgggc 

tcacaggtct 

ggaaccttct 

ccatcaagcc 

gcccacaagt 



cttgtcctat 
cagggttctt 
ttataatctg 
cctctccttg 
acagaatcct 
agcagtcctt 
ctcctcagcc 
gcaggggccc 



tcattaatct tgcccctcaa 
ggcccttgtt tgaccagtcg 
gggttacctt ttggaaggtc 
gaaactgcac acacaccatg 
ggaaactctc atgtccggtt 
tccagctcct ccccagctcc 
cataaggtgg gccaggtggc 
aaagacatcc cagagcaaac 



cagttatttt 
tcctaaccct 
atggggtacc 
cagcttacaa 
ctactcattg 
tcagctctgc 
ctgtggggat 
ccaagaatcc 



cagaggggca 
catgtcttgg 
cttttgcaaa 
ttcagggagt 
tagcttcagt 
ttccctccgc 
aaatcagagt 
ctctacaagc 



60 
120 
180 
240 
300 
360 
420 
480 
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cccagcccac tcagaaggat gcattttgcc ccctctgttt atttgtttgt ttttaattat 540 

gaaagtaggg catggtcatt gttgagaatg tgagaaatgc agagaagtta aaatgatgct 600 

ttttgtttag ggcgctgctg cttctggctt aagatcctaa atcaaagcag ctgccagatc 660 

tggacctaag acttgcttcc catcacctta cataaaagaa agagcactgg actaggaatc 720 

aaagatctga attcccatca aatctctgct attactagct tttatctcta tttatttatt 780 

tatctgtcta tctatctagt accttttgtg aatatgagtg tttctcaccg aggccctccc 840 

atctctcttg cccccgatcc tggaatggac caaatacctt gtttactgaa ggatacagat 900 

ggcatgtgac tgttgagaat cactcaccct cttagagcca tggtttcctc ttctataaaa 960 

tagggatgtt cgtgcctatt tgctaatcat gagtgacaat gacataaggt acataaagct 1020 

gtactgcatg atgcaaacat aacatcatat ttgtagggtt gttgctaata atactatcca 1080 

tcagcaaagc agtcattcat ttactcagtc aaatactgat gcactggtac attcagtttc 1140 

ctcttttgta aaatggggat aaaaatagga cttagctcat agggtattta agattcagtg 1200 

agttaatata tataaaatac ttagagcagt ccttggaaca tattaagaac tcaatacata 1260 

ttagctagtt gagcaggctg tagtatttgt tccagccaag aaaagactgt tctctaacag 1320 

cacaggaaat aaagatgggg ttaggcacga cacggcagca ggacttacct tctgtctaat 1380 

tcagctggca gtcaaagaaa gaattattag aagcctatga gcttgttctc ccaaagatct 1440 

actgagtaca gggggatatt taaagaataa aaatccctag acccacttac agtacaggag 1500 

agacaagaag ctgttcacac aataacaagt gctaaattcc ttgattttta tactgacagc 1560 

tgaagtttta gagaagagaa ggatcaataa agaccggaat attaaagcag acaggcctaa 1620 

aagaggattt tagatttgat aaagaattcc tccagttctc agagcaggga ctttggaggg 1680 

taacaacttg gatttcagtc cttgcctgcc acttactgat tgtatgacct tggaaaagtt 1740 

acttcacttc tgtgagcaat gattctctca tctggaaaac aaaacaaaac aaaaaaacta 1800 

acaagggtga taataatacc tacctcccta cctcataggg ctgatgagaa gattaaaaag 1860 

tacctacata aagcccttct ctgcgtgcct ggagcatggc aagggctcca tgtgaaccac 1920 

tatttttttt ttttttttta atgaaaagtc atgggcagca ccaaagctca gaattttggc 1980 

agtaaggaat tatgattcta cattgaaatt tgccagaagg ggagctgact gcc teat gag 2040 

acatttttga atgaggccaa aaaaggaaac aagtgtatcc tgggatttta egagatgeta 2100 

ggtatgtcct tagcacttaa tcctcatgac taccctatga tgtaagtact atctgttgtc 2160 

cctattttac agttggccaa ggtcacatag ctgaaacgga cttctgtgtg tcttctaact 2220 

tttgetttea ggtctggaaa aatgcaatgt aaacctagac cctctttgaa atactgaaga 2280 

tggtaatctt atcccttcct cactttctgt tttctaaaat aacagtcctt gttccttaca 2340 

atgtctgttt tccagattct tagaagactt tttgettatt ttccataact ctttacttgt 2400 

gatccctgaa tgacaccggg ggtatagcag agaatgtcca tttcctcaaa gttcaaaggt 2460 

cctacaaaaa atagttgeta gcctggcatg atggtgtgtg cctgtagcct cagctacctg 2520 

ggaggctgag gcaagaggat tgettgaget gaggagtttg aggctgeage gagecatgat 2580 

cacgccactg ctctccagcc tggatgacaa agcaagatca catcttaaaa taaaataaaa 2640 

taaattttaa aaatatataa aaaataaaaa gtggtgccac taagtgtcct ggacagcatt 2700 

agagaaagca agagatagaa gagtgeaaac acctgaaag'g aaagagatgc gggtcttccc 2760 

tctcccttga gtagccaccg gctatccctg gagcatagga gtaggatttt atcccccagc 2820 

ttcggcctcc ccaggcagca cttcctttct gtgetttgae tccaatttgg atggtgctca 2880 

ggcgggaagt aggctggggt gggaggagtt taggggaata tttgtcttct ctcctgtttt 2940 

gecctagaga ttctggccag gaagacaaat ggctagtacc acttggtcct tttcttttct 3000 

tcttttgaga cagggtcttg ttctgttgcc caggctggag tgcagtggcg tgatcgegge 3060 

tcaccgcagc ctcccaggct caagcattcc tctcacctca gcctcctgag tagctgggac 3120 

cacagctcca ctaattttga agtttttttg gtagacatga agtctccctg tgttgcccgg 3180 

gctggtctca aactcctgac ctcaagcagt cctcctgtct tggcctctgg aaatgctggg 3240 

attacaggcg tgagccactg tgctggcctc ttttttcttt ttcttttttt ttaaggtttt 3300 

tatttgttaa atgggaagtc tgtgccatca actgagcatt gtattttctc cttagtaaga 3360 

gcctgggtgg gccactggga gagaactata cattaaatgt aagtagcetc tgggtagaga 3420 

gcccctggct ggtttccttt cctttctctc cttttctcta ctttggtgtc tggaggcatt 3480 

tcccagactc cagtttctta ccaccctcac ggattttget attgtattat cacctccttt 3540 

atcattccca aaattgactt tatggagact cattaaaaga aaaaatcatc ggcegggage 3600 

ggtggctcac gecacgaagg egggegaate acctgaggtg cggagttcgt gaccagcctg 3660 

accaaaacag agaaacccca tctctactaa acaatacaaa attagctggg cgtggtggtg 3720 

cacgcctgta atcccagcta ctggggaggc tgagacaaga gaatcacttg aaccegggag 3780 

gcagaggttg cagtgaccaa agategcact attgcactcc agectgggea acaagagcaa 3840 

aactctatct caaaaaaaaa aaaaaaaaaa ag 3872 

<210> 608 
<211> 280 
<212> DNA 
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<213> Homo sapiens 
<400> 608 

tcacgcctgt aatcctagca ctttgggagg cggaggcagg cggatcacct gaggtcggga 60 

gttcgagacc agcctgacca acatggagaa accccgtctc tactaacaat acaaaaaaat 120 

tagccgagca tggtggcgca tgcctataat ctcagctact tgggaggctg aggcaggaga 180 

atcgcttgaa cctgggaggc ggaggttgca gtgagccgag atcgcgccat tgcactccag 240 

cctgggcaac aaaagcgaaa ctcggtctca aaaaaaaaaa 280 

<210> 609 
<211> 6654 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<222> (554) . . (554) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 

<222> (640) . . (640) 

<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (650) . . (650) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_f eature 
<222> (953) . . (953) 
<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 
<222> (1019) . . (1019) 
<223> n equals a,t,g, or c 

<400> 609 

gaaactgtct tctactgact gtttcaaaac cgaggccttc acatccccgg aggccctgca 60 

gcctgggggg actgccctgg cgcctaagaa gaggagccgg aaaggccggg caggggccca 120 

tggactctcc aaaggcccgc tggagaagcg gccctatctt ggcccggctc tgctcctgac 180 

tccccgagac agggccagtg gcacacaagg ggccagtgag gacaactctg gtggaggagg 240 

caagaagcca aagatggagg agctgggcct ggcctcccag cccccggagg gcaggccctg 300 

ccagccccag acaagggcac agaaacagcc aggccacacc aactacagca gctattccaa 360 

gcggaagcgc ctcactcggg gccgggccaa gaacaccacc tcttcaccct gtaaggggcg 420 

tgccaagcga cgacgacagc agcaggtgct gcccctggat cccgcagagc ctgaaatccg 480 

cctcaagtac atttcctctt gcaagcggct gaggtcagac agccggaccc ccgccttctc 540 

acccttcgtg cggngtggag aagcgagacg cgttcaccac catatgcact gttgtcaact 600 

cccctggaga tgcgcccaag ccccacagga agccttcctn ctctgcatcn tcttcctcat 660 

cctcgtcatc gttctccttg gatgcagccg gggcgtccct ggccacactc cctggaggct 720 

ccatcctgca gccgcggccc tccttgcccc tctcctccac gatgcacttg gggcctgtgg 780 

tttccaaggc cctgagtacc tcttgccttg tttgctgcct ctgccaaaac ccggccaact 840 

tcaaggacct tggggacctc tgtgggccct actaccctga acactgcctc cccaaaaaga 900 

agccaaaact caaggagaag gtgcggccag aaggcacctg tgaggaggcc tcngctgccg 960 

cttgagagaa cactcaaagg tcccgagtgt gcagctgccg ccactgccgg gaagccccnc 1020 

caggcctgac ggcccagctg acccggccaa gcagggccca ctgcgcacca gtgcccgggg 1080 

cctgtcccgg aggctgcaga gctgctactg ctgtgatggc cgggaggatg ggggcgagga 1140 

ggcagcccca gccgacaagg gtcgcaaaca tgagtgcagc aaggaggctc cggcagagcc 1200 

cggcggggag gcccaggagc actgggtgca tgaggcctgt gccgtgtgga ccggcggcgt 1260 
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ctacctggtg gccgggaagc tctttgggct gcaggaggcc atgaaggtgg ccgtggacat 1320 

ggtaagaggc cagcccagcc agggtgggga gtgtggggtt ccaaaggaca ggcaggcagg 1380 

cagtcgggga gccccttgtt tctagtgcta cagtgtgggc caaatgtgtc tgcagtctcg 1440 

ggacaatctg cagagtcctg agcctctctg gggtgtgtgg gggaagtgga ggcacccatc 1500 

tggctaaggc aggtcacact cacctaccct gtcccaggag acttcccgcc ggggcccgta 1560 

tgcccagcca ccgccactct cccactcacc tggcatcttc aggctccagc cccacccaga 1620 

atccactggg cccctcccca tgctgtcacc ggagatgcca aggcccagag aacacgagga 1680 

ctcccaagat tgaacatcag ggctgccccg ggcggtgcaa gagcaagttg cccaccctct 1740 

gttcctgtct ccattgcctc tgggatgaag ctaagggctc aggaagaccc tctctggcga 1800 

cacaggcttg gtgcatggac ccagtcacag ggttcctccc tctgcaggac agtccatggg 1860 

gtcacacagt cacaacaggc agggcggggc agatccagac cctctcacca ttggctctga 1920 

gaccagcttt ggaagccata gtcactgagc accaacatga gagagggaac tctggggctg 1980 

tgccgccatg cacctttggg gctgtgacct gccctctctg tgcctccgtc ccctcgcccc 2040 

taacgtgggt atgagggtgc ccatggagga ggggtcggga gggttaaaag gtgggcacct 2100 

ctgagcccac cgagcacagc agccctgtga tagtcagtgc agaggggcct ggtggctctg 2160 

gtgacagcag ggtgacgttc ttcgtggtgt caccacttcc ccagtaccat agagtacctc 2220 

tgtgcccttc tgagggaggg gtcagtgggg gcggtggggt gcacctctcc cccggggcca 2280 

tgcttctgcc cccacattca accccgactc cactgcctca tccagacttc cgtggagtga 2340 

cgctcccggc tctcagcgct gtgtggtggc tctcagtggg gacgcctgca gccttctctg 2400 

gcccctgccc tgagtcccat cccctccact ccttccttca acccactatg tgcggctgag 2460 

gtcacagggg agccagaggg gcttggcagg ggtctccaga ggccctggac aaacactgcc 2520 

tgcatctgag tgattcagtg acccagcaca gtggccccag ccggtggctt cgcccctccc 2580 

cctgcccagc ctcctccccc gtcgctgcac tgcccacctg ctcctcgcag tacacccctg 2640 

cagcccctct gagcagcccc tgctcctgac agttgccgtg gcattgcctg gccaaaccaa 2700 

ggtcccggtg ggggcccact gtgtttgcca cccacgcctc ctagtgtgtg tcttgggcac 2760 

ctgcttccca ctgcccatgg ggatggctgg agaccttctc cacgcccaga cgccctgccc 2820 

acctcactcc accctccacc tcagaggaag gctggccccg acaggactcc tgcaggttcc 2880 

tggtgccgcg ggccttgctt acccagcaga tggtggcctt gggcctcttc cctctcccct 2940 

tcccagttag tgattttttg cctctctcct gctttctgtc tctctctgtc tctctctctc 3000 

tctctctctc tctctctctc tctctctctc tctctcatta ctggctcctt cccagcgcct 3060 

tgaaacaggt tcaagtctct cccgtcatca aaaaataata accctcagat caccagcccc 3120 

agctggctgc cacctcccct gcttcacagc cccattcctg aaagccttgt ccccacggca 3180 

tcctcggtcc cgcagcctca ccaggggctc cccgacactc acgtcccacc aggctgagat 3240 

caggtcgtct ccacatcccc tgcagccagc actggccggc cgaagggagg aggggtcagc 3300 

cccgagcacc gaccacccca tcctgtccct ggcccatggg tgtcttaacc gcccatgcac 3360 

acacaaggtc ctgctggact gaggagacag tgtgtgttgg agaggcgcta gcttccctac 3420 

accacgcggg ggctggaact tgccagccca gcccatgcct gtctgctgtg ccatgctcca 3480 

gaaatggcct gacagcctgc gcaactaggg ctgccccggg aaggtcggaa ccaggaacac 3540 

cctgccagac agatgggaca tcctgggaga atgggcagag tctggggctt cccagggtca 3600 

cttttggctc accacccctt cctgcctcag tggcttgggg gcgccatgct ccttggagtg 3660 

agaccctggt tcaaatccca gctctgccga gggctggttc catagctcca ggcgagtcac 3720 

gcatgctctg agaactccac aatgtcaaac cccgttagca ggattgtggg gttctgtgga 3780 

tcatgtggga cctgcctggc atgtggaagg ggcttggggt gcagctgtgt ccctggcccc 3840 

agcactgcca cacccctcgg taccttgacc tcacttttgc gctctggagg gcagaacctc 3900 

tgagccagct ctcagggata ccttttttcc tcgatggcct tttctgggaa ctgccctcgc 3960 

ctgctatctc tgagccctgc ctgcccttcc cccaaagcca tcatagaccc ctacctccta 4020 

tccagatgga gcctggccat ggaatgcttg gggtgagagg gacccatagc gggctccggg 4080 

cattgttagg gggctaggtg gtgtctgcag aggccccacc tgtggcaggg tttacatgcc 4140 

gccacccctg ccctcctctt gggtagctgg ggtcagaagg gcctgaggcc tcactctctc 4200 

tgcatcttag tttcttcaca tctaaaatgc tattcatatg cctgcctgag ggtggttggg 4260 

gcctccgtgc agtaagtgga gagaagcctt cagccagagg acccccatat gcttaaacat 4320 

ccagacacca agggccactc acagacagca gccggcttgt gtttcgtcat cccttttggg 4380 

tcaggaactg gggacgctgg actccagatc tccaggaggc catcccagac taggcgggaa 4440 

ggaggcctca gccctgagct gaaaacctga aattctagaa taaaccccag gccccactcc 4500 

cagcatgggc agcggggagg ggagtgttct gacatttctc agctgctgtg gttgggatgg 4560 

tgcgcccacc ctgagcatgt ggggtcctgt gggcccctac tccagcctgg aggcagctgc 4620 

aggggtcaga gaccagcctg gagccctcgg gcaggttact cagcctcagc tctcattggt 4680 

aacccaggta aaataacagt aatcccatca taaggttgct gtgggcatga ggcttccagc 4740 

catgcctggc acataggaag ctattgtcgt ttgtttcact ggcttcgcac ttgggcagag 4800 

gccgccgccc tctctgctgc cttctctacc tcaccccccg cccccagcac ggtggcactg 4860 

tggctctggc gcctggtgac ttctccctcc ccggcctcag gtttgcttca ctcttccaag 4920 
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ggccttcttg gtgtgactgt cctgtcctgg ccacgtccac tccaagcagc tggtccaggt 4980 

gtggggcagc ccccaagctc agcagaggac agttttgaga cttagaggag gacggcatca 5040 

gatttacatg aagtcagcca tctcaaggga ttaccacgtc accgcctcct ctcacaggtc 5100 

tgagagctgg tcccctggag ggtgcaggtc tcatcctagg tagaacagtc ccagcctagc 5160 

ttcaaagtgc tcctgctgcc agaaagggta gaaaaccatg ctgaatgcct tcactgttgc 5220 

ctggaatggg agcctcagtt tcaccatctg taaagtgaga ttaatgaaag ggacccacag 5280 

agggctgtta gagccagtag gcggccaggc gcggtggctc acgcctgtaa tcccagcact 5340 

ttgggaggct gaggtgggcg gatcacgagg tcaggagatc gagaccatcc tggctaacac 5400 

ggagaaaccc cgtctctact aaaaatacaa aaaattagcc tggcgtggtg gcgggcacct 5460 

gtagtcccag ctactgggga ggctgaggca ggagaatggc gtgaacccgg gaggcggagc 5520 

ttgcagtgag ccaagattgt gccactgctc tctagcctgg gcaacagagc aagactccat 5580 

ctcaaaagca aaagaaaaaa gagccagtag gccaggcgct atgtcttaca cctgtaatcc 5640 

cagcactttg ggaggcctag gcaggcagat cacctgaggt cgggagttcg agaccagcct 5700 

ggccgacaca gtgaaactcc gtctctacta aaaatacaaa aattagctgg gcatggcagc 5760 

acatgcctgt aatcccagct actcgggagg ctaaggcagg agaatcactt gaacacagga 5820 

ggcagcggtt gccgtgagcg gagatcacac cattgcactc cagccccggg gggacagagt 5880 

gaaattctgt ctcaaaaaaa aaaaaaaaaa aaaaaagcaa gccagtgggc aaggacacac 5940 

accacgccca gcacaccaga agcatgcagc agatgctggc tggtagccac agccatggta 6000 

gtaactggca tcactggccc aagggggtct ccctacccaa tgtaccagcc cagagggtgc 6060 

taaatgaccc atgtttgtgg gcatgtgtac cccagtgctg aacctccttc tggtagattt 6120 

tcaagcacca ttttggagaa gggaggcagg gaccacaggg ggccagccag cctgtacagg 6180 

ttttttgtgt tttttttgtt tgtttgtttg tttgtttttt gaggcagggt ctcactcttg 6240 

cccaggctgg agtacaggca gtggcgtcat ctcggctcac tgcagcctca acctcccagg 6300 

ctgcagcaat cctcccacct caccttcctg agtagctggg actacaggca cacaccacca 6360 

accctcacta ttttttgtag agacagtttc accatgttgc ccaggctggt atcaaactcc 6420 

tggactcagg cgatcctccc accttggcct cccagagtgc tgggattaca ggcatgagcc 6480 

actgcacctg gcctaccagc ctgtaaagct tgagggctgg gctccaactg gagactcacc 6540 

tgcctttcct ttctcttcat cagatgtgtt ccagctgcca agaagccggg gccaccattg 6600 

ggtgctgcca caaaggatgc ctccacacct accactaccc gtgtgccagc gatg 6654 

<210> 610 

<211> 6650 

<212> DNA 

<213> Homo sapiens 

<400> 610 

gaaactgtct tctactgact gtttcaaaac cgaggccttc acatccccgg aggccctgca 60 

gcctgggggg actgccctgg cgcctaagaa gaggagccgg aaaggccggg caggggccca 120 

tggactctcc aaaggcccgc tggagaagcg gccctatctt ggcccggctc tgctcctgac 180 

tccccgagac agggccagtg gcacacaagg ggccagtgag gacaactctg gtggaggagg 240 

caagaagcca aagatggagg agctgggcct ggcctcccag cccccggagg gcaggccctg 300 

ccagccccag acaagggcac agaaacagcc aggccacacc aactacagca gctattccaa 360 

gcggaagcgc ctcactcggg gccgggccaa gaacaccacc tcttcaccct gtaaggggcg 420 

tgccaagcga cgacgacagc agcaggtgct gcccctggat cccgcagagc ctgaaatccg 480 

cctcaagtac atttcctctt gcaagcggct gaggtcagac agccggaccc ccgccttctc 540 

acccttcgtg cgggtggaga agcgagacgc gttcaccacc atatgcactg ttgtcaactc 600 

ccctggagat gcgcccaagc cccacaggaa gccttcctcc tctgcctcct cttcctcatc 660 

ctcgtcctcg ttctccttgg atgcagccgg ggcctccctg gccacactcc ctggaggctc 720 

catcctgcag ccgcggccct ccttgcccct ctcctccacg atgcacttgg ggcctgtggt 780 

ttccaaggcc ctgagtacct cttgccttgt ttgctgcctc tgccaaaacc cggccaactt 840 

caaggacctt ggggacctct gtgggcccta ctaccctgaa cactgcctcc ccaaaaagaa 900 

gccaaaactc aaggagaagg tgcggccaga aggcacctgt gaggaggcct cgctgccgct 960 

tgagagaaca ctcaaaggtc ccgagtgtgc agctgccgcc actgccggga agccccccag 1020 

gcctgacggc ccagctgacc cggccaagca gggcccactg cgcaccagtg cccggggcct 1080 

gtcccggagg ctgcagagct gctactgctg tgatggccgg gaggatgggg gcgaggaggc 1140 

agccccagcc gacaagggtc gcaaacatga gtgcagcaag gaggctccgg cagagcccgg 1200 

cggggaggcc caggagcact gggtgcatga ggcctgtgcc gtgtggaccg gcggcgtcta 1260 

cctggtggcc gggaagctct ttgggctgca ggaggccatg aaggtggccg tggacatggt 1320 

aagaggccag cccagccagg gtggggagtg tggggttcca aaggacaggc aggcaggcag 1380 

tcggggagcc ccttgtttct agtgctacag tgtgggccaa atgtgtctgc agtctcggga 1440 

caatctgcag agtcctgagc ctctctgggg tgtgtggggg aagtggaggc acccatctgg 1500 
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ctaaggcagg tcacactcac ctaccctgtc ccaggagact tcccgccggg gcccgtatgc 1560 

ccagccaccg ccactctccc actcacctgg catcttcagg ctccagcccc acccagaatc 1620 

cactgggccc ctccccatgc tgtcaccgga gatgccaagg cccagagaac acgaggactc 1680 

ccaagattga acatcagggc tgccccgggc ggtgcaagag caagttgccc accctctgtt 1740 

cctgtctcca ttgcctctgg gatgaagcta agggctcagg aagaccctct ctggcgacac 1800 

aggcttggtg catggaccca gtcacagggt tcctccctct gcaggacagt ccatggggtc 1860 

acacagtcac aacaggcagg gcggggcaga tccagaccct ctcaccattg gctctgagac 1920 

cagctttgga agccatagtc actgagcacc aacatgagag agggaactct ggggctgtgc 1980 

gccatgcacc tttggggctg tgacctgccc tctctgtgcc tccgtcccct cgcccctaac 2040 

gtgggtatga gggtgcccat ggaggagggg tcgggagggt taaaaggtgg gcacctctga 2100 

gcccaccgag cacagcagcc ctgtgatagt cagtgcagag gggcctggtg gctctggtga 2160 

cagcagggtg acgttcttcg tggtgtcacc acttccccag taccatagag tacctctgtg 2220 

cccttctgag ggaggggtca gtgggggcgg tggggtgcac ctctcccccg gggccatgct 2280 

tctgccccca cattcaaccc cgactccact gcctcatcca gacttccgtg gagtgacgct 2340 

cccggctctc agcgctgtgt ggtggctctc agtggggacg cctgcagcct tctctggccc 2400 

ctgccctgag tcccatcccc tccactcctt ccttcaaccc actatgtgcg gctgaggtca 2460 

caggggagcc agaggggctt ggcaggggtc tccagaggcc ctggacaaac actgcctgca 2520 

tctgagtgat tcagtgaccc agcacagtgg ccccagccgg tggcttcgcc cctccccctg 2580 

cccagcctcc tcccccgtcg ctgcactgcc cacctgctcc tcgcagtaca cccctgcagc 2640 

ccctctgagc agcccctgct cctgacagtt gccgtggcat tgcctggcca aaccaaggtc 2700 

ccggtggggg cccactgtgt ttgccaccca cgcctcctag tgtgtgtctt gggcacctgc 2760 

ttcccactgc ccatggggat ggctggagac cttctccacg cccagacgcc ctgcccacct 2820 

cactccaccc tccacctcag aggaaggctg gccccgacag gactcctgca ggttcctggt 2880 

gccgcgggcc ttgcttaccc agcagatggt ggccttgggc ctcttccctc tccccttccc 2940 

agttagtgat tttttgcctc tctcctgctt tctgtctctc tctgtctctc tctctctctc 3000 

tctctctctc tctctctctc tctctctctc tcattactgg ctccttccca gcgccttgaa 3060 

acaggttcaa gtctctcccg tcatcaaaaa ataataaccc tcagatcacc agccccagct 3120 

ggctgccacc tcccctgctt cacagcccca ttcctgaaag ccttgtcccc acggcatcct 3180 

cggtcccgca gcctcaccag gggctccccg acactcacgt cccaccaggc tgagatcagg 3240 

tcgtctccac atcccctgca gccagcactg gccggccgaa gggaggaggg gtcagccccg 3300 

agcaccgacc accccatcct gtccctggcc catgggtgtc ttaaccgccc atgcacacac 3360 

aaggtcctgc tggactgagg agacagtgtg tgttggagag gcgctagctt ccctacacca 3420 

cgcgggggct ggaacttgcc agcccagccc atgcctgtct gctgtgccat gctccagaaa 3480 

tggcctgaca gcctgcgcaa ctagggctgc cccgggaagg tcggaaccag gaacaccctg 3540 

ccagacagat gggacatcct gggagaatgg gcagagtctg gggcttccca gggtcacttt 3600 

tggctcacca ccccttcctg cctcagtggc ttgggggcgc catgctcctt ggagtgagac 3660 

cctggttcaa atcccagctc tgccgagggc tggttccata gctccaggcg agtcacgcat 3720 

gctctgagaa ctccacaatg tcaaaccccg ttagcaggat tgtggggttc tgtggatcat 3780 

gtgggacctg cctggcatgt ggaaggggct tggggtgcag ctgtgtccct ggccccagca 3840 

ctgccacacc cctcggtacc ttgacctcac ttttgcgctc tggagggcag aacctctgag 3900 

ccagctctca gggatacctt ttttcctcga tggccttttc tgggaactgc cctcgcctgc 3960 

tatctctgag ccctgcctgc ccttccccca aagccatcat agacccctac ctcctatcca 4020 

gatggagcct ggccatggaa tgcttggggt gagagggacc catagcgggc tccgggcatt 4080 

gttagggggc taggtggtgt ctgcagaggc cccacctgtg gcagggttta catgccgcca 4140 

cccctgccct cctcttgggt agctggggtc agaagggcct gaggcctcac tctctctgca 4200 

tcttagtttc ttcacatcta aaatgctatt catatgcctg cctgagggtg gttggggcct 4260 

ccgtgcagta agtggagaga agccttcagc cagaggaccc ccatatgctt aaacatccag 4320 

acaccaaggg ccactcacag acagcagccg gcttgtgttt cgtcatccct tttgggtcag 4380 

gaactgggga cgctggactc cagatctcca ggaggccatc ccagactagg cgggaaggag 4440 

gcctcagccc tgagctgaaa acctgaaatt ctagaataaa ccccaggccc cactcccagc 4500 

atgggcagcg gggaggggag tgttctgaca tttctcagct gctgtggttg ggatggtgcg 4560 

cccaccctga gcatgtgggg tcctgtgggc ccctactcca gcctggaggc agctgcaggg 4620 

gtcagagacc agcctggagc cctcgggcag gttactcagc ctcagctctc attggtaacc 4680 

caggtaaaat aacagtaatc ccatcataag gttgctgtgg gcatgaggct tccagccatg 4740 

cctggcacat aggaagctat tgtcgtttgt ttcactggct tcgcacttgg gcagaggccg 4800 

ccgccctctc tgctgccttc tctacctcac cccccgcccc cagcacggtg gcactgtggc 4860 

tctggcgcct ggtgacttct ccctccccgg cctcaggttt gcttcactct tccaagggcc 4920 

ttcttggtgt gactgtcctg tcctggccac gtccactcca agcagctggt ccaggtgtgg 4980 

ggcagccccc aagctcagca gaggacagtt ttgagactta gaggaggacg gcatcagatt 5040 

tacatgaagt cagccatctc aagggattac cacgtcaccg cctcctctca caggtctgag 5100 

agctggtccc ctggagggtg caggtctcat cctaggtaga acagtcccag cctagcttca 5160 
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aagtgctcct gctgccagaa agggtagaaa accatgctga atgccttcac tgttgcctgg 5220 

aatgggagcc tcagtttcac catctgtaaa gtgagattaa tgaaagggac ccacagaggg 5280 

ctgttagagc cagtaggcgg ccaggcgcgg tggctcacgc ctgtaatccc agcactttgg 5340 

gaggctgagg tgggcggatc acgaggtcag gagatcgaga ccatcctggc taacacggag 5400 

aaaccccgtc tctactaaaa atacaaaaaa ttagcctggc gtggtggcgg gcacctgtag 5460 

tcccagctac tggggaggct gaggcaggag aatggcgtga acccgggagg cggagcttgc 5520 

agtgagccaa gattgtgcca ctgctctcta gcctgggcaa cagagcaaga ctccatctca 5580 

aaagcaaaag aaaaaagagc cagtaggcca ggcgctatgt cttacacctg taatcccagc 5640 

actttgggag gcctaggcag gcagatcacc tgaggtcggg agttcgagac cagcctggcc 5700 

gacacagtga aactccgtct ctactaaaaa tacaaaaatt agctgggcat ggcagcacat 5750 

gcctgtaatc ccagctactc gggaggctaa ggcaggagaa tcacttgaac acaggaggca 5820 

gcggttgccg tgagcggaga tcacaccatt gcactccagc cccgggggga cagagtgaaa 5880 

ttctgtctca aaaaaaaaaa aaaaaaaaaa aagcaagcca gtgggcaagg acacacacca 5940 

cgcccagcac accagaagca tgcagcagat gctggctggt agccacagcc atggtagtaa 6000 

ctggcatcac tggcccaagg gggtctccct acccaatgta ccagcccaga gggtgctaaa 6060 

tgacccatgt ttgtgggcat gtgtacccca gtgctgaacc tccttctggt agattttcaa 6120 

gcaccatttt ggagaaggga ggcagggacc acagggggcc agccagcctg tacaggtttt 6180 

ttgtgttttt tttgtttgtt tgtttgtttg ttttttgagg cagggtctca ctcttgccca 6240 

ggctggagta caggcagtgg cgtcatctcg gctcactgca gcctcaacct cccaggctgc 6300 

agcaatcctc ccacctcacc ttcctgagta gctgggacta caggcacaca ccaccaaccc 6360 

tcactatttt ttgtagagac agtttcacca tgttgcccag gctggtatca aactcctgga 6420 

ctcaggcgat cctcccacct tggcctccca gagtgctggg attacaggca tgagccactg 6480 

cacctggcct accagcctgt aaagcttgag ggctgggctc caactggaga ctcacctgcc 6540 

tttcctttct cttcatcaga tgtgttccag ctgccaagaa gccggggcca ccattgggtg 6600 

ctgccacaaa ggatgcctcc acacctacca ctacccgtgt gccagcgatg 6650 

<210> 611 
<211> 460 
<212> DNA 
<213> Homo sapiens 



<210> 612 
<211> 400 
<212> DNA 

<213> Homo sapiens 
<400> 612 

accctcttca agaggatgtc ttctcccaag aaagccaagc ccaccaaggg caatggcgag 60 
cctgccacaa agctcccacc cccggagacc cccgatgcct gcctcaagct cgcctctcgg 120 
gcagccttcc agggggccat gaagaccaag gtgctgccac cccggaaggg ccggggcctg 180 
aagctggaag ccatcgtgca gaagatcacc tcgcccagcc tcaagaagtt cgcatgtaaa 240 
gcgccagggg cctctcctgg taatcctctg agcccatccc tttccgacaa agaccgtggg 300 
ctcaagggtg ctgggggcag cccagtgggg gtggaagaag gcctggtaaa tgtgggcacc 360 
gggcagaagc tcccaacttc tggggctgat ccgttatgca 400 

<210> 613 
<211> 400 
<212> DNA 
<213> Homo sapiens 



<400> 611 



aggctggagt gcagtggtgc gatcttggct cactgcaacc tccgcctccc gagttcaagc 
gattctcctg cctcagcctc tcaagtagct gggactacag gcgtgcacca ccacgctcag 
ctaatttttg tattttttgt agagatgggg tttcactgct ttggacagga tggtctccat 
ctcttgacct cgtgatccgc ccgcctcagc tgcctaaagt gctgggatta caggcgtgag 
ccactgtgcc cggcccctag tagcgttttt aatgtgtggt cttgagcaag ttggttgata 
cc tctgcaca cagtttccct acc tgtatga tggagatgat aatagcccct tctctgcaga 
gctgctggga ggagagtgaa ataatgaaca ctacccacac ggtgctgcct cagtcctatt 
ttgggtccag cctctgtgac ctctccccac acggtgctgc 



60 
120 
180 
240 
300 
360 
420 
460 



<400> 613 
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accctcttca agaggatgtc 
cctgccacaa agctcccacc 
gcagccttcc agggggccat 
aagctggaag ccatcgtgca 
gcgccagggg cctctcctgg 
ctcaagggtg ctgggggcag 
gggcagaagc tcccaacttc 

<210> 614 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<400> 614 

aggctggagt gcagtggtgc gatcttggct cactgcaacc tccgcctccc gagttcaagc 60 

gattctcctg cctcagcctc tcaagtagct gggactacag gcgtgcacca ccacgctcag 120 

ctaatttttg tattttttgt agagatgggg tttcactgct ttggacagga tggtctccat 180 

ctcttgacct cgtgatccgc ccgcctcagc tgcctaaagt gctgggatta caggcgtgag 240 

ccactgtgcc cggcccctag tagcgttttt aatgtgtggt cttgagcaag ttggttgata 300 

cctctgcaca cagtttccct acctgtatga tggagatgat aatagcccct tctctgcaga 360 

gctgctggga ggagagtgaa ataatgaaca ctacccacac ggtgctgcct cagtcctatt 420 

ttgggtccag cctctgtgac ctctccccac acggtgctgc 460 

<210> 615 
<211> 518 
<212> DNA 

<213> Homo sapiens 
<400> 615 

atgattttta aacagatttg gcacaggagt gcctttctgg gtttagggaa gtggtggaca 60 
aggcaggaga gaaccacatt catcttctcc tcttgtgttt gtcttctgtc tttcaataac 120 
gtccatgaac tgtgaggtta gtgtcttggc tgagagataa gtatggcttg gcattgattc 180 
ttctgttgtt acctcaagct gttttctagt ccccaagaac agcactctca gtgggtgtgg 240 
aagtgggcgg gacatgaagc aatggtttta cattgcattg cctggctaca gcttggcatt 300 
tctttccttt ttctttttct ttgcgtcatt gccattggtg ccactaattt tgcttcccct 360 
ctcttttata acttgtttct tcgggagttg cctagagtct ctgcattata tcttatttgg 420 
tattgaggca gtgtgttctt ggccaataac ctaggagatg atatctgttc atcttacagg 480 
tttagtgctg gaggaattca ttaaaaataa taataaaa 518 

<210> 616 

<211> 3598 

<212> DNA 

<213> Homo sapiens 

<400> 616 

atggggcggc cctggccaga agcggaggag gtggcacccg ggaccgagct ggggtcttgg 60 

aggaagagag ggtgagggga atacagtact gggggtgaga gaagggttgg acagaagagg 120 

gtcgggtatc tgggcatgcg cagggccgca ggactcttgg tggggtagcg agggggacgg 180 

tcccacgact gctccgaagg gccgggactc ccagtggggg cgggaccccc ggagtgcccg 240 

cctgcggact cccaagcctg gagcctgggg agagggtggg cacctccgtt cccgcacacc 300 

cgtccatggt gtgtgcgccg agcgtccagg agccacggcg gtgtcttcct tgcgcgtctt 360 

ttacacgtgt gggggtaggc tgctcctcgg ggctgagccg tggccagggt tatggagagg 420 

cccgcctctc cccagatggc gtcgtcgagc cctgactccc catgttcctg cgactgcttt 480 

gtctccgtgc ccccggcctc agccatcccg gctgtgatct ttgccaagaa ctcggaccga 540 

ccccgggacg aggtgcagga ggtggtgttt gtccccgcag gcactcacac tcctgggagc 600 

cggctccagg tgggttagac tttatggggt gctgggaggt gtggcagatc tctgcattct 660 

tttaagacct tcttccctgc tccccacacc tgggaggctg ccagaagtag tggaagagca 720 

tgagctttag gtctaccaac ctggacctga gctcattatg tagcctcagt ctacctcagt 780 

ttcctcttct gtaaaatggg aatgagacct tcctcaaagg gatctataag gtaattggca 840 

gagtgccaca ctcgaggggc cgcccctcta tgagtggacg cttcctttct cctccctcac 900 

ctccagtgca cctacattga agtggaacag gtgtcgaaga cgcacgctgt gattctgagc 960 



ttctcccaag aaagccaagc ccaccaaggg caatggcgag 60 

cccggagacc cccgatgcct gcctcaagct cgcctctcgg 120 

gaagaccaag gtgctgccac cccggaaggg ccggggcctg 180 

gaagatcacc tcgcccagcc tcaagaagtt cgcatgtaaa 240 

taatcctctg agcccatccc tttccgacaa agaccgtggg 300 

cccagtgggg gtggaagaag gcctggtaaa tgtgggcacc 360 

tggggctgat ccgttatgca 400 
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cgtccttctt ggctatgggg ggctgagatg ggcgccaacg agcatggtgt ctgcattggc 1020 

aacgaggctg tgtggacgaa ggagccagtt ggggaggggg aagccctgct gggcatggac 1080 

ctactcaggt gcagaccctg cccttcctca tctgcctgac acaccagaaa tctaggggct 1140 

gagttttgac ctgggcccat ccatccctcc cccagcctgg ttcacagggg cctcctcctc 1200 

tctgcagact ttgcccttgt gccttcgtga agaaggctgc agcagcagcc acctttgggc 1260 

ctctcctggc ccagaaatag agcagtggtt atttatttat tttcagttga aatcttacat 1320 

agagtcccaa tttataaaac cagtaatagt ggatggagca gctctgggtg gagagaggtt 1380 

aggggctcag gacacttctc tctctgaaac tgtgtttcag gccttttcat ggaaccctca 1440 

aagcatagca catggaacca gagcattgaa agccactggc ataattgcat ttgatacgat 1500 

cagtgccagc agatttgctg tgtgacttgg gcctatcacc aaacqtctcc gggcctcttc 1560 

tgtcccctgg agcttctgcc accagccatt gatccctctg tcacccttct gtcccttggc 1620 

cctctctttt gccaggctgg ctttggaacg gagcagctct gcccaggagg ccttgcatgt 1680 

gatcacaggg ttactggagc actatgggca ggggggcaac tgcctggagg atgctgcgcc 1740 

attctcctac catagcacct tcctgctggc tgaccgcact gaggcgtggg tgctggagac 1800 

agctgggagg ctctgggctg cacagaggat ccagggtgag gtgttccctt tctcccagct 1860 

ttgggaagtg ggagagatgg taggggcagg gaggggcccg atccaggtgc aagcctgtca 1920 

ggacatccag ggagatggga gatgagccca cttgggaatt ctcctcccct tcacttggtt 1980 

aagtcttccg tgtgctacag cctgttgctt cctctgggaa gccttcctgg acttccctgg 2040 

gtggtcaggt ttcctgctta tatgcaagca ggtacttctt tttcatcgca ttcaacacag 2100 

ttgcatgctt acatttatct ctgtgattat tttgtctgcc tcccccacca aaacgtaggc 2160 

tccatgaggg taggtagttc tcttctccac catgttctca gcacctcgcc cagtgcctgg 2220 

catagaatag atgctcaatg gtaaatgaac cactccccga tctcctccac agagggggcc 2280 

cgcaacatct ccaaccagct gagcattggc acggacatct cggcccaaca cccggagctg 2340 

cggactcatg cccaggccaa gggctggtgg gatgggcagg gtgcctttga ctttgctcag 2400 

atcttctccc tgacccagca gcctgtgcgc atggaggctg ccaaggcccg cttccaggca 2460 

gggcgggagc tgctgcggca acggcaaggt tagtgaacgg tggagggggc tgggggccag 2520 

gagggccaca gcagtgccag ccactctccc ctcccacagc ttccccctct actccttggc 2580 

agggggcatc acggcagagg tgatgatggg catcctcaga gacaaggaga gtggtatctg 2640 

tatggactcg ggaggctttc gcaccacggc cagcatggtg tctgtcctgc cccaggatcc 2700 

cacgcagccc tgcgtgcact ttcttaccgc cacgccagac ccatccaggt gggaagaatg 2760 

agggtgggga aggctgggga gaagagagga tctgatatat ctccgtgctt ccatctgtgc 2820 

ccctctaggt ctgtgttcaa acctttcatc ttcgggatgg gggtggccca ggccccccag 2880 

gtgctgtccc ccacttttgg agcacaagac cctgttcgga ccctgccccg attccagact 2940 

caggtagatc gtcggcatac cctctaccgt ggacaccagg cagccctggg gctgatggag 3000 

agagatcagg tatcccccag ggagtagggg ctaccttgag gggatgatag acctccccca 3060 

ctcccagtgg gactctggaa atatgaagga actagggagt ggaagagatt tcagagctgg 3120 

ggagaggagt tcctcccttc aaagccagca actgcctttg gggaatgtcg gggggtatct 3180 

cctttctcct gcttgtgtga ggtggtacac agtcccccct tcacctggcg ggaagcctgt 3240 

cccggacaga ctcatctcag ctttcccttg gggcaggatc gggggcagca gctccagcag 3300 

aaacagcagg atctggagca ggaaggcctc gaggccacac aggggctgct ggccggcgag 3360 

tgggccccac ccctctggga gctgggcagc ctcttccagg ccttcgtgaa gagggagagc 3420 

caggcttatg cgtaagcttc atagcttctg ctggcctggg gtggacccag gacccctggg 3480 

gcctgggtgc cctgagtggt ggtaaagtgg agcaatccct tcacgctcct tggccatgtt 3540 

ctgagcggcc agcttggcct ttgccttaat aaatgtgctt tattttctct tcagtgaa 3598 

<210> 617 

<211> 5689 

<212> DNA 

<213> Homo sapiens 

<400> 617 

tttagaatca ggtggctcac tgagctctgt attttgtttc ctggagcttt cactggtttc 60 

ttcccctgag ataccccaag tgacatgaaa agcatactca gggcctagag acactttact 120 

ggggatgggc ttctgtcaca ggtcagaggt ctgagaagag gggcaggccc cactcctctc 180 

cactagtaga gaaaggttga cagagaatca tttcttgctt ctcttggccg tagttttggt 240 

tgtgctgggg gcctcagcca cagaggcctt gggggctgtg gctgctcgtg cccccttcct 300 

tccccagaaa gagctttggt ggcccctggg aatcagactg catggtttct tggtgggaga 360 

ggaggcctgg ggtgaggaga cggcctcagg gactgtctcc tccccttgcg caggagtggc 420 

agaagggctg ctgtccccag ccatgggcac cccaggtagc aggggcaggt cggtgagggt 480 

gggctgcatc tccatcctca gcaggtgctc tgtcagggcc gtctgttgcc ggtgctccct 540 

gtgcctgctc agctcctgct ccagctcctt gaggaagcct gggaggggcc gggggtggag 600 
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ggtacagggt gggtggagcc ctgggctcag caggagggtc cctgggctca gggaagtctc 660 

tggctggccc cttgtccctt gtgggaagga gcctgaggct ggggcccagg actgacacct 720 

ggctctggcc cagatgttga tctgaacttg gggctcccct ccccggacgc cactgccacc 780 

ttagcttccc tcatgtccca cagggcagag gtgggctctg gggaggctga aaaccttgga 840 

aagcagggtc acctcgttct gagcagaatg ggccactcag ctctgggaac tcctcatcgc 900 

ttgaggcttc atcctcctca tccgaaatcc agcgctccac cacaggctgc ccgtccaggt 960 

ccaggagaag tggcagggct tctgtcacca gctcgctgca gagcagaagg agcagaggtt 1020 

acccaggagg gcaccctggc gtggaggatg caaagacacg caccacagca ctcacacccg 1080 

gaggggtggt ggctcatgag ctaggtagga ggtgggggag aaggtgtcat ggacaggacc 1140 

ccaggtttgc aggctgagag gggtctgggc tgagttagca ggtagagctc agccatgacc 1200 

tttcttccca ccctccactc cttaccggta gccatcctgg ttggtgcagc tgtttccaga 1260 

caggttgagg atgagaaggc tctgggggaa ctcatctgca cacagcaaag agagaggaaa 1320 

tggggttctc acttgatctt agccaaaaga ccatgaagcg atgggaatgg ggttctcata 1380 

cccactttct agtccaaagc cagtacctct aggcacaacc ccttgaactc cctgaggatg 1440 

aaatgtggac ctgcaggggc atggctaata gaagcatggg cagatgggtg cagaggggag 1500 

ggcaggcccc acaggggatg tgtttggtga gccagggcga gtgggttcct acccagcttc 1560 

aatgtttcta tcaggttctc agaaaggtcc agaaactgga ggcatgggag gtcgaggagg 1620 

ttttccacct gcctgatttg gtttcctgcc agagacagga agctgtaggg gaaggaaggg 1680 

gtacagaggc agagctgaat catggaacag ctggaagaaa agaggagggc acctgacacc 1740 

ggggcccctg ggctaggcag ctgatgacca gtggggacac atggccttct ggggtgaggg 1800 

agtagatcta ggcttgctgg tccttgactg acaaggaccc aggcttggag aaatctgaac 1860 

cagttctttg ggatatggct aggttccccc aacccctctt cccctgcctg agccctggca 1920 

ccacataccg caaggagggg atgcaagcca ggttctcaat ttgctggatc ttattctgca 1980 

agaagaaacc gaagtgggga aaccaagctg agatcagatg cgtcaggggc cctttacggg 2040 

gacaaagccc caagaccatg tctactctgc agctggagat ctggcacttt gggaatgatc 2100 

accccactcc ttgaacaggt tgccactacc ttaagaaaga tgattttctt tccccatgtg 2160 

gatggttaag ggtgtaaacc tggggtatct aaacctgact acgtatactc ttctgattcc 2220 

acaaccaaag agtgacctgg gaatcagaaa aggaaactaa gaagccacca ggaaaacgga 2280 

aagggccttg gtggtggctt gcatacacag agagaacaat gagcatacag gaacttgacg 2340 

gttaaagttt ctatggggaa agttgaagcg agctgggaat actcaaccag gggaaggctg 2400 

aagatcaacc aaccaccacc accaccacca aacagccttc aagtaggatc tgaatgatgt 2460 

acaggaatct tcatagtttc ctcttcttca aaacaggaaa tgggctggag ctatttaacg 2520 

tgaaagattt agagtagatg ccaaaaagaa cttctagatc aagagaagtg cctaaagaac 2580 

agtggaagga gcacaggtca tgttcaggat ctattttgag ggaattagtt tttcaattca 2640 

ggaccctatg tcattcgtct gggaatggtt taaatgaagg tctgcctgcc tgaacaccag 2700 

ggaatgggca ggatgacctg gttggctcct ttcagatgag acagattagg tccacagggg 2760 

tcaaggggag gggaagggtg gagaaagaag ttaccccttg cagatagaga ctgtgaagat 2820 

tctggaggcc ttctaagttc ctgatagtag taatcccctc ccggtccagg cggacagtct 2880 

gcagttcatc aagagtgtga aacctgggaa aagagtcagt aagggtggtg ttagagcacc 2940 

actcatgtgg tctcttagca gggaggatgg gagacggagg agggaagggg tttcaaggag 3000 

gggtagttgg ctgagtcatt ttaatcagat gtgagaaaca tcttgacatc ttgaggtggt 3060 

gtctagatca tgaaaccggc ttgacttgca gactcgtatc tctattgtta gcactgggaa 3120 

gggtgagaga gaggagtaag ggccctctgg gaaattgggg gatggcagct tatggtttct 3180 

ggagagattc agaggcaaga gaaaccttcc tctggccagc ttggatggtg gaggcagggg 3240 

tgggacagag atgagacact gcagaaagat ccttcctcac cttccttttg agtcgttcat 3300 

ggagaaaaaa gcacttgttc catccaccca ttcattcatt ccacaacatt gctgagtgcc 3360 

tactagagac aaatgacaga gtctctgccc accactaata aagaagcagg gcaggctgac 3420 

catggtggct taccctgtac tcccagcact ttgggaggcc gaggcaggag gatcgctcga 3480 

gcccaggagg tcgaggctgc agtgtgccat gattatgcca ctgcacttca gcctgggtga 3540 

cagagcaaga ccctatgtca taaaaaaaaa aaaaaaagag gctgggcgag gtggctcact 3600 

gtaatcccag cactttggga ggtcgaggtg ggtggattgc ctgaggtcag gagttcaaga 3660 

ccagcctggc taacatggtg aaaccccatc tctactaaaa atacaaaaaa aattagccgg 3720 

gtgtggtggc gcacacctgt agtcccagct actcgggagg ctgaggctgg agaatcgctt 3780 

caactcagga ggcagacgtt gcagtgagcc gagatcacac cattgcactc cagcctgcgc 3840 

aacagagtga gactccgtct caaaaaaaaa aaaaaaaaat aggaagaagg ccaaacatct 3900 

atgacaagtg gataacagag gcaagtacag gggaaaggaa gaaggaggag gaagaaggga 3960 

aagaaggagg aagaggaaga agaaagaaag aaagaaagaa taaggccaaa catccctgac 4020 

aagtggataa cagaagcaag tacaggggac aaagggagta cataggctgt gcactaaatt 4080 

cacagtagga ggaatcaggg aatgcttcct agaggaggtg acagatgagt aggcattagc 4140 

catgaaggtg gggggatatg gggagaaggc atttcaagca gaaggaatag tacatgctaa 4200 

tacagccctt ccgaaactcc aatatgccca tgcagattct aattcagtag atcgggtgga 4260 
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ggggctgaga tgctccactt ctaacaagcc ccctgtgatg ccaatgctgc tgtcctaccc 4320 

ctgcaccccc tcccatccac acatactctg agtagtaagg tactaaggtg tgagtacaca 4380 

gtgtgggaaa ttgtacactt gtggagagtg gccagaaata aggctgaaaa gcagaagtca 4440 

actcatgctt aggcattggg atttatttgg gaggaagttt ctatcagtga atgcctgatt 4500 

agatttgtta tttaaaagga tcactttggc tactcaggag gctgaagtgg gaggattgtt 4560 

tgaggagttc aagaccagtc tggccaacag agcaagaccc catctctaaa aaagtaatta 4620 

aaaatacttt actttttgtt tgttttagaa atagggtctc accctgttgc ccaggctggc 4680 

atgcaatggc atgatcatag cttactgcag ccacaggtac ctgggttcaa gtgatcctcc 4740 

tgtctcagcc acctaggact acaggtgtgc accaccatgc tcagctagtt tatttttatt 4800 

ttttagagat aggattctgt ctctattgcc caggctggtc tcaaactcct gggctcaagt 4860 

gatcctcctg cctcagcctc ccaaagtggt gggtgtgtag gagagaggtg aacacggcct 4920 

tatctaagac agttgagtga ggatggtgaa aaagaaatgg aattattttg aagaagggaa 4980 

aatcagctgg gcattaccac tgattgaatg tgtggagtaa ggagagaaac aaagatcagt 5040 

tgacaaatca gtacacgtca gggacctggt catcctgagt gtttcagcct tctagcaccc 5100 

cttttctccc ccatgcactc acatcttctc tgacagttcc ccatcttcag ggaaagtcaa 5160 

gttccgctta gtgataaggg cttcagtgat gcagacgccc ccttcctctg gaccctgggc 5220 

tgacttccct gtgaaaagat gagtccaact gtgacacttc ctcactcttg gaggccttac 5280 

cccgctgttt tccaactgct ctacccaccg tcccacctcc ctactcacct ccagacatga 5340 

tctaaaataa aaggctgctg gtctgaggcg ggagaggaac gaaaagagag gtcttggcgg 5400 

cccctaagga tggcagaact caggatggca ggaggagaga gaaactcaga gacttaggag 5460 

aggaggaaag ggggttgatt cagagaaaat tgctggggtg aggtcgaaga aaacagtaaa 5520 

ttgatgtgaa gggtctggag tttgaggggt gtggaggggc tttgctggca gcaagctggg 5580 

gtgttgtggg caggaatggt tgagaaagga gcagttccta ggaagccgga gtcgttgcta 5640 

agagactgga cgccgagtgg ggaggtaaag gcgggctccg ttggcccgg 5689 

<210> 618 
<211> 776 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> (709) . . (709) 
<223> n equals a,t,g, or c 

<220> 

<221> misc_feature 
<222> (738) . . (738) 
<223> n equals a,t,g, or c 

<400> 618 

tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 60 

cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt 120 

tgcgcagcct gaatggcgaa tggcgcctga tgcggtattt tctccttacg catctgtgcg 180 

gtatttcaca ccgcatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa 240 

gccagccccg acacccgcca acacccgctg acgcgccctg acgggcttga ctgctcccgg 300 

catccgctta cagacaagct gtgaccgtct ccgggagctg catgtgtcag aggttttcac 360 

cgtcatcacc gaaacgcgcg agacgaaagg gcctcgtgat acgcctattt ttataggtta 420 

atgtcatgat aataatggtt tcttagacgt caggtggcac ttttcgggga aatgtgcgcg 480 

gaacccctat ttgtttattt ttctaaatac attcaaatat gtatccgctc atgagacaat 540 

aaccctgata aatgcttcaa taatattgcc aaaggaagag tatgagtatt caacatttcc 600 

gtgtcgccct tattcccttt attgcggcat tgagcctgtc tgtttttgct cacccagaaa 660 

cgctggtgaa agtaaaagat gctgaagatc agttgggtgc acgagtggng tacatcgaac 720 

tggatctcaa cagcggtnag atcctcgaga ggtttcgccc ccgaagaacg tttttc 776 

<210> 619 
<211> 878 
<212> DNA 
<213> Homo sapiens 

<220> 
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<221> misc_feature 
<222> (741) . . (741) 
<223> n equals a,t,g. 



or c 
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<400> 619 

gaaaaccctg gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg 60 

cgtaatagcg aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc 120 

gaatggcgcc tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcata 180 

tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagcc ccgacacccg 240 

ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa 300 

gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc 360 

gcgagacgaa agggcctcgt gatacgccta tttttatagg ttaatgtcat gataataatg 420 

gtttcttaga cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc tatttgttta 480 

tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg ataaatgctt 540 

caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc ccttattccc 600 

ttttttgcgg cattttgcct tcctgttttt gctcacccag aaaacgctgt gaaaagtaaa 660 

gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct caacagcggt 720 

aaaaaccttg agagttttcg nccccgagaa cgtttttcaa tgatgagcac ttttaaagtt 780 

ctgctatgtg gcgcggtatt aatccctatt tacgcccggg cagaagcact cggtcgccgg 840 

atacactatt ctagaatgac ttggttgagt actaacca 878 



<210> 620 
<211> 150 
<212> DNA 

<213> Homo sapiens 



<400> 620 

cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct 60 

ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc 120 

agcctgaatg gcgaatggcg cctgatgcgg 150 



<210> 621 

<211> 1299 

<212> DNA 

<213> Homo sapiens 



<400> 621 

tttttttttt ttttggcaaa taaagagtaa aacaggctat ttaaaacatc catttaaatg 60 

caaattttga tatcccaaga gaaaaatgtt aatcatttaa atagacagga ttatcgccca 120 

cccttaccac ttccctcctc tcccaagttt tagaaaacgt agccttagcc cacacaagtc 180 

aagtcagcca ggagtcctta catcttaaga actcccactc agatgagagg gctgaggcag 240 

atagaggggg acttttcctt cttttgagga aggagatgga aaagagagaa aatagtctaa 300 

catatcctat aagccaggca tggggcaata ttataaatac aaacacacac acacacacac 360 

acacatgcat gcacacacac acacccacac ttaatcatca caactgtctt ggaacgggga 420 

attatcatcc tcatctacag acaaggaaac tgaggcttag aaaggtgaag cagtttgccc 480 

cactggtagg tgacccagtt gggagtagag gcctccatag ccctcccagt ccatggtgac 540 

ctgcggtcct gtccagactc acccagcatc cccatgccaa gcatgaacgc gtccatggtg 600 

taaggcagtc cccgggcccg gcctcttgtc ccaggtggga acatgacttc tttcggctga 660 

gctttcttac attctacctg ttaaacagaa aggcgaacaa atgagatgca aatgaattcc 720 

acgtaaatgt caaatgcaga acctgtctgg ttatgagcta agttattttt agccttgctc 780 

.cttacccact gtaaaaatag ccatgccgag taaatggcct gcaaaataaa tacttaagaa 840 

aacacgccac ttgtgcaaat cctgcttcgg aaaaacctac aaggatgaaa ctgggttctg 900 

gaggggtggc aaggacgtaa gctgcagata tacacacaga ttgaagctca ggctgacgct 960 

cgtgcagaaa gcatgacaga taaggaggca gacaatggct gttggatgtg tctgtgaata 1020 

ataacaacaa tagcaataat aataatccct caggattttt cagcccttta cacatttaca 1080 

aatgcccctg acattctctc aggccagtga ctcttcatgc caggccatct gattcaaact 1140 

cttattaaac agacagacca cttagcctta gctccagttc acagatgaga aaacagagca 1200 

tcagaagttt tgtgttggct gggcgtgggg gatcatgttt gtaatcccag cactgtggga 1260 

ggtggaggca ggtgaatcag cttgagccca ggcgtttca 1299 



<210> 622 
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<211> 330 
<212> DNA 

<213> Homo sapiens 
<400> 622 

tttttaaaaa cattttctca tttattctca cttgtttctt cttccaggga taagttttag 60 

aaacatctgg ccaatttaaa aaaattacaa taggatttct ctttgaaact ctgttaagcc 120 

taaaaaattc atatgaggag aattaacatc atggccattt ttaatattct cctggcatct 180 

ttttctattt cataaagttc ctcaaaatag ttttacctgc ccttgttatt tcctcagaca 240 

agtagataca attctttgtc agaccacaag atgttttata aatttaaaat taagccagaa 300 

gtgatttaaa cagcagcaaa tctgaaaata 330 

<210> 623 

<211> 3328 

<212> DNA 

<213> Homo sapiens 

<400> 623 

ttgttttgtc tcaggttttt tttttctttt ttttttttga gagggtgtct cgctctgttg 60 

acaggctgga gtacagtagc acgatctcag gccactgcaa cctctacctt ctgggttcaa 120 

gagattctcc tgcctcagcc tcccaagtag ctgggactat aggcgcatgc caccacgccc 180 

agctaatttt tgtattttta gtagagacag ggtttcacca ttgttggcca agctggtctc 240 

gatctctttg accttgtgat ccacccacct tggcctccca aagtcctggg attacaggcg 300 

tgagccactg tgcccagcct ggctcaggtt tttcaataca gtcttgacct tggcattcag 360 

tatcctcaca gcatggttct aattaacttt ctagctctat ttcccttttc ctgctccctc 420 

tctctacaac tagtctttct ctgattgccc cgccctcaac ccatctaaac tagaccccag 480 

ggaagcacct tggtcccctt cctctctccc actcaccatc caaccaatca ccagagcctg 540 

tacattctat attttcaaca tcgattcaat tgtctacttc tttctagcct gccctctctg 600 

actgggactc cttgagccag cctgatcacc ccaatccatc cctcacactg tgcccatctt 660 

tctgaagtag gaatctgatc acaccaccct gctaaaaaca ctctggttct ccccacggca 720 

tgtggtgccc ttgtatagct ggcaaagcct tgcatggcac ggccccagcc tgtgcttcaa 780 

ctcaattgcc cgactctctc cagctctgct gagccaccta agtcacagat ggtttctcct 840 

ctcatctctg ctctcttcca tgtgccattt ctgtggcttg gaatgttctt ccctcattct 900 

ctttctggcc ctttcccgtc acaccttaga cgtgcatctt cctctcgaaa acctctagtg 960 

aagcctccca gggccaggca gtaccctcct ctggcttctt ctggatacag aggaagaatc 1020 

tgagcatcga ttctccatct cagcaggcct ctgtgtgcct gctgactccg actagaccag 1080 

agatccgtaa ggacagggat cgagtttttt ttcttttaat tcactgcctc aaaaatcctc 1140 

tgtgcattac ctattcatcc tcttctctcc cttaacctga accagtgatc ttactgtctc 1200 

catcattgtt tttttctttt cttttctttt cttttttttt tttgaggtgg agtctggctc 1260 

ttcacccagg ctggagtgca gtgatgcgat ctcgactcac tgcaacctcc atctcctggg 1320 

ttcaagcgat tctcctgcct cagcctcccc agtagctggg attacaggca tgcgctacca 1380 

tccccaacta atttttgcct ccataatttt gccttttcta gaatgtcata caggtggaat 1440 

tactcagtat gctgcctttt tcagattggc ttctttcact tagtaatatg tttgtttttt 1500 

gagacagggt cttgctctgt cgcccaggct agagtgtggt ggtgcgatct tagctcactg 1560 

aaacctccac ctcccaggtt caagtgactc tcctgcctca gcctcccgag tagctgggac 1620 

tacaggcaca tgccaccata cccggctaat ttgtggattt ttagtacaga cggggtttca 1680 

tcatgttggc cagggtgttg ttgaattcct gacctcaagt gatccacctg cctcagcctc 1740 

ccaaagtgtt gcgattacag gtgtgagcca ctgcgccaag cc tea tt tag taatatgeat 1800 

ttaaactttc tccatgtctt taatggcttg atagctcatt tatttttatc atggaatatt 1860 

tcattgtctg gatggaccac agtttatttc tccattcacc tactgaagga catcteggtt 1920 

gcttctaagt tttggcaatt atgaataaag ctgetataac catcaagtgc aggtttttgt 1980 

gtggacctat tatcaactaa ttcgggtaaa tctcaaggag tgeaattget ggatcacaca 2040 

gtaagagtgt gtttagtttt aagtggctgt gecattttge attcccacca gcaatgaatg 2100 

agagtttctg ttgctccaca ttctcactac cattcggtgt tgtcagtgtt ttgcattttg 2160 

gecattctag taggtgttta catggtatct agtcatttga atgggcatat gatgtggaac 2220 

atcttttttt ttttaatttt attattatta tactttaagt tttagggtac atgtgcacaa 2280 

cgtgcaggtt tgttacatat gtatacatgt gccatgttgg tgtgctgcac ccattaacta 2340 

gtcatttagc attaggtata tctcctaatg ctattggaac atcttttcat gtgtttattt 2400 

gccatctgta tatcttccct gatgagttgg ggatgeatte tttccatctc agagtcccca 2460 

gaaactaaca tagcagttgg tacagagttg gtgetcaaca aacatcagct taggaactat 2520 

gtcctatgtt tttttgtttt tttttttttt taaaaaggaa tgtgagctgt tcccaaaacg 2580 
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tatgtccttc ccccatgcct ctaccctgcc cttccacaaa ctttctgatc ttcagcacac 2640 

actacccaac catcaaggct gagacttccc gtggccagca gtgtctcatg ctggcttcaa 2700 

gccccacagc actgcttttt tcaacttctc ttgtggttta gactgtcttt agcccagcaa 2760 

gagaattcat tgtcttatcc cccattaaac tgtacctaca ctctttgagg aaaagggtcc 2820 

atcttactta aaatatttta aaaattcaca tgtgataaaa ctgatgtgta tgtgtatgag 2880 

agagagagaa agagagaaca gttctatgag gcttaatgta tacatgtggc tgtgcagccc 2940 

actgccccat gtcaccattt tcaattctgt cattacccct gggacaatgt cttgtcaata 3000 

tacacttgag gctgggcggg gtggttcctg cctgtaatcc cacctgtaat caggagttca 3060 

agaccagcct ggccaacaca gcaaaacccc gtctctacta aaaatacaaa aatttgctgg 3120 

gcgtggtggc acgggcctgt aatcccagct actcaggtgg ctgaggcagg acaatcgctt 3180 

gaaccgggga ggtggaggtt gcagtgagcc aagatcgtgc cactgcactc cagcctaggc 3240 

aacagagcga gactctgtct caaaaataca tacatacaca cacacataca cacacacgcg 3300 

caagagacac agtgagtaga aacaccga 3328 

<210> 624 

<211> 3051 

<212> DWA 

<213> Homo sapiens 

<400> 624 

tttttttttt tgagagggtg tctcgctctg ttgacaggct ggagtacagt agcacgatct 60 

caggccactg caacctctac cttctgggtt caagagattc tcctgcctca gcctcccaag 120 

tagctgggac tataggcgca tgccaccacg cccagctaat ttttgtattt ttagtagaga 180 

cagggtttca ccattgttgg ccaagctggt ctcgatctct ttgaccttgt gatccaccca 240 

ccttggcctc ccaaagtcct gggattacag gcgtgagcca ctgtgcccag cctcggctca 300 

ggtttttcaa tacagtcttg accttggcat tcagtatcct cacagcatgg ttctaattaa 360 

ctttctagct ctatttccct tttcctgctc cctctctcta caactagtct ttctctgatt 420 

gccccgccct caacccatct aaactagacc ccagggaagc accttggtcc ccttcctctc' 480 

tcccactcac catccaacca atcaccagag cctgtacatt ctatattttc aacatcgatt 540 

caattgtcta cttctttcta gcctgccctc tctgactggg actccttgag ccagcctgat 600 

caccccaatc catccctcac actgtgccca tctttctgaa gtaggaatct gatcacacca 660 

ccctgctaaa aacactctgg ttctccccac ggcatgtggt gcccttgtat agctggcaaa 720 

gccttgcatg gcacggcccc agcctgtgct tcaactcaat tgcccgactc tctccagctc 780 

tgctgagcca cctaagtcac agatggtttc tcctctcatc tctgctctct tccatgtgcc 840 

atttctgtgg cttggaatgt tcttccctca ttctctttct ggccctttcc cgtcacacct 900 

tagacgtgca tcttcctctc gaaaacctct agtgaagcct cccagggcca ggcagtaccc 960 

tcctctggct tcttctggat acagaggaag aatctgagca tcgattctcc atctcagcag 1020 

gcctctgtgt gcctgctgac tccgactaga ccagagatcc gtaaggacag ggatcgagtt 1080 

ttttttcttt taattcactg cctcaaaaat cctctgtgca ttacctattc atcctcttct 1140 

ctcccttaac ctgaaccagt gatcttactg tctccatcat tgtttttttc ttttcttttc 1200 

ttttcttttt tttttttgag gtggagtctg gctcttcacc caggctggag tgcagtgatg 1260 

cgatctcgac tcactgcaac ctccatctcc tgggttcaag cgattctcct gcctcagcct 1320 

ccccagtagc tgggattaca ggcatgcgct accatcccca actaattttt gcctccataa 1380 

ttttgccttt tctagaatgt catacaggtg gaattactca gtatgctgcc tttttcagat 1440 

tggcttcttt cacttagtaa tatgtttgtt ttttgagaca gggtcttgct ctgtcgccca 1500 

ggctagagtg tggtggtgcg atcttagctc actgaaacct ccacctccca ggttcaagtg 1560 

actctcctgc ctcagcctcc cgagtagctg ggactacagg cacgtgccac catacccggc 1620 

taatttgtgg atttttagta cagacggggt ttcgtcatgt tggccagggt gttgttgaat 1680 

tcctgacctc aagtgatcca cctgcctcag cctcccaaag tgttgcgatt acaggtgtga 1740 

gccactgcgc caagcctcat ttagtaatat gcatttaaac tttctccatg tctttaatgg 1800 

cttgatagct catttatttt tatcatggaa tatttcattg tctggatgga ccacagttta 1860 

tttctccatt cacctactga aggacatctc ggttgcttct aagttttggc aattatgaat 1920 

aaagctgcta taaccatcaa gtgcaggttt ttgtgtggac ctattatcaa ctaattcggg 1980 

taaatctcaa ggagtgcaat tgctggatca cacagtaaga gtgtgtttag ttttaagtgg 2040 

ctgtgccatt ttgcattccc accagcaatg aatgagagtt tctgttgctc cacattctca 2100 

ctaccattcg gtgttgtcag tgttttgcat tttggccatt ctagtaggtg tttacatggt 2160 

atctagtcat ttgaatgggc atatgatgtg gaacatcttt ttttttttaa ttttattatt 2220 

attatacttt aagttttagg gtacatgtgc acaacgtgca ggtttgttac atatgtatac 2280 

atgtgccatg ttggtgtgct gcacccatta actagtcatt tagcattagg tatatctcct 2340 

aatgctattg gaacatcttt tcatgtgttt atttgccatc tgtatatctt ccctgatgag 2400 

ttggggatgc attctttcca tctcagagtc cccagaaact aacatagcag ttggtacaga 2460 
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gttggtgctc aacaaacatc agcttaggaa ctatgtccta tgtttttttg tttttttttt 2520 

tttttaaaaa ggaatgtgag ctgttcccaa aacgtatgtc cttcccccat gcctctaccc 2580 

tgcccttcca caaactttct gatcttcagc acacactacc caaccatcaa ggctgagact 2640 

tcccgtggcc agcagtgtct catgctggct tcaagcccca cagcactgct tttttcaact 2700 

tctcttgtgg tttagactgt ctttagccca gcaagagaat tcattgtctt atcccccatt 2760 

aaactgtacc tacactcttt gaggaaaagg gtccatctta cttaaaatat tttaaaaatt 2820 

cacatgtgat aaaactgatg tgtatgtgta tgagagagag agaaagagag aacagttcta 2880. 

tgaggcttaa tgtatacatg tggccgtgca gcccactgcc ccatgtcacc attttcaatt 2940 

ctgtcattac ccctgggaca atgtcttgtc aatatacact tgaggctggg cggggtggtt 3000 

cctgcctgta atcccacctg taatcaggag ttcaagacca gcctggccaa c 3051 

<210> 625 

<211> 3D29 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> 

<223> n equals a,t,g, or c 
<220> 

<221> misc_f eature 

<222> (4) . . (4) 

<223> n equals a,t,g, or c 

<220> 

<221> mi sc_f eature 

<222> (150) . . (150) 

<223> n equals a,t,g, or c 

<400> 625 

ntancccatt gtgcccagcc tggctcaggt ttttcaatac agtcttgacc ttggcattca 60 

gtatcctcac agcatggttc taattaactt tctagctcta tttccctttt cctgctccct 120 

ctctctacaa ctagtctttc tctgattgcn ccgccctcaa cccatctaaa ctagacccca 180 

gggaagcacc ttggtcccct tcctctctcc cactcaccat ccaaccaatc accagagcct 240 

gtacattcta tattttcaac atcgattcaa ttgtctactt ctttctagcc tgccctctct 300 

gactgggact ccttgagcca gcctgatcac cccaatccat ccctcacact gtgcccatct 360 

ttctgaagta ggaatctgat cacaccaccc tgctaaaaac actctggttc tccccacggc 420 

atgtggtgcc cttgtatagc tggcaaagcc ttgcatggca cggccccagc ctgtgcttca 480 

actcaattgc ccgactctct ccagctctgc tgagccacct aagtcacaga tggtttctcc 540 

tctcatctct gctctcttcc atgtgccatt tctgtggctt ggaatgttct tccctcattc 600 

tctttctggc cctttcccgt cacaccttag acgtgcatct tcctctcgaa aacctctagt 660 

gaagcctccc agggccaggc agtaccctcc tctggcttct tctggataca gaggaagaat 720 

ctgagcatcg attctccatc tcagcaggcc tctgtgtgcc tgctgactcc gactagacca 780 

gagatccgta aggacaggga tcgagttttt tttcttttaa ttcactgcct caaaaatcct 840 

ctgtgcatta cctattcatc ctcttctctc ccttaacctg aaccagtgat cttactgtct 900 

ccatcattgt ttttttcttt tcttttcttt tctttttttt ttttgaggtg gagtctggct 960 

cttcacccag gctggagtgc agtgatgcga tctcgactca ctgcaacctc catctcctgg 1020 

gttcaagcga ttctcctgcc tcagcctccc cagtagctgg gattacaggc atgcgctacc 1080 

atccccaact aatttttgcc tccataattt tgccttttct agaatgtcat acaggtggaa 1140 

ttactcagta tgctgccttt ttcagattgg cttctttcac ttagtaatat gtttgttttt 1200 

tgagacaggg tcttgctctg tcgcccaggc tagagtgtgg tggtgcgatc ttagctcact 1260 

gaaacctcca cctcccaggt tcaagtgact ctcctgcctc agcctcccga gtagctggga 1320 

ctacaggcac atgccaccat acccggctaa tttgtggatt tttagtacag acggggtttc 1380 

atcatgttgg ccagggtgtt gttgaattcc tgacctcaag tgatccacct gcctcagcct 1440 

cccaaagtgt tgcgattaca ggtgtgagcc actgcgccaa gcctcattta gtaatatgca 1500 

tttaaacttt ctccatgtct ttaatggctt gatagctcat ttatttttat catggaatat 1560 

ttcattgtct ggatggacca cagtttattt ctccattcac ctactgaagg acatctcggt 1620 

tgcttctaag ttttggcaat tatgaataaa gctgctataa ccatcaagtg caggtttttg 1680 

tgtggaccta ttatcaacta attcgggtaa atctcaagga gtgcaattgc tggatcacac 1740 
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agtaagagtg tgtttagttt taagtggctg tgccattttg cattcccacc agcaatgaat 1800 

gagagtttct gttgctccac attctcacta ccattcggtg ttgtcagtgt tttgcatttt 1860 

ggccattcta gtaggtgttt acatggtatc tagtcatttg aatgggcata tgatgtggaa 1920 

catctttttt tttttaattt tattattatt atactttaag ttttagggta catgtgcaca 1980 

acgtgcaggt ttgttacata tgtatacatg tgccatgttg gtgtgctgca cccattaact 2040 

agtcatttag cattaggtat atctcctaat gctattggaa catcttttca tgtgtttatt 2100 

tgccatctgt atatcttccc tgatgagttg gggatgcatt ctttccatct cagagtcccc 2160 

agaaactaac atagcagttg gtacagagtt ggtgctcaac aaacatcagc ttaggaacta 2220 

tgtcctatgt ttttttgttt tttttttttt ttaaaaagga atgtgagctg ttcccaaaac 2280 

gtatgtcctt cccccatgcc tctaccctgc ccttccacaa actttctgat cttcagcaca 2340 

cactacccaa ccatcaaggc tgagacttcc cgtggccagc agtgtctcat gctggcttca 2400 

agccccacag cactgctttt ttcaacttct cttgtggttt agactgtctt tagcccagca 2460 

agagaattca ttgtcttatc ccccattaaa ctgtacctac actctttgag gaaaagggtc 2520 

catcttactt aaaatatttt aaaaattcac atgtgataaa actgatgtgt atgtgtatga 2580 

gagagagaga aagagagaac agttctatga ggcttaatgt atacatgtgg ctgtgcagcc 2640 

cactgcccca tgtcaccatt ttcaattctg tcattacccc tgggacaatg tcttgtcaat 2700 

atacacttga ggctgggcgg ggtggttcct gcctgtaatc ccacctgtaa tcaggagttc 2760 

aagaccagcc tggccaacac agcaaaaccc cgtctctact aaaaatacaa aaatttgctg 2820 

ggcgtggtgg cacgggcctg taatcccagc tactcaggtg gctgaggcag gacaatcgct 2880 

tgaaccgggg aggtggaggt tgcagtgagc caagatcgtg ccactgcact ccagcctagg 2940 

caacagagcg agactctgtc tcaaaaatac atacatacac acacacatac acacacacgc 3000 

gcaagagaca cagtgagtag aaacaccga 3029 

<210> 626 
<211> 224 
<212> DNA 
<213> Homo sapiens 

<400> 626 

gaccaacatg gtgaaaccct gtctctacta aaaacacaaa atttagctgg gtgtggtggt 60 

gcgcacctgt aatcccagct actcaggaag ctgaggcagg agaatcactt gaacctggta 120 

ggtggaggtt gcagtgagct gagatcatgc cactgcactc cagcctgggt gacagagcga 180 

gactctctct ccaaaaaaaa aaaaaaaaac ctagaccaga tgca 224 

<210> 627 
<211> 468 
<212> DNA 

<213> Homo sapiens 
<400> 627 

tcagcattaa aatttgggga aaaaaatgtt tgaagatcaa tgaaaaatag cactggactc 60 

tatatttact tatataagat ttttttttta aatcttgatg aaaattttta ttaggctaga 120 

attgcagagc tgcaagagaa ttctgagatc aacaggtctt tttccttcat ttaccagctg 180 

agaaacttgg ggtcatagga gattaagatt gtttatctta ccagttgaat tcaatacttt 240 

cagaaataat tgttagaatt cagtgaaagt tattctctta aaagtacact gcttttcagc 300 

tcctatcaag tgtttcccta tgcttccaca atcctgatta tcaatttggg tatgggccac 360 

tctgggaaag gctgtcaaat gtttattcta gaatccagca actcccagat gtccatctcc 420 

atctcaggtt ctggtcaaag tctgatgtag caaagaaaca gagtaaca 468 

<210> 628 
<211> 222 
<212> DNA 

<213> Homo sapiens 



<400> 628 



ccaacatggt gaaaccctgt 
gcacctgtaa tcccagctac 
tggaggttgc agtgagctga 
ctctctctcc aaaaaaaaaa 



ctctactaga aacacaaaat ttagctgggt gtggtggtgc 
tcaggaagct gaggcaggag aatcacttga acctggtagg 
gatcatgcca ctgcactcca gcctgggtga cagagcgaga 
aaaaaaacct agaccagatg ca 



60 
120 
180 
222 



<210> 629 
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<211> 468 
<212> DNA 

<213> Homo sapiens 
<400> 629 

tcagcattaa aatttgggga aaaaaatgtt tgaagatcaa tgaaaaatag cactggactc 60 

tatatttact tatataagat ttttttttta aatcttgatg aaaattttta ttaggctaga 120 

attgcagagc tgcaagagaa ttctgagatc aacaggtctt tttccttcat ttaccagctg 180 

agaaacttgg ggtcatagga gattaagatt gtttatctta ccagttgaat tcaatacttt 240 

cagaaataat tgttagaatt cagtgaaagt tattctctta aaagtacact gcttttcagc 300 

tcctatcaag tgtttcccta tgcttccaca atcctgatta tcaatttggg tatgggccac 360 

tctgggaaag gctgtcaaat gtttattcta gaatccagca actcccagat gtccatctcc 420 

atctcaggtt ctggtcaaag tctgatgtag caaagaaaca gagtaaca 468 

<210> 630 
<211> 224 
<212> DNA 

<213> Homo sapiens 
<400> 630 

gaccaacatg gtgaaaccct gtctctacta aaaacacaaa atttagctgg gtgtggtggt 60 

gcgcacctgt aatcccagct actcaggaag ctgaggcagg agaatcactt gaacctggta 120 

ggtggaggtt gcagtgagct gagatcatgc cactgcactc cagcctgggt gacagagcga 180 

gactctctct ccaaaaaaaa aaaaaaaaac ctagaccaga tgca 224 

<210> 631 
<211> 1854 
<212> DNA 

<213> Homo sapiens 
<400> 631 

tttttttttt gagactataa acatatatat acacgtagta ttttctatgg ttgccaattc 60 

ttatgctaga tgttagcatt acaaattcgt ataaaatctg tgtccgcctt catagtccaa 120 

caacaaacgt ttgctgagtg tagaactccc aggagttctc tcatggacac aacagccaca 180 

ctcaccctca cattagctgg cagaggccct ttataccagt gatacgcttg ggccaagggc 240 

ccctgggttg aaaatcatta ggtagacacc atccagtgcc ctgtctcatg tgctgaacag 300 

ggctagtaat gccgactttc actttgggag gccaagtcgg gtggatcact tgaggccagg 360 

aggtcaggac cagctagcca acatggtaaa accctgtctc tactaaaaat acaaaaatta 420 

gctgggcatg gtggcatatg cctatagtgc cage tac tea ggaggctgag gcatgagaat 480 

cacttgtacc caggaggecg aggtttcagt gagecaagat catgccacta cactccagcc 540 

tgggtgacag agtgatatag atctcaaaaa aaaaaaaaaa aagtgtcaac ctttacttct 600 

attttatcaa gttcttgttc cagcctgtcc ttttcaaaag ataccctcct tggtaaagat 660 

tccaaagttt tgggttttct ttaaatgtat gcatccaatg tctttttcat ttcattaagg 720 

tcaaataaat atttgtcctg ggtttccaat aaggtatttg agattttcct gtgccataat 780 

gactgtattt ggtacaggca aggcaatata ctaaggtctc tctacagatc ttgaaaatca 840 

gatcagcett ggaggatgaa attagtaaca actaatacac cctggaggac accagccctt 900 

tggaaggcat tcattggaag agactagaaa agcagttgag gttgggagtt tctcatttca 960 

ggcaaagtga caagggagtg gtatcagaaa atacgatctt aactaagaaa gctagaacct 1020 

atatcactct agaagactgt teacaaaegg tagcagggee catgtgcagc ttggaaagag 1080 

gaagcttaat gtgcaaaaat ttaatatccc tgcaactctc tggaaattgg caagatggtc 1140 

cctcagaaga gcagatctgt gtggtgagtg agatatcaca gctccatgag gtgaacacca 1200 

aaccttccaa tcatcagttg tctcccatgt acttatggag aatggtagac aaaactcact 1260 

gaactgette atttgtacat tgatctcttg tgaccttget ttgtgggagt tagtcccaga 1320 

gaatctttat aaactggcaa gattttctga cctaaagagg taggatctgt ggaaaatgat 1380 

gacctaccaa agtacaaatc tagagaactg aggtttggag aacatgaaaa ccatggtcaa 1440 

acacgtgaag gcctgccact gggaagagga . attcaactag caatacacag gcccaactaa 1500 

tatgtaagaa agacactaca aagtagtagt taagaacaca ggctctggag ccaaacgact 1560 

tggctctaac acttactggc tccatgacct tgaacaactt aggttccatg tgecttaatt 1620 

tctccatctg ttaaataaag tgattgtgaa gatttaatga gataatccaa ataaatgett 1680 

agtacaatgt atggaacaca gcaagttctc aagagactag ccattacatc attattatcc 1740 

aaagaagaaa cagaccaccc tgtaagattt ggaagggagc tcccccagga ctcctcccac 1800 
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cccctgccag gggtacagag agactagttt attgagggcc aaacacaaaa cttg 



1854 



<210> 632 
<211> 2128 
<212> DNA 

<213> Homo sapiens 
<400> 632 

tctacctccg ggctgaaacg tcaccatgcc tccccacaga cagacggatg gacagatggg 60 

cctccctgca cctgctctgt gggtgtgggg gctcctgctc agcagcagtt tccagaccct 120 

tctccctgct ttccccaagc cacccgcctt gaatctgggg tgctctacca gacccatccc 180 

ctcatttcta aagatttgag ccactagtcg tgtccctctc cctcagaaat gccttggtga 240 

cacttggctg ctttcaactc ttccacccat ctgcctcttg gtctcatctt taccttctgc 300 

taaaggtcct gacccccacc cccgccacgc catggggcac cccatggtgg tgcgtccttg 360 

ggagcagctc tgtccctttc cccgtggcct ttgccccgcc tcctatgact tcgattccca 420 

cctgtccccg acccctggga ccactgaccg ggcccgatca ccctgtcact gccctgtcat 480 

ctgcttaccc cacacggtgc tctgctgacc caggtcttgc tgtctcccaa cagccccacg 540 

aggcttcccg tcgctcctgg acactgcagg ctgagcccgc tgccccgccg cctccatgag 600 

gaaggctttt cctctgtgag ccccaggcca ccctttccct cctttaagta attacttaag 660 

tcccttgcca gggccctccc agtacccttt ctaaagacac ccctgcccca gcatgctgca 720 

ggctcctgct ccactttcct ctcaggccct cgtcgctgtg gtgctgcctt tgttttctgt 780 

ctctgccacg gcagggggtc agctccttgg aggtggggct tctgcccttg ctgtaccact 840 

gcctggcaca cagtaggtgc tcaataaaga cttgcagggt gagctgcctg aagaatagtc 900 

accagaggcc agaaatgtct agagctctgc cggtagggtg actggccgag gagcctggcc 960 

tgcatgtgtg cgtgtgtgtg tgtgtgtgtg tgtgtgtgag tcagggttta tatgcaggtg 1020 

tctacaggag acatgctggg ttctgtgctg ggtgtgagga atatgggagc agaaccccag 1080 

ggaggtggca gagacttggg ggccaaaggg ctggggtgca ggggggcaac agccaggtgc 1140 

cactggccac cccagccgca gggagccctg cccaccctcc aggtgcctgg atgtccaacc 1200 

tcactgctat tcccacctca agccaggcct ggagatggag gccccatgac tcagccaggg 1260 

ccggtttgca gctgcggctg acccagacgg gcgggcagcc cccagccccc gggcctgcac 1320 

ccaggacagg gccgccctcc ctccctcccc cgcttctggc tcctaggaca ggattctctg 1380 

aattcagctc ccctgaggct ggggccaggt tggaggccag gcctgggggc tctgggctgg 1440 

ggtcccagat aggggctggg cggccaggct tggaatctgg aatccagccc cattcctggc 1500 

atctgcagga gcctcgtggg gagggagact tgggatggac ttcaaccagc cagggctgga 1560 

ttcttgcccc ggaacctgca ttcctggggc agccaaggga tccttcccac ttctgggccc 1620 

agcttggccc tgcctggcat tcgaggccca tctggggctt gggggtgtct ccccaactct 1680 

cagacataag gacacccttc caagcttgtt ccttcacctg gcggggccct gagccccaca 1740 

cccctcccct gtcctttctc catccgacat caagcgcctc cctgcctctg ctcgcacagt 1800 

ctctgagatg gggaactcag cacctcacag gtgggcccag ctctggtgct gtctgtgttg 1860 

ggggagctgg ggcagccccc aaaagacctt ggagacagac cctcagaggc aggagcagag 1920 

gctggcagtg gatgctgtgc ctggaggcct tgagggcgag gtgtgatgat gaggcccagg 1980 

ctgcagggct ctttctggct ctccagctcc ggagaacaag ggatttcctc ctgctctgcc 2040 

caccctcccc agccagtgca tgctcagcct cagcaccgca cctgggcgcc ctccatgatc 2100 

tgccccacct ggacacatgg ctcgaggg 2128 

<210> 633 

<211> 2129 

<212> DNA 

<213> Homo sapiens 



<400> 633 



tctacctccg ggctgaaacg tcaccatgcc tccccacaga cagacggatg gacagatggg 
cctccctgca cctgctctgt gggtgtgggg gctcctgctc agcagcagtt tccagaccct 
tctccctgct ttccccaagc cacccgcctt gaatctgggg tgctctacca gacccatccc 
ctcatttcta aagatttgag ccactagtcg tgtccctctc cctcagaaat gccttggtga 
cacttggctg ctttcaactc ttccacccat ctgcctcttg gtctcatctt taccttctgc 
taaaggtcct gacccccacc cccgccacgc catggggcac cccatggtgg tgcgtccttg 
ggagcagctc tgtccctttc cccgtggcct ttgccccgcc tcctatgact tcgattccca 
cctgtccccg acccctggga ccactgaccg gggccgaatc acctgtcact gccctgtcat 
ctgcttaccc cacacggtgc tctgctgacc caggtcttgc tgtctcccaa cagccccacg 
aggcttcccg tcgctcctgg acaatgcagg gtgagcccga tgccccgccg cctccatgag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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gaaggctttt cctctgtgag ccccaggcca ccctttccct cctttaagta attacttaag 660 

tcccttgcca gggccctccc agtacccttt ctaaagacac ccctgcccca gcatgctgca 720 

ggctcctgct ccactttcct ctcaggccct cgtcgctgtg gtgctgcctt tgttttctgt 780 

ctctgccacg gcagggggtc agctccttgg aggtggggct tctgcccttg ctgtaccact 840 

gcctggcaca cagtaggtgc tcaataaaga cttgcagggt gagctgcctg aagaatagtc 900 

accagaggcc agaaatgtct agagctctgc cggtagggtg actggccgag gagcctggcc 960 

tgcatgtgtg cgtgtgtgtg tgtgtgtgtg tgtgtgtgtg agtcagggtt tatatgcagg 1020 

tgtctacagg agacatgctg ggttctgtgc tgggtgtgag gaatatggga gcagaacccc 1080 

agggaggtgg cagacacttg ggggccaaag ggctggggtg caggggggca acagccaggt 1140 

gccactggcc accccagccg cagggagccc tgcccacctt ccaggtgcct ggatgtccaa 1200 

cctcactgct attcccacct caagcaaggc tggaaaatga aggcccacta atccgccagg 1260 

ggcgggttgg cacttccggt taaccaaaac gggcgggcag ccccagcccc ctggcctgca 1320 

cccaagagaa gggcggcctc cctccctccc ccgcttctgg ctcctaggac aggattctct 1380 

gaattcagct cccctgaggc tggggccagg ttggaggcca ggcctggggg ctctgggctg 1440 

gggtcccaga taggggctgg gcggccaggc ttggaatctg gaatccagcc ccattcctgg 1500 

catctgcagg agcctcgtgg ggagggagac ttgggatgga cttcaaccag ccagggctgg 1560 

attcttgccc cggaacctgc attcctgggg cagccaaggg atccttccca cttctgggcc 1620 

cagcttggcc ctgcctggca ttcgaagccc atctggggct tgggggtgtc tccccaactc 1680 

tcatacataa cgacaccctt ccaagcttgt tccttcacct ggcggggccc ttagccccac 1740 

acccctcccc tgtcctttct ccatccgaca tcaagcgcct ccctgcctct gctcgcacag 1800 

tctctgagat ggggaactca gcacctcaca ggtgggccca gctctggtgc tgtctgtgtt 1860 

gggggagctg gggcagcccc caaaagacct tggagacaga ccctcagagg caggagcaga 1920 

ggctggcagt ggatgctgtg cctggaggcc ttgagggcga ggtgtgatga tgaggcccag 1980 

gctgcagggc tctttctggc tctccagctc cggagaacaa gggatttcct cctgctctgc 2040 

ccaccctccc cagccagtgc atgctcagcc tcagcaccgc acctgggcgc cctccatgat 2100 

ctgccccacc tggacacatg gctcgaggg 2129 

<210> 634 
<211> 2132 
<212> DNA 

<213> Homo sapiens 
<400> 634 

tctacctccg gcctgaaacg tcaccatgcc tccccacaga cagacggatg gacagatggg 60 

cctccctgca cctgctctgt gggtgtgggg gctcctgctc agcagcagtt tccagaccct 120 

tctccctgct ttccccaagc cacccgcctt gaatc.tgggg tgctctacca gacccatccc 180 

ctcatttcta aagatttgag ccactagtcg tgtccctctc cctcagaaat gccttggtga 240 

cacttggctg ctttcaactc ttccacccat ctgcctcttg gtctcatctt taccttctgc 300 

taaaggtcct gacccccacc cccgccacgc cacggggcac cccatggtgg tgcgtccttg 360 

ggagcagctc tgtccctttc cccgtggcct ttgccccgcc tcctatgact tcgattccca 420 

cctgtccccg acccctggga ccactgaccg ggcccgatca ccctgtcact gccctgtcat 480 

ctgcttaccc cacacggtgc tctgctgacc caggtcttgc tgtctcccaa cagccccacg 540 

aggcttcccg tcgctcctgg acactgcagg ctgagcccgc tgccccgccg cctccatgag 600 

gaaggctttt cctctgtgag ccccaggcca ccctttccct cctttaagta attacttaag 660 

tcccttgcca gggccctccc agtacccttt ctaaagacac ccctgcccca gcatgctgca 720 

ggctcctgct ccactttcct ctcaggccct cgtcgctgtg gtgctgcctt tgttttctgt 780 

ctctgccacg gcagggggtc agctccttgg aggtggggct tctgcccttg ctgtaccact 840 

gcctggcaca cagtaggtgc tcaataaaga cttgcagggt gagctgcctg aagaatagtc 900 

accagaggcc agaaatgtct agagctctgc cggtagggtg actggccgag gagcctggcc 960 

tgcatgtgtg cgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgagtcaggg tttatatgca 1020 

ggtgtctaca ggagacatgc tgggttctgt gctgggtgtg aggaatatgg gagcagaacc 1080 

ccagggaggt ggcagagact tgggggccaa agggctgggg tgcagggggg caacagccag 1140 

gtgccactgg ccaccccagc cgcagggagc cctgcccacc ctccaggtgc ctggatgtcc 1200 

aacctcactg ctattcccac ctcaagccag gcctggagat ggaggcccca tgactcagcc 1260 

agggccggtt tgcagctgcg gctgacccag acgggcgggc agcccccagc ccccgggcct 1320 

gcacccagga cagggccgcc ctccctccct cccccgcttc tggctcctag gacaggattc 1380 

tctgaattca gctcccctga ggctggggcc aggttggagg ccaggcctgg gggccctggg 1440 

ctggggtccc agataggggc tgggcggcca ggcttggaat ctggaatcca gccccattcc 1500 

tggcatctgc aggagcctcg tggggaggga gacttgggat ggacttcaac cagccagggc 1560 

tggattcttg ccccggaacc tgcattcctg gggcagccaa gggatccttc ccacttctgg 1620 

gcccagcttg gccctgcctg gcattcgagg cccatctggg gcttgggggt gtctccccaa 1680 
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ctctcagaca 
cacacccctc 
cagtctctga 
gttgggggag 
agaggctggc 
caggctgcag 
tgcccaccct 
gatctgcccc 



taaggacacc 
ccctgtcctt 
gatggggaac 
ctggggcagc 
agtggatgct 
ggctctttct 
ccccagccag 
acctggacac 



cttccaagct 
tctccatccg 
tcagcacctc 
ccccaaaaga 
gtgcctggag 
ggctctccag 
tgcatgctca 
atggctcgag 



tgttccttca 
acatcaagcg 
acaggtgggc 
ccttggagac 
gccttgaggg 
ctccggagaa 
gcctcagcac 

gg 



cctggcgggg 
cctccctgcc 
ccagctctgg 
agaccctcag 
cgaggtgtga 
caagggattt 
cgcacctggg 



ccctgagccc 
tctgctcgca 
tgctgtctgt 
aggcaggagc 
tgatgaggcc 
cctcctgctc 
cgccctccat 



<210> 635 
<211> 290 
<212> DNA 

<213> Homo sapiens 



<400> 635 

gatgtgtgtt 

gtatgtgtgt 

ctttgaatga 

agcaagctgg 

gaaccaggag 



tgtgtgtgtg 
gtgtggtgta 
gaaattggct 
aggcccagga 
ggcgggtggt 



tgtggtgtgt 
tgtgtgtgtt 
cccatgatta 
gggccggtgg 
gcagctgcag 



gtatgtgtgt 
tgtgtgtgtg 
tggagctgac 
tgtggctgca 
tgtgaaagcc 



ggtgtgtgta 
tggtgtgtgt 
aactcccaag 
gccggtgtct 
ggcaggctcg 



tgtgtgtggt 
atgtgtattt 
gtctgcaggc 
gaaggcctga 



<210> 636 

<211> 96 

<212> DNA 

<213> Homo sapiens 

<400> 636 

ccaacatggt gaaaccccgt ctctactaaa aatacaaaaa ttagccgggc atggcggcac 
gtgcctgtaa tcccagctac tcaggaggct gaggca 

<210> 637 
<211> 2649 
<212> DNA 

<213> Homo sapiens 



<400> 637 

cttgtaggta 

atgtgattca 

atgttatatc 

agcataagtc 

aaatacagtt 

taaagcatca 

attagaatgc 

ggacagtaga 

ccctaaaatg 

tagagtgaga 

tgcagggacc 

aagagctacc 

ggcataaggg 

ttttaaacat 

ggatgttata 

catctttgtg 

gtgtaagagc 

aaggagagta 

gtcaattgtc 

ttctgtcagc 

ctcggcctcc 

tagtcatttg 

cccaggtctt 

tcatttctac 

aggctctgac 



ctcattgagg 

gatgcccaat 

aaaatatgtt 

ccattttttt 

tggaacttaa 

agaaaaagaa 

aggaaatgag 

aagtgaatgt 

ctgtttgctg 

cagctgtcat 

aacagtcatg 

ccaggcagca 

aatactctat 

actttacatt 

tctgacatag 

cctcatgcag 

ttgtgtttca 

aatggaaatg 

tcttgagaat 

aaagaacaag 

ctacaaaaac 

tttcttcttg 

gtgaagggtt 

atgaaaccat 

accaaaattg 



tttattgtgt 
cttactctgt 
gttatacttt 
ccatgggagt 
ataaactatt 
aagatttgct 
gggaagtgga 
aggagctttc 
tcttaggttg 
cataagagtc 
ttcagggcag 
cctgctacca 
cctttctgag 
cctcctcttc 
agtcctcgga 
tggcttacaa 
aatgggtttg 

gggggagcag 

agaggaggct 
cactactctc 
ctagggatca 
tccacaggga 
attaacaccg 
cagattttag 
ccataatgaa 



aagatgaatg 

tactttatga 

aggataatcg 

ctaggaaagc 

gatcaatttc 

gtcaagacca 

aaggcagcaa 

tgacccatgc 

caagtaacca 

atttgatctg 

cagctggtgc 

atagtgcaaa 

taaagagcaa 

tggggctcaa 

gcagcagttg 

gtcaaccaga 

gttttcttaa 

ggtcccctgg 

attgagtttt 

cagcaattgc 

acttagttca 

gaaccatttt 

gaaagaaata 

tactgtgata 

ggtgtttcac 



aatgttgcaa 

aaatttttta 

gtgtgttagc 

tatatgttta 

tggtcttatg 

ggaaaatttg 

gtaggagaga 

acttcaggaa 

aattaaaacc 

tttaaaggtg 

acacttcaag 

caactcagag 

gtagaactaa 

gcctactttt 

ttcctgaaag 

cttctcccct 

tgtcacccta 

ggaggtttaa 

cattccacac 

tttccactgg 

ctccaaatta 

ctttccttct 

cattttaata 

ttttggtccc 

ttcttctcat 



attcctaaac 

aagctatatg 

cctgaatttc 

ttcagcagca 

ctagaaggaa 

acaatagagt 

aaaagtgcag 

cgcaattcat 

agtttgaaag 

gctgcttgta 

cacagaccat 

agacctcgtt 

aggtttcaca 

gggccaaagc 

ttcctttttg 

gacttttgat 

ggttggtgga 

acagatggaa 

tctgctcctg 

actcccccac 

gaaaatttaa 

ttcaaaattg 

agcttaaatc 

tctgtctttt 

ttatttttat 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2132 



60 
120 
180 
240 
290 



60 
96 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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gggatctttt attcccaaat gccttttcat cccagccaaa gggagaaatg ttgatagatc 1560 

tgccatcaag aaggttccaa agctggcctg tcaggttttc tgttttcttg tttattatct 1620 

ttgaactttt gttttaaatg ttttaaacac ttatttacca tgtaactaaa tgtctgatag 1680 

tattgaaaat actttgtggg ttttaattta tttaatgctc atgaaaccct atgaggtagg 1740 

tactgatatt atttttattt tactgatgag gaaagtgaag caaagagaag tgaaatgaaa 1800 

ggtagtgagt gatgggacca gggtttggac atgggcagtc tggctctaaa atgtatgctt 1860 

ttaactacta tgtaatgctg cctcacaaac aacttgtctc acaaattgat attctggatc 1920 

agaggatgtc gactggcccg caaatgtatt ttgtatggct catacacagt tcagaagttt 1980 

taaaaattta catagaaatc tgcatttcct gacttctttt gaaaatggga ataccaaaca 2040 

tcattaggct tgaattccca atacggcaac aacagctgag caacaagcag ctgtttagac 2100 

taggcactcg ctctccaatt tccacagtcc ccaccaatgc agatcatagt atcgacttaa 2160 

atttcctgcc tgccttagag aagcttctga gcttgtgacc tctattctag ctgctctatg 2220 

aatggacgct gccccagtac agcgaggacc tgctgcaaaa tgcatttctt agtcttcaat 2280 

acttattcct ccttgtaact ggatttctgg taagttatgt ctcatggtgg atctgcccca 2340 

aagatggaga ctgaatggca gtgagtcact cgccctggcc tccattgttc tggagaaggt 2400 

tccagccaca tggttgatgt cagctggttt tccagagcca gagctgggtt gcgggacaga 2460 

cacacctgca tctaatagtg aaaggcaaag ttgaaaggcc aagaccagcc tgaggtctga 2520 

gggaccaagg gcttcacaga ggccagaagt tcagaggtgg acataaaagg tgttaggaga 2580 

ataaggaagt gaaaagaaca tagtacagtg tatcagagga ggagctccag gctggcaaat 2640 

atcactccc 2649 

<210> 638 

<211> 2649 

<212> DNA 

<213> Homo sapiens 

<400> 638 

cttgtaggta ctcattgagg tttattgtgt aagatgaatg aatgttgcaa attcctaaac 60 

atgtgattca gatgcccaat cttactctgt tactttatga aaatttttta aagctatatg 120 

atgttatatc aaaatatgtt gttatacttt aggataatcg gtgtgttagc cctgaatttc 180 

agcataagtc ccattttttt ccatgggagt ctaggaaagc tatatgttta ttcagcagca 240 

aaatacagtt tggaacttaa ataaactatt gatcaatttc tggtcttatg ctagaaggaa 300 

taaagcatca agaaaaagaa aagatttgct gtcaagacca ggaaaatttg acaatagagt 360 

attagaatgc aggaaatgag gggaagtgga aaggcagcaa gtaggagaga aaaagtgcag 420 

ggacagtaga aagtgaatgt aggagctttc tgacccatgc acttcaggaa cgcaattcat 480 

ccctaaaatg ctgtttgctg tcttaggttg caagtaacca aattaaaacc agtttgaaag 540 

tagagtgaga cagctgtcat cataagagtc atttgatctg tttaaaggtg gctgcttgta 600 

tgcagggacc aacagtcatg ttcagggcag cagctggtgc acacttcaag cacagaccat 660 

aagagctacc ccaggcagca cctgctacca atagtgcaaa caactcagag agacctcgtt 720 

ggcataaggg aatactctat cctttctgag taaagagcaa gtagaactaa aggtttcaca 780 

ttttaaacat actttacatt cctcctcttc tggggctcaa gcctactttt gggccaaagc 840 

ggatgttata tctgacatag agtcctcgga gcagcagttg ttcctgaaag ttcctttttg 900 

catctttgtg cctcatgcag tggcttacaa gtcaaccaga cttctcccct gacttttgat 960 

gtgtaagagc ttgtgtttca aatgggtttg gttttcttaa tgtcacccta ggttggtgga 1020 

aaggagagta aatggaaatg gggggagcag ggtcccctgg ggaggtttaa acagatggaa 1080 

gtcaattgtc tcttgagaat agaggaggct attgagtttt cattccacac tctgctcctg 1140 

ttctgtcagc aaagaacaag cactactctc cagcaattgc tttccactgg actcccccac 1200 

ctcggcctcc ctacaaaaac ctagggatca acttagttca ctccaaatta gaaaatttaa 1260 

tagtcatttg tttcttcttg tccacaggga gaaccatttt ctttccttct ttcaaaattg 1320 

cccaggtctt gtgaagggtt attaacaccg gaaagaaata cattttaata agcttaaatc 1380 

tcatttctac atgaaaccat cagattttag tactgtgata ttttggtccc tctgtctttt 1440 

aggctctgac accaaaattg ccataatgaa ggtgtttcac ttcttctcat ttatttttat 1500 

gggatctttt attcccaaat gccttttcat cccagccaaa gggagaaatg ttgatagatc 1560 

tgccatcaag aaggttccaa agctggcctg tcaggttttc tgttttcttg tttattatct 1620 

ttgaactttt gttttaaatg ttttaaacac ttatttacca tgtaactaaa tgtctgatag 1680 

tattgaaaat actttgtggg ttttaattta tttaatgctc atgaaaccct atgaggtagg 1740 

tactgatatt atttttattt tactgatgag gaaagtgaag caaagagaag tgaaatgaaa 1800 

ggtagtgagt gatgggacca gggtttggac atgggcagtc tggctctaaa atgtatgctt 1860 

ttaactacta tgtaatgctg cctcacaaac aacttgtctc acaaattgat attctggatc 1920 

agaggatgtc gactggcccg caaatgtatt ttgtatggct catacacagt tcagaagttt 1980 

taaaaattta catagaaatc tgcatttcct gacttctttt gaaaatggga ataccaaaca 2040 
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tcattaggct tgaattccca atacggcaac aacagctgag caacaagcag ctgtttagac 2100 

taggcactcg ctctccaatt tccacagtcc ccaccaatgc agatcatagt atcgacttaa 2160 

atttcctgcc tgccttagag aagcttctga gcttgtgacc tctattctag ctgctctatg 2220 

aatggacgct gccccagtac agcgaggacc tgctgcaaaa tgcatttctt agtcttcaat 2280 

acttattcct ccttgtaact ggatttctgg taagttatgt ctcatggtgg atctgcccca 2340 

aagatggaga ctgaatggca gtgagtcact cgccctggcc tccattgttc tggagaaggt 2400 

tccagccaca tggttgatgt cagctggttt tccagagcca gagctgggtt gcgggacaga 2460 

cacacctgca tctaatagtg aaaggcaaag ttgaaaggcc aagaccagcc tgaggtctga 2520 

gggaccaagg gcttcacaga ggccagaagt tcagaggtgg acataaaagg tgttaggaga 2580 

ataaggaagt gaaaagaaca tagtacagtg tatcagagga ggagctccag gctggcaaat 2640 

atcactccc 2649 

<210> 639 
<211> 190 
<212> DNA 

<213> Homo sapiens 
<400> 639 

atgagaaagt aaacatacaa cacagtaaaa gttgttccaa gtatttaaaa atgttctata 60 

tcttctacct tattatggct caagatatag taacttttgt ggggttttat cattgttttt 120 

gttttatttt gtgaaaacat ccaagagccc gcaacagcct atagatttgc agatcctttt 180 

ctttcctggg 190 

<210> 640 
<211> 190 
<212> DNA 

<213> Homo sapiens 
<400> 640 

atgagaaagt aaacatacaa cacagtaaaa gttgttccaa gtatttaaaa atgttctata 60 

tcttctacct tattatggct caagatatag taacttttgt ggggttttat cattgttttt ' 120 

gttttatttt gtgaaaacat ccaagagccc gcaacagcct atagatttgc agatcctttt 180 

ctttcctggg • 190 

<210> 641 
<211> 21501 
<212> DNA 

<213> Homo sapiens 
<400> 641 

gaggctctgg tcataatggc gaccacactt ttattgaata tctgttgtgt actgtccagg 60 

tcttcacatt tgtcacctca tttaattctc atgttaagcc tgagtggaaa gcattatgaa 120 

gcccatttga ggaaagagaa gagtgaagtt cagaggcaaa agtcatctgt ccagggtcct 180 

ttgccaaatg acagctggca tggaaccctc ctaaggctga ttcctctatc ctccttctcc 240 

taacagcact tcaggggcaa ccttctcctc tatcttttct ccactcacca tcaaagaagc 300 

cctcaatttc ccaggattct aggacatttc cctaaggtgg gggtggtggg acaggtagta 360 

agactgatgc cttttaactt tccaagacct ttctttattt cctgagatga ataaataagt 420 

ggcaatgggg tgtggggatg tggtgaacac agaacaggaa ctggcagcat gcagctggca 480 

ggagggagct gtgatggccc cacccctgtg aggaccacct aatcctaagc aggggccaaa 540 

gctctagcag ccaccattaa ggaaaaactt ttgaaattca cacattgtga agcctgccag 600 

tccccgccag gtgaagagct catggtatcc accttcaggg ctacttttta ccccaggggc 660 

cttgtctgcc cgggcctggg tacctgggct gctctgtaag gcaggtaggg gtgcttaccc 720 

cactctcaaa aataaaccaa gctgtcctga gtgtccagca ggagggcagc ccaagctctg 780 

ggggcagcgt gggtgggagg aggtcctaca ggtggtggcc gcgattatct gcccttgatg 840 

aagccctcca gcacacggtc actgcgctct gacaaccagt agccagtcat ggccgccaca 900 

gccccaatga agatggcggt gaacaccaag tcgcccttag cagccagcgt ggccagtgca 960 

cagatgatga tgccaaagaa catctgctgc agcaccacca actcgtcttc tgtcatctct 1020 

gagaagaagc ctgctgactc tgcagaggag caagagatgc agcctggcac tcgtatccct 1080 

gttcctggta gttcttggag tctccccaga actcccacac acctgatcat catggccacc 1140 

aggcctgccc agcaagccag gcacactaca tttactgctc aaaaccaccc tgcaaggggc 1200 

agttaccatc cctgtctcac taaatgggaa agtgaggctc agagaggtaa aggggttgga 1260 



442 



WO 02/076488 PCT/US02/08276 



cttgccaagg ccatgtggtg gacaagttgg gggcagagca tttattacgt gtatgagtct 1320 

cgttccacgg tcagactgtc tggattggga tcccagcttt gcgacttgct tgccatatag 1380 

ccctgggaaa gtagcttcat ctctctgcat cttcattttt acatatgtag agtggggatg 1440 

agaacccaac ttccagcgtc gggttgtttc aggggttaaa tcagttaaga tggggaatgc 1500 

acctagaaca gcatctgaca cagtaattgc tggggaaatg gtccctgcca tcatcgttgt 1560 

gattaccctg catctcacca cacccccggc ctaggacagc tctcagcagg ccagggcaaa 1620 

cactggtccc attggataga tgaggaaaca ggttaaggag gcagactgat ttgatgccag 1680 

gtctgattcc ctttgtaccc tggaaggtat aaggctgctg agagccaacc tggagatcag 1740 

atgtagagcc cggtctcagt gcagcctcca gggggctggg tggcagtagg gatggggcag 1800 

ggcctggagc tgccactgct acctggggaa ggggcttgcc tgccttctct ttgaatgagg 1860 

acctcccctc tcccgcccca gggctcacct gggatctgct ccttcacaca gatgccttcc 1920 

atctgcttgt agccctcggc acagatgcag cgataaccgc cctcggtgtt ttcacactgc 1980 

ttgttctctc ccggacacac ctctgtctca cactcatcca catcttgagg ggagggaggg 2040 

gtcagagtcc tgctagggcc ccccacccag caccctgact ccatcttgtc catctctcaa 2100 

cactcccctc tgcccttgcc caggcaacat tccacccttc cttcccccac ctgcccatat 2160 

tccagcctct tcaccattca tgctgggtga ggcactccca ggtgcctgtg tcccatcagc 2220 

aggagactca ccgagacact tggagcccac ctgctgatag ccagggctac acttcttaca 2280 

gcgacctggc cctgccccca tgcagcctag gcaggccttg gcacagtctg gagagaggag 2340 

agaaagatga gggtgagatg tgcacctagg cccaggggag ccacgctcca gaagcggagg 2400 

cctgggggtg ctctgaagcc tgagaagggg aaggtttgga gggacaggtg' tggagcagct 2460 

cttcctccac tcctcacgtc ccctcctctg cagaagtaga cactgacctc ggcactcata 2520 

ggagccctca gtgttcacgc agaattggtc agctccacag ttggctccct ctgtgccaca 2580 

ctcatcaatg tctgcagaga ggtgagcagg gtgagaattt caatccctat ttagtaggga 2640 

agctgaagcc agagccgttg gggaatggtc ttgcctggga ttaaatggta tggcagagac 2700 

tgcaagctgt ctaccaacac agaagcagac tagctggact tcaggctgcc cagccagaga 2760 

ccacatttgc caacctgtct tgaagctaag tgtgtccctg tgattgagtt ctggccaaca 2820 

ggatgtaaac aggagtgatg agctccaccg ttagcttggt cttaaaacaa tgcttggact 2880 

cctcaatgca ctttctttcc cttctccctt cctgagtgct ggcatgtggg acacagagat 2940 

gcccaggact cagcttcaac catgcagaca acgcccaagg ggatgattgt gcagtaagtt 3000 

acaggaataa tctgggtact ggaatgatta tgactagcag agctcccctg ccagcctgga 3060 

ctactcactt tgggactgta atatgataaa aaaaaataaa cctctaccct ctagattctt 3120 

tttttttttt ttgagatgga gtttcgctct tgtcacccag gctagagtgc aatggtgtga 3180 

tcttggctca ctgcaacctc tgcctcctgg attcaagcga ctctcctgcc tctgcctccc 3240 

gagtagctgg gattacaggc acctgccacc atgcctggct aaatttttgt atttttagta 3300 

cagacagggt tttgccatgt tggccaggct ggtctcgaac tcctgacctc aggtgatcca 3360 

ccccacttca gcctctcaaa gtgctgggat tagaggcgtg agtgagctac cacacccagc 3420 

caacctctat actctttaag ccacagtctt ttggggattt ttgttacagc agcttagcct 3480 

ttactttagt ctatataagt ggctaggctg ggaccaaacc aagtcttgaa cccaggtttc 3540 

ctgaccccaa gttcagtgtc ctcattctcc tagccaaccc ttcctcattt tttttttttt 3600 

tttttttttt ttttgagatg gagtctccct ctgttgccca gactggagtg cagtggtgcg 3660 

atctcggctc actgcaagct ccgcctccca ggttcatgcc cttctcctgc ctcagcctcc 3720 

tgagtagctg ggactacagg cacccgccac cacgcccagc taattttttg tatttttagt 3780 

tgagacaggg tttcaccatg ttagccagga tggtctcgat ctcctgatct catgatctgc 3840 

ccgccttggc ctcccaaagt gctgggatta caggcgtgag ccaccgcgcc cagccccttc 3900 

ctcaattttt aggacttggc caaagtaacg cctcctccaa gaagcctttc ctgattcccc 3960 

atcaccactt ccactgaact agatcagagg catctctcac tgtacacagg gactatacta 4020 

cacagtacct gtgtactgga ctgtaagtat ctggttactt ttctgccttc ccagccaggt 4080 

tgagatctcc atgaaggtgg agactaggct agtctagccc aggcttgtgt tgagagaata 4140 

aaggaccaag cccaggcccc tgaccatctt cccagaccta gctagggccc cacttaccta 4200 

cacacttgag gtgatgcagg gcccagccct tcttgcattg caaacagttt gattcctcag 4260 

gtcctgagca tcgggcacag gggccaaaac aagctgggtg gaatgggggc agcatgagga 4320 

tgggcaggca ggtgccccat ctgcccctgc ccagtgccac accttttggc tacctacccg 4380 

aacataccag atggctggcg ttgcgttctg cctcaaagta gccaaggcca cactggccac 4440 

aggcctcacc cccgtagccg gcttggcagt cacagtgccc gctgccccct cgtgtccctt 4500 

ctccttcaca ctgcccgtag ccaccgcagg gcctctctgt tcccccagga caggctgtgg 4560 

gaagacacca aaccaggtga ggtcatctat acaaccttga tatacaagat aatattttct 4620 

ccctctccct ctccctctcc cttgccttcc ctccctcccc ccctcccctc cccctccccc 4680 

tttccctctc cctctcccca tggcctccct ctccctctcc ccatggtctc cctctccctc 4740 

tctttacacg gtctccctct gatgccaagc tgaagctgga ctgtactgct gccatctcgg 4800 

ctcactgcaa cctccctgcc tgattctcct gcctcaacct gccgagtgcc tgcgattgca 4860 

ggcgcgcgcc accacgcctg actggttttc gtattttttt ggtggagacg gggtttcgct 4920 
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gtgttggccg ggccggtctc cagctcctaa acgcgagtga tccgccagcc tcggcctccc 4980 

gaggtgccgg gattgcagac ggagtctcgt tcactcagtg ctcaatggtg cccaggctgg 5040 

agtgcagtgg cgtgatctcg gctcgctaca acctccacct cccagccgcc tgccttggcc 5100 

tcccaaagtg ccgagattgc agcctctgcc cggccaccac cccgtctggg aagtgaggag 5160 

cgtctctgcc tggcggccca tcgtctggga tatgaggagc ctctctgcca ggctgcccag 5220 

tctggaaagt gaggagcgtc tctgcccggc cgccatccca tctaggaagt gaggagcgtc 5280 

tctgcccggc cgcgaccccg tctgggaggt gaggagcgtc tctgcccggc cgccctgtct 5340 

gagaagtgag gagcccctcc gcccggcagc cgccccgtct gggaagtgag aagtgtctcc 5400 

gccgggcagc cagcccgtcc gggagggagg tgagggggtc agccccccgc ccggccagcc 5460 

gccccgtccg ggaggtgagg ggcgcctctg cccggctgcc cctactggga agtgaggagc 5520 

ccctctgccc ggccaccacc ccgtctggga ggtgtaccca acagctcatt gagaacgggc 5580 

caggatgaca atggcggttt tgtggaatag aaaaggggga aaggtgggga aaacactgag 5640 

aaatcggatg gttgctgtgt ctgtgtagaa agaagtagac acgggagact tttcattttg 5700 

ttctgtacta agaaaaattc ttctgccttg ggatcctgtt gatctatgac cttatcccca 57 60 

accctgtgct ctctgaaaca tgtgctgtgt ccactcaggg ttaaatggat taagggcggt 5820 

gcaagatgtg ctttgttaaa cagatgcttg aaggcagcat gctggttgag agtcatcacc 5880 

actccttgat ctcaagtacc cagggacaca aacactgcgg aaggccgcag ggtcctctgc 5940 

ctaggaaaac cagagacctt tgttcacttg tttatctgct gaccttccct ccactattgt 6000 

cctatgaccc tgccaaatcc ccctctggga gaaacaccca agaatgatca ataaaaaaga 6060 

caaaaaaaca aaaaaaacaa gataatattt tctgctctgc atatcagtaa acatttttgg 6120 

agaagaaaat taattttaat gacataatat tttgtcataa aactgacatt tacttaacca 6180 

atttcctctt gtggaacaat tagattttac agacctttta tttatttatt tatttattta 6240 

tttatttatt tatttattta tttttagtga cagggtctcg ctctgtttcc caggcaatga 6300 

catgatcata gctcactgca gcttcgaact cctgggctca agcaatcctc ccacctcagc 6360 

ctaggggaac aacaggctgc actaccatgc cccactattt tttttttaag agacagggtc 6420 

ttgctctact acccaggctg gtctctcaaa ctcctggcct caagtgatcc tcctgcctca 6480 

gcctcccaaa gtgctgggtt tacaggtacg agccactgga ccaaatattt tctttttttt 6540 

ttttgagaca gagtctcgct ctgtcgccca ggctggagtg cagtggcatg atctcggctc 6600 

actgcaagct ccgcctgccg ggttcatgcc attcttctgc ctcagcctcc agagtagctg 6660 

ggactagagg cgcccgccac catgcccggc taattttttg tattttttag tagagacggt 6720 

gtttcaccat gatagccacg atggtctcga tctcctgacc tcatgatcca cccaccgcag 6780 

cctcccaaag tgttgggatt acaggtgtga gccaccgcgc ccagccaata ttttctttta 6840 

caaagactga atttgccatc tttgtagcta gctctttatc tgtagcttca ggaaagactg 6900 

ctctaaaaga ggctggtagt aagttcagct cttgtgccag gactgtgtgt gaggatggat 6960 

ggcgatcatg gaggctccag gtcagcaccc gcccctcccc aggaaggtac catagtccga 7020 

tctggcatgg gaaatgaagg ggctgggaag tctatcattt ctttagtaca tgatgataac 7080 

tatactgcat atgactcagt ctagcccctg tgctttctcc aagcctttcc ccactgattt 7140 

gcccatctcc ccagccctgt caggcaggca agaactgagg tccagagaag tttatctgag 7200 

tagcctgagt tcacacagcg aactagggac agagccagga ctggatccct ggactcggtg 7260 

gtcccctccc atccccaaga gcaacttaaa aactcacgaa ggcaggaggg cccgaaggtg 7320 

cctgcggggc agcagagctt cagggaatct gagcacagcc actggaagag gtccggggcc 7380 

tcctgctgcc tgaggcaggg gcagggtgga gggaggtggg gaggaggtgc tgagatagtg 7440 

tcataagtgc tgttcccatt cccccatgcc tgaatctgtg attcaaccca gcttggacat 7500 

aagacccaaa aggccatctg accaatgccc tttccccatt tggcagaagg gtaaactgag 7560 

gctcaaagtg gagcgcaagt cccccggcaa gttcttagca gacctgggtt tggaaggagc 7620 

acaaccagga catgggcccc agcatgcacc cccctgcttc tcggccaaat caggccttat 7680 

caccaggcct cacctgtgac ccacttccag ggaaggccct ttgccactca cttgtgaaac 7740 

caccagctct ccaccagctc ctcactcagc tccagcaggc ggtggcactc gaagtctgac 7800 

ttgctgcaca caccctccag cacctctacc aggcgggtct cactgatcgg acagggcagg 7860 

gctggtggga aacatgcccc accccaccca ccaccctcct caagtctctc tccacccttc 7920 

ctcgtgcggt gaaaatatca aaggagaggc catgaggtat aatggaaggg gtcatggttc 7980 

tgggaagcag agagacctgg gttcaaatct agtggtagat ctagtgggca agtcacttaa 8040 

cctctctgaa cctctttgtt ttgtcatctg taaaatgggg atgacaatac cttccctgca 8100 

ggtttgttgt aacatacatc agttcctgca tctgagcaat ggaatataat acccttgctg 8160 

ggtagatatg gctaaaaata catcctataa caagtgttgg cagggatgtg gagaaattag 8220 

aatcctccca cactgcaggt gggaatacaa aatggcatca ccactgtgga aaagcagttc 8280 

ctcaataagt aaaacatgga attgccatag gacctgccaa ttccactact ggatatagac 8340 

ccaaaagaat ggaaaacagg tgttcaaacg aaaacttgta gacaatgttc atagcagcac 8400 

tattcacaac agccaaaagg tagaaacaat gcagatgtcc atcaactgat gcatggttaa 8460 

acaaaatgtg gtacagccaa atgagaatat tattcagcca taaaaaggaa taagtattga 8520 

tatatgctac ggtatgggtg aaccttgaaa acattatgct aagtgaaact agccagacac 8580 
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aaaaggccac atattttatg attccaatga aatcagcaaa tccataattt ctgcttgact 8640 

aacacaaatt agtgattgcc aagagccgag ggaaggggag aatagggagt ggctgtgtaa 8700 

taggtatagg attttttggg ggggtaatga aaatgttctg gaactacata gtggtgatgg 8760 

ttacacaaca ttgtaaatat tattaaatgt ctttaatggg aaatttttta tgtgtatttt 8820 

accacaaata tatacatata tatatatata tacgcatata tatacgcaca tatatatacg 8880 

cagacatata tgcacatata tatacacaca catatatata tacacacaca catatatata 8940 

cacacatata taaatatttg atgtcttata tcctgtattt aaagggtctg gcacagtgcc 9000 

tggcatatag gaggtgattt aaaaaagcaa gcaacgcagg agccattaca aagctggaag 9060 

tgcctctggg gttgtcttag ccaatcccct cattcctgaa tgaggaaact caggcccggc 9120 

gaaggaaaag agaaaatgac caatgttaca taataacaaa acctttattt ctagtgggca 9180 

tcccagttaa gccttttcca cacttctatg aaataggcaa accaggtatt aatatcccca 9240 

ttttacaggt gaagaaactg tggctctgag aggttaagaa ccgacaacca ggtctaactc 9300 

tgagcctttg ctgactcctc ttccatctga ctttgaaaag ctggagggct cagggctcgc 9360 

ctttctgtat ccttcccatg cctttcttta catactgact atataaatgg ctttctgtat 9420 

tacatatatt gactatataa tggctttctg tatccttccc atggctttct ttacatactg 9480 

actatataat gacaactctc aaattcctgt ttccaattcc tactcttttc tgaattccag 9540 

atttgtatgg agtctacctg gatgtctaat ggacatttca gcccaatacg ttcaaagaac 9600 

accacatccc acccctcccc taaaccttct cggtttcgca tctcaccatc ttagaaaaca 9660 

gctgcactac tcacccagtt gctcaggccc aaaatctagt- aagcatcttt tattctactg 9720 

tttctctata cccataatga atctatcacc aggtcctatt gttctctctg caaaatatac 9780 

ccagactgtg acctcttcat catgtcctcc actagatacc acactagctt gtgttgcctt 9840 

tgtacctaga tgatgcactt gcctcccaaa ctggtctccc tgcttcctcc gcccttgccc 9900 

cttacaatcc gtttgctaca tagcagtgtg agtaatcttt aaaggtgtag agaagacagt 9960 

atcacttttt ctgctcatag ccttccaaga gcttcccatt acactccaaa aaaaatctaa 10020 

actcttttct gtagccctcg aggcttaagc ctttgccttc accttctagc actctcccct 10080 

tggctcactc cactccagcc acttctcaaa tacaccaagc tcagtcccat cctctgccca 10140 

attccctccc tgtcacatag ctcgcttctt cacttcctgt ctctctgttc aaatgccaaa 10200 

ccttgaacac cttgcctccc accgtcctgc agtctaccgt gcttcactgt ttgttcttca 10260 

tctgacataa tattattcac ttactcattc actgcttgtc ttctctacta gaatgtgagc 10320 

tccaggaagg gggcattttg cttgccttgt ttactgctat actatagatg ttcagtgcta 10380 

tttgttgaat gaataagtgc tccccaagag ttcctagccc tacaccagca gagcagagat 10440 

ttggcgggga ggggaatata caccccttcc cccagcagcc ccttacctgt ctttgtattt 10500 

ggacaaattc tcttcctccc aggcagtgtt tccacctcca aagttgtccc ggatggttct 10560 

ctccaggccc tggaaacaga aaattagtaa tagctaagta cagtgcctgg cacatgctag 10620 

cacttcactg ttctactgac taggaaactg aggcacagaa aggcaacctg cctagggcca 10680 

tacaactagt aagagaccaa gatgaaactt aaatccacgc agatttgcat cccagagttc 10740 

cccactctaa atcgctgtgt tcccctctaa cgaggctgcc ggtgcaccca ccttgttaaa 10800 

gctgtcaacc agtccccggc aggtatgaca cggatggggc tgaggcgggg gagaagactg 10860 

gggaggtgga gagggctgga gccagatagg tcctgggagg ttgaggaaga ggctgaggcc 10920 

ccagagcaca gctgggacta ggcccttcgg gggccatggg gccatcttta cccaggctgg 10980 

ggacctgcag atagagggcg tggagaggct tcactgaagg agtcaggccc cacgcgagaa 11040 

agaaaaaggt ctgttctaag gtgccgtggc gctgggatct gcgttattac aagcaaaaga 11100 

ggccaaactg ggtattgcag gggctgtggg gatgggtggg gaaaggcgca taagatgcaa 11160 

ttccagaaga gaaataagtt atcctcgcta gcatggggtg gagaggaaag tggcctcgct 11220 

ccttgacaag ggggcagaag tcagcctctt gtctgagggt ctgataaaga atattagggg 11280 

ccggatccgc agaattaggg agaagagatc agattccaag tactgaagga gaggaggcca 11340 

gaggggttgt cttgggggag gtgtccggaa ccccgggctt ggagggagga gcccggatcc 11400 

ggatatgaag ggaagggatc agattcaggg cctcgcagaa gggcaaaggc acggcccacg 11460 

agccggcaga gaaggatcca gaccctcgta ggccacggag aaggaagaag cgaccgcccc 11520 

agcccgcggg cgccgtgggc cgtctcctca ccgcgtcgct gccacagcct gcgtaaaacg 11580 

cacaaccccc acccgcgccg cagtccaggg ggcggagctt gccgcccagc. caatggcgac 11640 

gggccatgtg ccattacgtc accccacgcc agccaaccag aggcggcacc gagctgcacc 11700 

tgtccgggct ttaaagggcc gcggggctag ggaacgacgt ttgccagccc cgccactgaa 11760 

acggagacgc gtgtgggcca catgggtaga aaaacagcgt ctgcctttat ttaagtcccg 11820 

tcccccgccc caggtcccgc ccctggtccc accccgcgtc ccggcgcggg agtccccggg 11880 

ggatggaagt agctatccag ggctggctga agggcccggg acacttgctc cgctctctct 11940 

tggagccgcc cggaacgcgg ggcgcgaggc tgctgcaggg cccccaggaa gtctggcagt 12000 

tgggagggca gtgtgaagac gggcacggtg cggaaaaggg cgggtacggc gtccgggtgg 12060 

agcagcctgt cgaagcaggc ccccacgtag ctgccgggcg cccggccctg caagaagtcg 12120 

ggcagcacgc agctgagcga ggcgcggaag gcgtcgtgcg ggccglzgcgc cccgggcccg 12180 

gacaccccat cctgtagcca ctcgctgcac agcgccaccg caccgggaga gaagagcaag 12240 
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accaccacgc 

ccctgcgcgc 

cacagggccg 

aggagcagag 

aggtgagctg 

ctcacccccc 

gcggaggcag 

ggtcctcccc 

agaaggagga 

cagcgcttgt 

agaagaaaaa 

atataatccc 

ccatcctggc 

gtggtggcac 

actcgggagg 

cagggtgaga 

ccagagtcca 

tctttgaata 

gtcatagggt 

taatacatac 

ccctaagcta 

tgtccctgcc 

tcatgacagc 

actataatcc 

accagcctgg 

atggtggcag 

acccgggagg 

cagagtgaga 

tgtgtgtgaa 

atcccctaca 

gtatctctac 

gcccctcacc 

cacttggtgg 

agtcttctag 

caagctctca 

gaaggacaag 

ctggtacaga 

tagcgctccc 

ccagccaggt 

caactatgca 

ctgactaggg 

aagctctccc 

ctagccaaca 

agggctgaac 

gcccctaggc 

ctgcaggtct 

ggggcactgt 

agaatgaggt 
agtcaccccc 
gatgtgagat 
ctttgctggg 

gggggcctcg 

ggggtgagag 
cccatgccct 
tcaccagccc 
gagaccatgg 
gcaggcaaca 
tggattcaag 
tcatctgtat 
cccctctcat 
tgcaacccca 



cgccctcctg 

tcagttcacg 

acgccagggc 

ccgcgcggcc 

gcacaggcct 

gagcggacgt 

ccagtgagct 

caggggaatg 

tgagggaaag 

ggatgtctgg 

ctgagactta 

agcacttcgg 

taacacggtg 

acgcctgtaa 

cagaggttgc 

ctcctctcaa 

cagcacagtg 

agtaatgtca 

tggtgtgagg 

ttaacacaat 

gggttgctgg 

ctgtcttttt 

ctcagagatc 

cagcactttg 

ccaagatggt 

gcgcctatta 

cagaggttgc 

ctctgcctca 

atatataaaa 

tgtctcctgc 

ccccttcccc 

cctcacccca 

tgcttcagta 

gggtggggcg 

tcattttttg 

gccattgcat 

aaggaaaggc 

aagtcatcgt 

gttccaatat 

tatctcccgg 

tatgacttct 

tggaacccaa 

gtgtgctcct 

ataaaataag 

attgaggtgg 

tgtagtaagt 

tacctggagg 

agaggggact 

catgttccca 

gcatttggag 

tagtgaagta 

tgtctccaac 

gccactctga 

gaccttcacc 

acaagcactc 

ttagggccac 

gaaggcctgt 

gctgggtgat 

aaagcgtctt 

ctaaccttct 

gtcccacctt 



cagggtctgg 

acggctccac 

gcccaccagg 

cctggcggcc 

ggctccccgg 

cctgtttcaa 

cggccgccgc 

gggagccggg 

cgcagcggca 

gagacccaga 

acagaaatga 

gaggccgagg 

aaaccccgtc 

tcccagctac 

agtgagccga 

aacaaaacaa 

tcagagccag 

ccctctgcct 

actaaatata 

gccgagcatc 

gcccttgatc 

tttttttttt 

ttttaaaaat 

ggaggccgac 

gaaaccccat 

atcccagcta 

agttagccga 

aaaataaata 

atttgatcgt 

atctttatct 

tagctcagtc 

cccaggtcag 

gctcccactt 

ggggcaagga 

agtgaatgaa 

gtgctgtgcc 

agttcttaca 

cccatagctg 

tcaccaagct 

ctaaggttcc 

tccataccaa 

aacttagcac 

tggggatagg 

cctaaataga 

ggcccttctt 

actctccaag 

tagcaccctt 

gggagacttt 

gagaccggca 

agatctgtgg 

cagccactgg 

agtagcacat 

ccaagaccca 

cctaggaggt 

ctgcagctgc 

taagaaaagc 

tgggttggga 

taggaatcac 

ctgaccctaa 

caaagagatt 

atccctttcc 



cgccgctgcg 
aggtctacgg 
cgctcgaaac 
gctgccggag 
gggcggccct 
gagcctcagc 
tttccctcgg 
aagcgctcac 
aagagtaggc 
gaagaagagt 
attgggccgg 
cgggcagatc 
tctactaaaa 
tcgggaggct 
gatcgcgcca 
aacaaaacaa 
tggatgaata 
cagttttctc 
ttaatacata 
ttaagtgcta 
ttgctgtcct 
ttacctccaa 
ataaatttgg 
gcgggcggat 
ctctactaaa 
cccgggaggc 
gatcgcgcca 
aaaaaataag 
agctcctgct 
catgacagaa 
ttcagagttt 
gtccacattg 
taaatgtaag 
tttatatttt 
ggaatgaccc 
caagagtggt 
atactcacat 
taagatacaa 
ctggtggcct 
tgcagctgct 
atctggctta 
ctgtaggtgt 
aactgtgctt 
cagtaggtga 
ccaccagctc 
gcgagctgcc 
agggcagcac 
ggtgatgccg 
cagggtggat 
aggagggtcg 
gttccaaggc 
cgtctttgag 
tccttgctgg 
ggccctgccc 
agcttctccg 
aaaagccaaa 
atctgcactc 
tacatctttc 
agctgttttt 
cctttctcac 
catggtattt 



cgtgaaacca 

ccacgcgcag 

ccgagtcatc 

aaaacagatg 

ccgcccagcg 

caccctgtta 

agaactccca 

ctttcgcgtg 

aggccagcca 

tagaaactta 

gtgcggtggc 

acgaggtcag 

atacaaaaaa 

gaggcaagag 

ttgcactcca 

aaaacgaaga 

tagctccatc 

ctctgtaaag 

taaagcttaa 

aaatgtcagc 

gggtgtctgc 

ttgatttgcc 

gccaggtgcg 

cacgaggtca 

aatacaaaaa 

tgaggtagag 

ttgcactcca 

tgtatatata 

gctctcaaca 

acctgcctgt 

tgctggaacc 

tatgcattta 

tgactggctc 

tggtgttcct 

tctgtttatc 

ccctgtcaag 

ttgtccatgg 

acagaaccaa 

gatctccaaa 

ctgcctttgt 

taggaagaaa 

ggttatctgt 

cttctactct 

gtgctagttt 

acctgcagac 

ctctgccaag 

ctaccaggct 

ctgaatatgg 

ggagtgaaaa 

cccagtccta 

acagagggat 

aggccccagg 

gattggccac 

cagctgcccc 

agctgttcac 

ggcagaaaga 

aatctgacct 

tctgccccag 

atgtcttacg 

atgaccaccc 

gtatgcaatt 



agccacgggc 

cggcagctgg 

ggctgagtag 

gggagggaag 

cttgctccca 

ggggagaggg 

cagccactct 

atcctttttg 

cacgagggcc 

ccctccacag 

tcatgccttt 

gacatcaaga 

ttagccgggc 

aatcgcttga 

gcctgggcga 

aatgatttgc 

tgtttagggg 

ttgtacctat 

cacaatgtgt 

tgtttttctc 

aactacctgg 

ttgaattgat 

gtgactcacg 

ggagttcaag 

ttagcagggc 

aattgcttga 

gcctgggcaa 

tatatatatg 

taagatccaa 

ccaagaccct 

ttccccctga 

cagcactgaa 

atgtcttgct 

gcgatccctg 

tgatctcagg 

atccccactc 

ggcaggccca 

atccagggca 

tttccagcat 

gctgtgccca 

ttcttcccgg 

ttatgggtgt 

taactttccc 

taggcctgtt 

actggcctga 

ccaaaggatg 

acagctgggt 

ccctggtctg 

tttgaggcca 

accgtggagg 

ctgttgtcct 

gagtctggaa 

ttgcctgcct 

caggcccaac 

ctgctgggaa 

ccagaatggg 

gaggcagatg 

ccttgtgaga 

gagcatggtt 

tgctaaaaat 

ttaagatact 



12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

14940 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 
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atatgattta 
gggtagagat 
atcttgaatg 
cactctgtca 
cccaggttca 
ccaccacgac 
aggatggtct 
attacaggcg 
gaaactgtag 
agtggctaca 
atttcattgt 
tgaggaaact 
gtaggcagtg 
ctaagacccc 
ccctttctga 
ttctgtggaa 
ccctcaccac 
agcggtggga 
tctgcgggca 
cgggcggctt 
cgcaggccag 
ccctccctga 
aaaggaaatg 
cagtggggat 
ttacaaaacc 
aacatcttca 
gctggagtgc 
ttctcccatc 
aactggttaa 
ctcgaactcc 
agcgtgagcc 
cttagcagta 
ttttgagatg 
accacaacct 
ggactacagg 
ttcactatgt 
tcccaaagtg 
tcattccaca 
gacacattca 
atgctaagca 
actatgtggc 
gaacatctgc 
tgaggacctc 
ccctgtaatc 
gaccatctgg 
atggtggcat 
atccaggagg 
agagcgagac 
acaaaaaaca 
tatgatttgc 
tatataaatt 
aagaagaaat 
cgccacaaaa 
agcactttgg 
caacatggtg 
tgcctgtaat 
ggaagttgca 
tccattaaaa 
tgcagtccca 
gcaatgagac 
ctaaaaaaaa 



tttacaatgt 
gttagggtct 
agtgaatttt 
cccaggctgg 
cgccattctc 
cggctaattt 
tgatgtcctg 
tgagccaccg 
agcaacccat 
tttatggagc 
cacaaccact 
gacgttcaga 
gaactcccat 
cttggctggg 
cctgaacaca 
agagaggaat 
gggaggtgcc 
ccagtggctg 
gcgagcacgg 
gccagaggtt 
gtacccatcg 
aggggcagat 
ggggtctgct 
gcactcattc 
tggaggtgct 
ttaacaaact 
agtggtgcaa 
tcagtctcct 
tttttgtatt 
tgacctcaag 
accgcgcctg 
tggcaagcac 
gaatctcact 
ccgcctcctg 
cgcacatcac 
tagccaggct 
ctgggattac 
atcactttgt 
tttattcttt 
ctagaggtaa 
aggcactgtg 
tgtgtcctat 
atactgttac 
ccagcacttt 
ccaacatgct 
gcacttgtag 
cagagattgc 
tctgtctcaa 
agtaggcgct 
ttgccttcaa 
agtaaaacac 
tagatgccca 
cattaaaaaa 
gcggccgagg 
aaaccctgtc 
ctcagctact 
gtgagccgag 
aaaaaaaaaa 
gctgcagagg 
gtgttcatgc 
aaaaaaaatt 



ctgttgcctg 

atcacaagtg 

tttttttttt 

agtgcagtgg 

ctgccccagc 

tttgtatttt 

acctcgtgat 

cgcctggccg 

tctacaggta 

atttagtctg 

ccatgaggta 

gaggtgaagt 

agaggaagct 

gggtctgccc 

gaggttaggg 

gggtgggatc 

ctctgcttca 

gcaggggtcc 

tgcgtccagc 

ctggtgtgcg 

ctcgcaacac 

gttcgtccta 

caggtcccct 

agcaggtatt 

cacatctatt 

tttttttttt 

tcttggctca 

gaggagctgc 

tttagtagag 

tgatccaccc 

gcccattcaa 

aagagacctt 

ctgttgccca 

ggttcaagtg 

catgcctggc 

ggtctcaaac 

aggtgtgagc 

aagcacctgc 

caacagacat 

tacattaatt 

tttggcaatg 

ggctgtgtta 

cttgtttaaa 

gggaggccaa 

gaaaccccgt 

tcccagctac 

agtgagccga 

aaaagacaaa 

atgtgcaagg 

ggaaatcaca 

tcattcattt 

aataagaaag 

tgaaaacagg 

cgggtggatc 

tctactaaaa 

tgggaggctg 

atcacgccac 

aaaaaaagga 

ctgaggcagg 

cactgcactc 

taagaaaata 



cctttctcca 
cctaaaacag 
gagacggagt 
cgccctctca 
cttcctagta 
tagtagagat 
ctgccctcct 
gatgagtgaa 
tggaaataga 
taccgattcc 
gttactgtat 
gagttgccca 
ggcaaccact 
aggtccagcc 
tggcctttca 
acaaaagggg 
cttacgtcca 
ccacccggag 
agccagctct 
cgggggtctg 

cccaacccca 
acggagtcag 
ccatgggcat 
tcacaaacac 
cttccagcaa 
tttgagatag 
ctacaacctc 
gattacagga 
atggggtttc 
gtctcagcct 
cagactttta 
ggataattaa 
ggctggagtg 
attctcctgc 
taatttttgt 
tcctgacctc 
cactgtgcct 
tatgtgttgg 
ttcaagagtg 
tatccatcaa 
aagccattta 
ggcagtaaca 
aacaaaacaa 
ggtgggcgga 
ctctactgaa 
ttgggaggct 
gatcacgcca 
aaaacaaaaa 
ctcttgcaaa 
aactcacact 
cataatattt 
taagtgattg 
ctgggcacag 
acgaggtcag 
atacaaaaat 
aggcaggaga 
tgcactccat 
aacaattagc 
aggatccctt 
taccatgggt 
aaagaaatga 



ttacaatata 
tgacttgtgc 
ttcactctgt 
gctcactgca 
gctgggacta 
ggggtttcac 
cggcctccca 
tgttcttata 
ataaggatag 
taaggggaag 
tatcatctgc 
gggtcccgcg 
atgctttagc 
tccagcccag 
gcaatgggaa 
acccccggtt 
cagtgacgtt 
cccgccagca 
gcagggtcag 
caaggaaagg 
gccccaggcc 
gttccagagg 
gtgcctggcc 
ttgctgggtc 
acataatgct 
agtctcgctc 
cacctcccag 
acatgccaca 
accacattga 
cccaaagtgt 
atgagtgcct 
atttttatta 
cagtggtgtg 
ctcagcttcc 
acttttagta 
gtgatccacc 
ggtggataat 
ggaatgtgct 
cctattatgt 
gcctttagtt 
aatcacagca 
tttattcact 
ggctgggcgc 
tcacgaggtc 
aatacaaaaa 
gaggcagaag 
ctgtactcca 
aaaaacaaaa 
gctattacat 
tgtagtaagg 
attgagcatc 
agctgggtgt 
tggctcacgc 
gaatttggga 
tagctgggca 
atggcttgaa 
cctgggcaaa 
caggcatggt 
gaacctggga 
gacagagcaa 
agtgataaat 



agttctatga 
tgctcaaaaa 
cgcccagtct 
agctcagcct 
caggtgccca 
catgttagcc 
aagtgctggg 
atcacccttt 
caataataaa 
cacttctctt 
attaaacaga 
cactactaaa 
ggcccaagaa 
cactatgcac 
ctcgagaacc 
ccccactctc 
ctcccaggaa 
cagtgccgct 
cagtcgcagt 
gcacagtcac 
ggtcggctca 
ccacacctgg 
ctctgctgga 
ttcctgcctt 
gggcactggc 
tgtcgcccag 
gttcaagcga 
acggccggct 
tcaggctggt 
tgggattaca 
accattgtgt 
tttttttatt 
atctcagctc 
caagtagcag 
gagactgggt 
caccttggcc 
taattttaat 
aagttctaag 
gcatagctct 
aggaacaccg 
tacatctcat 
cattcaacag 
ggtggctcac 
aagagatcga 
ttagccgggc 
aattgcttga 
gcctgccaat 
acacaaaaaa 
ttgaattaga 
ataaaatatt 
tgttgtgtac 
ggtggcccat 
ctgtaatccc 
ccagcctgac 
tggtggtgtg 
cctgggaggc 
agagtgaaac 
ggcgcatgcc 
gcttggggct 
aatgctgtct 
ggacaaggaa 



15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
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agatagaagt 

ggaaggaggt 

tgtatcttgg 

gaaaatagag 

gatgggaagg 

ctaagctgga 

taagatggat 

catgcctgta 

cgagaccatc 

tgggcgtggt 

cgtgaaccca 

ggcgacagag 

cacagtggct 

taagcccagg 

aaaaaaaaac 

agacaggagg 

tgcactccag 

ggaagaggaa 

catgaggctg 

aagtgacttg 

caagttcgta 

ggaaaggttt 

tgtggaggat 

ggtcaaggga 

agaatggtga 

ggcacccagc 

ttcaaggtaa 

ataaggagga 

caattttccc 

cctttgcaca 

gtttttgtgc 

gaagtgctgc 

gttgagccag 



gcaaggggat 

gagggcattt 

taagtagctc 
tgaaaaggca 
gaggggagct 
ctgagatgga 
gagacctgat 
atcccagcac 
ctggctaaca 
tgcaggcacc 
ggagaagcag 
aaagactctg 
catgcttata 
aattcagaac 
tagctgagta 
atcacttgag 
cctgggcaac 
agagccagca 
ttactattat 
cccaaggtta 
gcacatgctc 
caaggagaag 
cagggagagc 
gggccagcca 
gaccaggaag 
tagactctgc 
tgatctgcgg 
agaggaggag 
ttgcttgggg 
ggcatcggcc 
caccggggtt 
tcctcagaga 
ggcagggctg 



ttggagctag 
gaggtaatgg 
cactggatgt 
ccgttggggg 
tggaggtgtt 
gaggaggggg 
gaccaattaa 
tttgggagga 
cggtgaaacc 
tgtagtccca 
cttgcagtga 
tcttaaaaaa 
atcccagcac 
agcctgggta 
tggtagtgct 
cccatggggc 
agagtgagac 
tatttaagcc 
tattcccatt 
cacagctagc 
tgcccgccag 
gtgacatttg 
attcagggaa 
ccccagggag 
gcagggatca 
tcctacctga 
tccagtctgt 
gccaaagtca 
tgagtcctct 
tacagtggga 
ttggggggcc 
cattcagaac 

g 



gagaggccgg 
gaaccacctg 
aacaatgttc 
cagaatatgg 
gaaacagtct 
aaagaaaatg 
atgtgcaatc 
cgaggcttgt 
ctgtctttac 
gctactcagg 
gccgagatca 
aaaaaaaaaa 
ttggtgggag 
acatagcaag 
caccagtagt 
tgaggctgca 
cttgtctaaa 
tgggtgtggc 
ttacagatga 
aaatggcaag 
gggaacacaa 
aatggggcca 
tgtaatagac 
ccacagtctc 
cccccatcat 
atacagaggc 
ggaacaaata 
aggaaaccag 
tcaaggatcc 
atggacttgg 
ctggacctga 
cagatgcacg 



gtggttgggg 
agaaaaggca 
tcatgaaggg 
gcccagaatg 
agaaaacaga 
agagacatta 
agcggctggg 
ggatcataag 
tgaaaataca 
aggctgaggc 
cgccacggtg 
aaagtgcaat 
gctgaggtgg 
accccgtctc 
tccagctact 
gtgtgccatg 
aaaataaata 
acacattctt 
gaaacaggct 
agccaagatt 
agacagaact 
tgaagaataa 
agaaggcata 
cacctcaatg 
catcttccca 
agggaaccag 
aaggaaggag 
gaaggcagaa 
tctctgatat 
gaagggggag 
ttccagtaca 
ttgtcaccat 



aaaagctttg 
gggagctagc 
gaatcatggg 
ccaggaaagt 
tgatagagcc 
atgtcagagg 
cacagtgggt 
gtcaggagat 
aaaaattagc 
aggagaatgg 
ctccagcctg 
cagggcaagc 
gaggatagca 
tacacaaaac 
tgggaggctg 
attgtgccac 
aatgtacaga 
ataacagccc 
cacagaggtg 
caaaccagtc 
ggaggagtag 
gcaggagttc 
ggacagcaag 
aagggtttga 
ctcctcttct 
gtctgtgtgg 
gttggagaag 
gttcccccac 
ggcactgcgt 
gcctcaccag 
gggagaggcc 
ctgctgacac 



19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21501 



<210> 642 
<211> 308 
<212> DNA 

<213> Homo sapiens 



<400> 642 

tttttttttt 

gcagtggtgt 

gcctcaggct 

ttgtattttt 

cctcgtgatc 

gcccggcc 



cctttttttt 
gatctcggct 
cccgagtagc 
tgtagagatg 
cgccagcctc 



tttttgagat 
cactgcaagc 
tgggactaca 
gggtttcacc 
ggcctcccaa 



ggagtctggc 
tctgcctcct 
ggcgcctgcc 
gtgttagcca 
agtgctggga 



tctgtcgccc 

ggggttcaag 

accatgccca 
ggatggtctc 
ttacaggcgt 



aggctggagt 
ccattctcct 
gctgattttt 
gatctcctga 
gagccaccgc 



60 
120 
180 
240 
300 
308 



<210> 643 
<211> 1024 
<212> DNA 

<213> Homo sapiens 



<400> 643 

gcggccgctg 

tggaagcacc 

ggggttcact 

tgcccctaga 

attctaccgc 

gggaatcccg 

gccaggaccc 

aatgggcatc 



caggtcgctc 
cgacggggca 
caagatcagt 
tgagttgccc 
cgctcggcag 
cagcgcctcc 
tgcccgatgc 
ctggccaggc 



agcgacgtgc 
gccgctgcgg 
tggaacccct 
agcacggtac 
ggggacggcc 
agtgatacca 
ccacatggca 
gcggtggctc 



gcgtgcgcgg 
cttctggcag 
ggcgccacgg 
agctactgcc 
agggagaggt 
tcgagctttg 
tggcctactg 
acgccaggaa 



gccggtccca 
ctccctcgac 
gctgtcatca 
tgccccgacc 
ccagccgcgc 
aagagcggtc 
ggacagttga 
tcccagcact 



cagcacgtag 60 

agcttctcca 120 

gtggacagtc 180 

ccagcccctg 240 

ggcaagcctg 300 

ctgacgcagg 360 

gcctttaaaa 420 

ttgggaggcc 480 
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gaggtgggtg gaacacgagg tcaggagttc gagatcagcc tcctggtctc gaacatagcg 540 

aaaccctcgt ctctactaaa aatacaaaaa aattagccag gtggcacacg cctgtagtcc 600 

cagctactcg ggaggctgag gcgggagaat tgcttgaaca gggaggcaga ggttgcagtg 660 

agccgagacc acgccattgc actccagcct gggtgacaag agtgagactc cgtctcaaaa 720 

aaaaaaaaaa aaaaaaaaaa aaaaggcatc cccaggcctc tctacctgac ggcaaccccc 780 

gggggacagc caggtttgaa ctcaccccac ccccatgccg ttttttgctt ttaatatttc 840 

tatttttttt tcagccggca ttttgcacag aggccgtgtc ttatgcggaa tacagggtgg 900 

gtgtgcatgg attcgggcat gaagaacagg gagcaggtgc cccagactcc ccaacgtgga 960 

gtgatgagcc tcgcttgtct agattgtcct ctttgtgcca aagaaataaa cccttaggac 1020 

ttgg - 1024 

<210> 644 
<211> 7365 
<212> DNA 

<213> Homo sapiens 
<400> 644 

gaaagcgtgt tggccaaatc tgaatgatga aagccaatta caaactagaa aatgaaaaca 60 

gaccccagat gcaaggagat gagacagtta aatttacttc ctcttttcta atctgagagg 120 

tttcatgttg aagaaaatca gtgttggggt tgcaggagac ctaaacacag tcaccatgaa 180 

gctgggctgt gtcctcatgg cctgggccct ctacctttcc cttggtgtgc tctgggtggc 240 

ccagatgcta ctgggtaagt aaaatatttg aatattggtg tgggaatgga gctttgctta 300 

ccttgggaga aaggaccaga aggaaagaaa catggactaa ttctggccct ccaaagaagc 360 

agtattcagg tggaataaga agggtaggta taagaaagtt tggtgtacag gattgaatta 420 

ctttctgaat ataaaaatct tccatcctga aagtcccctc tgatcatgat atcctctcta 480 

gctctaatct caactctttt ctttcacttg tgagacaagt tttcttaaaa gaatagtttg 540 

cacttgctat ctccacttcc tcacttctca cacgctcctc aatgcttgca gcctggcttc 600 

cactggaatt ggttttgtct agggcacaac gatctctata ttgactgggt tatggcatgc 660 

ttttcagttc atatcctgct ggacttctct gctgcactgg gctctgttga tcacttactc 720 

cttcttgcat tctctactac cctggcttcc atggcaccat tttctccttg tactacatgg 780 

acttctcttg atcccttaat agcttctctt ctcttcccct taaacactag tgttctctat 840 

agttatagcc ttgcctctgt actcctcttt ctttacacaa tctccttcat gagcttattt 900 

gtctgtttaa tttttgccac tataactctg tatttaagag tcccaaatca tatccccaac 960 

cctaaccatg cttttgagct tcacattcaa atatcaagct tcataccaga catatctacc 1020 

tacatatctc acaagtgtct ctgcaccatg tgactagagt tgaacttatc tgtccttttg 1080 

gcccctgcct catctaattc attttccact ttctgatctc agctaatgac accagcatct 1140 

acccttacaa ggccataaac ctcagaaaca acccagactc ttccttcttt ttcattctct 1200 

aaatctgatc aatcattagg tttggccaat tttaccttct aaattgttca tagaagcagt 1260 

cccttgcctg gatcatcact aatgtcttag gacaagccct catcctctat gacttgggct 1320 

attccagtaa attcctaact aatctcctcg tccaaacctt gtgttcttga aatgaattct 1380 

gtaccctgca gacagagtaa tttctaaaac tcaagtttac cagcccaaat tctttatcat 1440 

ggcacaaaag gccaagcagc ctctactttt accctctcag ccttgtaact tatgctctaa 1500 

taaatctcaa tcagttcata tttacatact cacacaccag actcttgcct gtctctaagg 1560 

ctttgcttat accactatct ctacctgaaa tgagttcttt ccatggcacc ctactctcct 1620 

tctccttgca aatgtgtagg tacccctaaa gactcagctt aaatggtatc ttttccaggt 1680 

agcttttctt gaatgcttct ttctgctcca atggctgctt gcgatacatt tttcttatta 1740 

taatattttc ctacttatgc tgaaataata ttttcatcta cctgtgtctt ccactaaatt 1800 

gagagctatt tgtgggtagt gacaatgttt ttttgttttt gtcttgtgat aagaactgca 1860 

gacttggaat cagatactcc tgtttcaagc ccctcttcta tttaccagct ctatgtcact 1920 

ttagaaaatc actttttcta tttctggttc agttttctca tttgtagaat gaaaataata 1980 

atacctacca cctaaggttt attgagtgag ttcccagtaa cccaacaaga ttttatttta 2040 

ttttatttta tgttttgaga cagggtctcg ctctgccacc caggctggag tgcagtggtg 2100 

caatcactgc tcactgcaac cttgacctct tgggcttaag caatcgtcct atctcagcct 2160 

cccaagtagc tgggaccaca ggcaagtgcc accaggcctg gccaattttt taaaaaatta 2220 

tttgttgaga cagggtctct ctatgttccc caggctggtc tcaaactcct gggctcaagg 2280 

gatcctccca cctcaacctc ccaaagtgct gtgattataa gcatgagcca ccacgcctgg 2340 

ataagatttt atttttaaaa agaaaaaatg gagaggaggc agagcacaga tgagtgacag 2400 

tggaggaact gaaaagaggt agcattacag tgaagggagc ggagaaagga tggaaagaaa 2460 

acacctggta ctgatgtgat gagaattagg aagtagggag gggtcttggt aagggacatg 2520 

acaaaatcaa agaagaacgc tgggctattg ggagtggaag agaataaagg caagtggggc 2580 

acgaaatgac agtcagatgg agtgaagaag tatgcacatg tgaatggcca tgaggctgga 2640 
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ggagggggag aatagttagg gagataaaga gcagtcccag agaagggaga aatgtcactt 2700 

ccc ttgccta ggtctaccat gaaggtaaga gataggttga ggagatgtgg tgagaaatgg 2760 

ataattctaa atttcaatct ggagcaaaga attgttccac agttactaat atggactggg 2820 

aggaggggaa aaaagacaga gaaagctctg tgccactgcc agggaaaatt caagtggcag 2880 

aggacattaa agtaaagatg gctgaaaagt gaagacctgc aagggggagg aagaaaggag 2940 

gagagacact gctgggtact gaaagctctt tgttagataa catattcaag gctgaggagc 3000 

caagtgaagt ttgggaaaat cagaatatga aagaaagtcc ccgaggaagg tctgaggact 3060 

gaagtagacc atatttgcac agtgaagtaa aaagaacatt gattttggac tcagacacac 3120 

ctgcctttcc cgatccattg cttagtctct ttttaagctt tgggtatgtt aatttggttt 3180 

tctaagtgtg tttctttaag attataaaat gaggataaag tcacctacct ggaaagtttg 3240 

tgaggatgaa gttagatact gcacacaagt gctagcaggg tctggcaccc agtgaacatt 3300 

agaatctgat gtctcccttt cattcctgtc ctatgcctcc ctgcttcgag ttaccattgt 3360 

ctttgctcct ggtttcccgt accagcagct ggatgtcatg ccggtgagtt tcaccattct 3420 

ttcattctct cccctccctc ttctccaaac cacagtgcat ttgctgagag cacctggacg 3480 

ccagtgaatc aaggtcaaaa ctatccccct gagacactct caggggtctt tccgaaaaaa 3540 

aaaaaggttt tgatgtctta ctagtctcat tctaaaaagg atgattatcc tgggagatgg 3600 

aggaaccctg tcctctttct ccaagttgca tcatcatttt ctggttctcc ctgggccatg 3660 

ccctcttcag ctgccagttt tgagacgctg cagtgtgagg gacctgtctg cactgaggag 3720 

agcagctgcc acacggagga tgacttgact gatgcaaggg aagctggctt ccaggtcaag 3780 

gcctacactt tcagtgaacc cttccacctg attgtgtcct atggtgaggt cctgggaagg 3840 

cctgagcagt gccccaaacc ccttctttca gcctcagagc ccacccagga aagtgtccct 3900 

aggaaattgg ctatgggatg aggtacttcc ttcccagtct tcagcccaca gcaaccaggg 3960 

tgctctaagt cctaggccct agggaagttg tcccatactc ccaaccaggg gtgagagttt 4020 

acctggaccc tcagagaaag tttaggggag taggcatgct tgggggtggc ccccaaggga 4080 

ggccctgggt gacactcagc tctttctctg gaccatagac tggctgatcc tccaaggtcc 4140 

agccaagcca gtttttgaag gggacctgct ggttctgcgc tgccaggcct ggcaagactg 4200 

gccactgact caggtgacct tctaccgaga tggctcagct ctgggtcccc ccgggcctaa 4260 

cagggaattc tccatcaccg tggtacaaaa ggcagacagc gggcactacc actgcagtgg 4320 

catcttccag agccctggtc ctgggatccc agaaacagca tctgttgtgg ctatcacagt 4380 

ccaaggtgag agctagaagc agcattgtca tggcagggga gggtaaggag agacagggag 4440 

cccaaattgt ctttctttag cctggaggta gcaagatcat caacagacta tggagcaggt 4500 

tgctggcaaa tgcccaagtt gggcttcgaa agggacgaga cctcagcctt ttgaagtcat 4560 

ccagccccac tcagagtcca gcatagttcc taccacccag atctatgctt cctagtgctc 4620 

actgattggg tgtctggaca actatcagat ttgcccatct gtggatcttt cctagtcctg 4680 

tctacttaca gaaagaactt gacagattca aatggaatct aagagagccc attcctgctg 4740 

gaaacagggg aaaggaagta tgactcccca agtctcttca cctgcatgtg tctacctgta 4800 

tataagatgg ctgagctctt ctttcctgcc tatctttttc ccagaactgt ttccagcgcc 4860 

aattctcaga gctgtaccct cagctgaacc ccaagcagga agccccatga ccctgagttg 4920 

tcagacaaag ttgcccctgc agaggtcagc tgcccgcctc ctcttctcct tctacaagga 4980 

tggaaggata gtgcaaagca gggggctctc ctcagaattc cagatcccca cagcttcaga 5040 

agatcactcc gggtcatact ggtgtgaggc agccactgag gacaaccaag tttggaaaca 5100 

gagcccccag ctagagatca gagtgcaggg tgagttcgca tcagagtgca ggttgtctgt 5160 

ttggcatgcg tgtgagtgaa aaggagggat aggataaatt gacctgtgag ctggggttca 5220 

gtgtgagcag gttaagaagg gacacagagg gggcaggaac aatgggccag aatccctgat 5280 

gatgagagac agcacaaata gagaacttct ccctcagact gtggtgcaca cctcaccaga 5340 

ccaagagctg agccagctcc acacctgctg ccccacaagc cttagcatcc cccccaacat 5400 

gctctctggg catttatagg aatttaatat ctggaatgaa gatgggatag tctgaagtct 5460 

atgttcaatt ctgggagcca ctgttaagag gctgcacttg acctgaaaca ctcccagaag 5520 

aggtgtccag ggtgataagg gggcttaaag tcatgtcata tgaaaaataa tggaaggggc 5580 

tggagatgct atcctgaaga ggaggtgact cagaggccag cagaggtgtc ctcaaaaagc 5640 

tgaaggctgg cattagcaca aggaattgga ctaatgtctt atataataaa gaggctgacc 5700 

tagaaccagt taccacattc atgcttcagg gaattagatt cagccttgat agagaaagaa 5760 

ctggctaata gttatagctg tccagtggtg aagcctcact gctgaagaat ttcaaacaga 5820 

agtggccaag cttcagatta aaatttggac caggagggac actagagtcc tcaagtccct 5880 

gctgccattg ggccccacag ccactttcag agaacaagaa agattggcca gggatgggag 5940 

ggaagggcca gagtcagact tcactcttgt atgtgcctcc tgccccataa ttcaggtgct 6000 

tccagctctg ctgcacctcc cacattgaat ccagctcctc agaaatcagc tgctccagga 6060 

actgctcctg aggaggcccc tgggcctctg cctccgccgc caaccccatc ttctgaggat 6120 

ccaggctttt cttctcctct ggggatgcca gatcctcatc tgtatcacca gatgggcctt 6180 

cttctcaaac acatgcagga tgtgagagtc ctcctcggtc acctgctcat ggagttgagg 6240 

gaattatctg gccaccggaa gcctgggacc acaaaggcta ctgctgaata gaagtaaaca 6300 
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gttcatccat 
tcctgcacat 
gttaggtgag 
attctctctt 
gaccagaatg 
ctgggggcaa 
atacttgggg 
ccaggggtgc 
ctggcccaaa 
gttgggctct 
gctattctta 
ttgtatcccc 
agtgggcctt 
cccttattta 
agaagatgac 
ccttgatctt 
ctattcagtt 
acatgactga 



gatctcactt 
atgcataagt 
tgtaaataaa 
aacacaacag 
ttgtgattta 
ttttgccccc 
ggttggggga 
cgctaaacat 
atgtcagttg 
ggaatgggac 
ctgacatacc 
tcaaaagcca 
tgggaagtga 
aaaaggcccc 
atgtatgaga 
gaacttccag 
tgtgtatttt 
gtttatgata 



aaccacccca 
acttttacaa 
tttatataaa 
aattctgctg 
aagagaacta 
cagaggacat 
tggtgggatg 
cctataatgc 
tactgagttt 
tttgtccctt 
agtctttagc 
ttatgttgaa 
ttagatcagg 
agagagctaa 
accaaaaaac 
cctccagaac 
gttatagtag 
tgttaaaaat 



ataaatctga 
gttgtcccag 
gtgagaatta 
tctagatcag 
atggaagtgg 
tgggcaatgt 
tgtgtgctac 
acagggcagt 
gagaaacccc 
ctaattatta 
tggtgctatg 
atcctaatcc 
agtgcagagc 
ctcacccttc 
agctgtcgcc 
tatgagaaat 
cccaaatgga 
actcataaaa 



ttctttattt 
tgttttgtta 
gagtttagct 
gaatttctat 
attgaataca 
ttggagacat 
tggcatccag 
accccacaac 
agcctaatga 
tctctttcca 
gtctgttctt 
ccaaggtgat 
cctcatgatt 
caccatatga 
aaacaccgac 
aaaattctgt 
ctaggcagtt 
cagtg 



tctcttcctg 
gaataatgta 
ataattgtgt 
ctgttatatc 
gcagtctcaa 
tttggtcatt 
taaatagaag 
gaaaaataat 
aaccctaggt 
gcctcattca 
tagttctagt 
ggcattaaga 
aggattagtg 
ggacgtggca 
tctgtcgttg 
tgtttgtaag 
ggcctctgcc 



<210> 645 
<211> 2593 
<212> DNA 

<213> Homo sapiens 



<400> 645 

tgggcacctg 

caggaggtgg 

gagcaaaact 

atacggggat 

aaacagaaat 

cacacacgaa 

tttttgagac 

tcagctctac 

tgcctcaacc 

taattttttg 

tcctgagctc 

gccaccgcac 

tctcagggca 

tgccagggcc 

tggtttgggg 

ttgctgggag 

tcagggactt 

cccatcccca 

tctccaggac 

cacctcttcc 

ctcctgaact 

cggggccatg 

gaggatgcat 

catgcctgtc 

tttgagacca 

gtgggttggt 

cttaaacctg 

actccgtctc 

catctttctt 

ctttctttct 

tttctctcgt 

tacaatggcg 

cccaacctca 

ccggcttagg 

aaataatttg 

aaaattatta 



taatcccagc 
aggttgcagt 
ccgtctcaaa 
taaaatagtc 
tccggaagtt 
ggcgtgaaag 
agggtctcac 
cctccatccc 
ctcagcctca 
tatttttagt 
aagcgatcct 
ctggccgcat 
gaaggtccag 
ccagggctga 
accatcctgg 
gtgcttactg 
ggcttcccag 
gtcttcccca 
tgtgggagga 
acaggttgtc 
cctgtgtgcc 
gtgaggacac 
tagtctgcag 
accccagcac 
gcctggccaa 
ggtgtgcgcc 
ggaggcagag 
aaaaaaaaaa 
tcttttctct 
ttctttcttt 
tctctctttc 
tgatctcggc 
ggtgatccgc 
atgcattttt 
aatcacacat 
ttattattag 



tagttgggag 

gagctgagac 

ataaatacat 

tagtagtgac 

gacctgaata 

actccatgac 

tctgtcaccc 

cccaacctcc 

tgagtaactg 

agagatgggt 

cttgcttcag 

tcttctaaat 

tgtgatcact 

cctggagaag 

ccccccttgt 

tgcctattca 

ttgaccacag 

ccctggcccg 

gcgtgtccct 

acctgcagct 

tggggtggca 

ccttggtggc 

cgtatgataa 

cttgggaggc 

catggtgaaa 

tgtaatccca 

gctgcaacga 

aaaaaaaaaa 

ttctttcttt 

ttctttcctc 

tttttgagat 

ttattattat 

ctgcgttggc 

caatatttta 

cacaccaaaa 

tttagagaca 



gctgaggcag 
cgcaccattg 
acatacatac 
acctgaacag 
cacacacaca 
cctcaaggta 
agactgggtg 
cccaaccacc 
ggactacagg 
ctaaccatat 
cctcccaaag 
cacagtacat 
ttgcatggcc 
gaaaatggca 
cactgttgac 
aggggctggc 
gtccaagaac 
gccctccagg 
cagactggcc 
gccccaggat 
ggggcaaaca 
ttgtcccaca 
aaacggcatt 
caaggtgagc 
actcatctct 
gctacttggg 
gccgaaattg 
aggcatttca 
ctttctttct 
ccttccttcc 
ggagtttcac 
tctccatgtt 
ctcccaaagt 
gtgtttgaat 
ataaattcta 
gggtctcact 



gagaatgaat 

cactctagcc 

atgcatacat 

agagattgat 

cacacacaca 

taagatgcat 

cagtggtgta 

ctgagctcaa 

cgtgcaccac 

tgcccaggct 

tgctggcatt 

ctggctccca 

tctctcccct 

gagggtgaag 

atctcttctg 

agccgcagcc 

ctgcagggtc 

tgcagaaaca 

tgtgtcctgg 

aaaggcaagg 

tagccaactg 

tcaagctggg 

tcaggccagg 

agatcatatg 

actaaaaaaa 

aggctgaggc 

caccactgca 

gttcaaatag 

ttctttctct 

ttcctttctc 

tctcgctgcc 

ggtcaggctg 

gctggtgtga 

aacgggctaa 

ggtggatttt 

ccgtcgctca 



cgtttgaacc 

tgggcaacaa 

acatacatac 

ccaagaaatg 

cacacacaca 

tttttttttt 

ctatcccagc 

gcaattctca 

catgcgcagc 

ggtctcgaac 

acagctgtga 

gtgcccaggc 

cctgagcttg 

atggggtgtc 

cacagtggca 

tcactgcaga 

cagcctcccc 

tgcaggcccc 

ctcctcttac 

ccagagagga 

gtggcctgag 

aggtgacact 

cgcggtggct 

aggtcaggac 

caaaaattat 

aggagaatca 

ctccaggctg 

ggaaaggata 

ttctttcttt 

tctctctctc 

caggctggag 

gtcttgaact 

gccactgcac 

cttgagaaaa 

aacactttca 

ggctggagtg 



6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7365 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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gagtggtatg atcatggttc actgcaacct taaactcctg gcctcatatg atcctccagc 2220 

ctcagcctct caaaggactg gaactacaaa catgcaccac cacgcccagc ctaggtgggt 2280 

ttttaaaatc cattcaaggg cgggtgcagt ggctcacacc tgtaatccca gcattttggg 2340 

aagccaaggt gggaggatca cttgagccca ggagttcgag accggcctgg gcaacatagt 2400 

gagactacat ctctacaaaa aatttaaaaa tgagccaggc atggtggtgc acacctgtag 2460 

tctctgctat tcaggaggct gaggcgggat cattgtttga gcccaggaga cagattgcag 2520 

tgagctatga tggcaccact gcatggcagc ctgggtgaca aagggagatt cagtctcaaa 2580 

aaaaaaaaaa aaa 2593 

<210> 646 
<211> 149 
<212> DNA 
<213> Homo sapiens 

<400> 646 

ggctgggtgc agtggctcac acctgtaatc ccagcacttt gggaggccga gacaggtgga 60 

tcacctgaag tcaggagttt gagaccagcc tggccaacat ggtgaaaccc tgtctctact 120 

aaaaatacaa aagttagcca ggcgtggtg 149 

<210> 647 
<211> 8996 
<212> DNA 

<213> Homo sapiens 
<400> 647 

gacggggttt ggccatgttg gccaggctgg tctcgaactg ctgacctcag gtgatccgcc 60 

tgcctcggcc tcccaaagtg ctgggattac aggcgtattc cactgtgccc agcctgagtt 120 

tctgtttaga aacaacagtc tatgatagta taatcctctc ttttttgtac acagagtaaa 180 

gaggacaaat aggtgaaaga ataaatgaaa ggctggaatc ccacttcccc cgctgtccca 240 

gggcattgga tattgacgga taggaggcag caaaccactc acagagccag gaagaaatga 300 

aggcgttggt attgccagga ggggaagccg gctcggctga aatacgctat gaccatagca 360 

aggagatact gatggagaga aaggaacaca gagagggaga ggtcacatct tggaagagga 420 

agattgtgga gagggggaat gagggtctgg ggaggggctg cccatcagag aagggacctc 480 

agtgttgggg tgactgtact catttggaaa ttgcgggatg gaggggtatt cgaaggtcgg 540 

atgcaaatcc gagaagccag aggaagggtt ttgggtgatg ctcccaggat ggtgggctcc 600 

gatgggatct ttggaggggg tgtgtctagg ttggctggtg tcaggagggt cttttgtgtg 660 

ccaggcagag aactgtcccg aagagctgag agtagagggg ccaggagctt cagggctgcg 720 

gccagactgt ggcccagagc tcagatccca aaggacccat aggagaggca ggggccactc 780 

attcactctg caagagacca gcagaatcct gagggagatg ctgacaaatc ataaaaagac 840 

caagaatagc cgggagtggc ggctcaagcc tgtgatccca gtactttttg agaggtggag 900 

acaggaggat catgtgagcc caacagttcg agaacaacct gggcaacata gtgagaccct 960 

gtttccacaa acatttcaaa aattagttga gcatggtggc atgtgcctag tcccagctcc 1020 

tcaggaggct gaggaaagaa gattgcttga gcccaggaat tagaggctgc aatgagctat 1080 

gatcatgcca ctgcactcca tcctggggag cagagctaga ttctgtctca caaaaaaaaa 1140 

atttgtgggt gccaagactc aagaccatgg gagctggtcg ggcacagtgg ctgacgtcta 1200 

taatctcagc actttgggag gccaaggtgg gtggattgcc tgaggtcagg tgttcaggac 1260 

caacctggcc aacatggcaa aaccccgttt ctactaaaaa cacaaaaatt agccaggcgt 1320 

ggtggttcat gtctgtaatc ccagctgctt ggaggctgag gcaggagaat cgcttgaacc 1380 

caggaggcat cggctgcagt gagtgaagat cgagacactg ccctccagcc tgggcaacag 1440 

agcaagactc tgtctcacac acacaaaaaa aaaaaaaaaa aaaagactgt aggagcatct 1500 

ggtgggaggt ggtggaggga gaactgtggg tttggaagct gcgccctccc cccagccatg 1560 

cgttggaaca ggaacagtta catggagaac aaccttacct tgtccgacac cctcagatct 1620 

ttgtcccagg ccaggaatct tttaatgaca ggatcctctg tgattagaga gcagatgtca 1680 

gtgtgagaag caggacaggg tttccgtggg agcagcaggg cagcgaggag aagtgtgcct 1740 

cccgggggga agtctcagga ttgtggccgc gg^tgaggtg gatgggagag gggagaatga 1800 

ctttcactgg gcaagggaga gaggctcctg ctctgagact cccctgagaa gaggccgaag 1860 

gaggccctgg gtgtgagaat ctacaggatg tagagctggg aatcagccag gaccccctcc 1920 

agcagacacg gagggaccac tgcagagtca taaaggaatt cccatcattt cctcatgaga 1980 

cagtcacaca tcagggtgtg accatggcct tggtatcccc cactatggat ggagacactt 2040 

aggtttagaa aagtcagtaa gagacattaa gtttcagagg gcacagctga aaccactttc 2100 

tttgtttatt gattttgttt ttctttattt gatttttatt tttatttatt tattaattta 2160 
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ttttgagaca gagtcttgct ctgtgggcca ggctggaatg cagtggcctg atcttggctc 2220 

actgcaacct ctgcctcccg ggtttaagcg attctcctgt ctcagcctcc cgagtagctg 2280 

ggattacatg catgagctac tgtgcccagc cttggttttt cttttgagac agggttttgc 2340 

tctgtcaccc aggctggagt gcagtggtgt agtcatagct cactgcagcc tcaaagtcct 2400 

gagttcaagc aatcctcttg cctcagcctc ccaacgtgct gggatctcag gcgggagcca 2460 

ctgcgcctgg cccgaaacca agctttctta tcccaagcgc tgacctttat caagttgacc 2520 

taatccttta tcatctccta agtgtccctc atgagtgatc acttcacatt cctcccacat 2580 

ggagagctca cccactgggg cctatttttc . ccattggaaa agtgtggtta ttggaagttt 2640 

cctgtttttg gaaagaacag gattggaggt gctctctggg gtgtcctcct accaagcagc 2700 

ctgttgaagg cctcgtggtg ctcagggagc acgagcgaca ctcgccgtcg cttcagcttc 2760 

atcttgaggc cacacagcat ctccgccacc cagatctcct caggctcagg ggcgagcacc 2820 

ttccgtggct cctcctccga ctcctcagat ttgtcccacc actccatctt ccttttccag 2880 

caaaaggacc tatgcggggg gctgggatct accccagggg ctgagtaaag aaaccaggcc 2940 

acggtgtaat gcttctgcag ttgatcacac tagagcccga cccaaaaccc caaaccactc 3000 

tccatcctcc ccagcctcgc agactgctgg cttctccaag ccatctttcc ttctgtctgt 3060 

ctcctctgct gagctccatg tgccgctcct tctcctcccc attctcccgt ttctctgtcc 3120 

tcagaacact tcctcatatc cttccctggt ccctggctct ctgagtccct tttttttttt 3180 

tttttttttt gttgttgttg ttgagaaaca gtcttgcttt gtggcctagg ctggagtgta 3240 

gtggtgcgat cttggctcac tgcaacctct gcctcctggg ttccagtgat tctcctgcct 3300 

aagcctccca agtagctggg attacaggtg cccaccagaa cgcccagctc atttttgtgc 3360 

ttctagaaga gacagggttt caccatgttg gccaggctgg tctccaactc ctggcctcaa 3420 

gtgatctgcc tgcctggcct cccaaagtgc tgggattaca ggtgtgagcc actgaaccct 3480 

gcctcagtac ctccattctt cccacacacc ctcctcacgt gctccttcct gacttctggg 3540 

cccgcccttc cttctttttt tttttttttt gagacagcgt ctcactctct cacccagaat 3600 

ggaatgcagt ggcactatct tggctcaaaa caacctcttc cacctgggtt caagcgatta 3660 

tcctgtctca gcctcccgag tagctgggat aacaggcatg cctggctaat ttttgtatcg 3720 

ttagtataaa tgacgtttcg ctatattggt ctggttggtc tcgaacaact gacctcaagt 3780 

gatccaccca tctcagcctc ccaaagtaat gggattacag gcatgagcta ccacacccgg 3840 

ccttcgtttt tcttttgaca cagggttttg ctctgtcacc caggctggag tgcagtggtg 3900 

cagtcatagc tcactgcagc ctcaaagtcc tgagttcaag cagtcctctt gcctcagcct 3960 

cccaacgtgc taggatctca ggcgtgagcc actgcaccta gcccgaaacc aagctttctc 4020 

atcccaagcg ccaaccttta tcaagtctag cctagtcctc tattgtctcc taagtgtccc 4080 

tcatgagtga tcacttctga gtcctcctgc gtggagatct cacccactgg gggcgtatct 4140 

ttcccattgg aaaagtgtgg ttattggaag tttcctcttt ttagaaagaa caggattgga 4200 

ggtgctctct ggggtgtcct cctaccaagc tgactgttga agtccttgtg gtgctcaagg 4260 

aggatgggtg acactcgctg ttgcttcagc ttcatcttga gcccacacag cgtctccact 4320 

acccaggtct cctcaggctc aggggcgagc tccttctccg gctcctcctc agattcatct 4380 

gaccactccc tcttcctttt ccagccaagg gacctacatg gggggctggg atctacccca 4440 

ggggctgagt aaagaaacca ggccactgtg taatgcttct gcatctgatc accttagacc 4500 

ccgacccaaa accccaaacc actctccatc* ctccccagac tcgcagactg ctgacttctc 4560 

taagccatct ttctgatttt ctcctctgct caaccccatg tgccgctcct tcccctcccc 4620 

attcttctct ctctctgtcc tccgaacgct gcttcatgtc cttccctggt ccctggctct 4680 

ctgagtccct ccttttttgt tttgttttgt tttgttttga cacagaatct tgctttgtca 4740 

cccaggttgg agtgtagtgg tgcaatctca gctcactgca acatccatct cctggattcc 4800 

atttattctt ctgcctcagc ctctcaggtg gctgggatta caggtgcctg ccataatgcc 4860 

cagctcaatt ttgtactttt agtagagaca gggtttcacc atgttggcca ggctggtctc 4920 

aaactcctgg cctcaagtga tccgcctgcc ttggcctccc aaagttctgg ggttacaggt 4980 

gtgagccacc gcacccagcc tgaatttctc cattcttccc acacaccctc ctcaggttct 5040 

ccttcctgac cgctgaccct tcttttcttt tcttttcttt tttttttttt tggagtgcag 5100 

tagcgtgatc tcagctcact gcaacctctt cctcccagtc tcaagtgatt ctcctgtctc 5160 

agcctcctga gtagctggga ttacaggtgt gcaccactac cacttggcta atttttatac 5220 

ttttagtaga gatggggttt caccatattg gccaggctgg ccttgaactc ctgacctcag 5280 

atgatccgcc cgcctcggcc tcccaaagtg ctggggttac aggcgtgagc caccgcaccc 5340 

ggcccccttc cttcgtctta gtcaatccta tcccacctct tcttccacca gtcccctcac 5400 

ctgatggtcc caacacttca tcatccacca cctcctggag ggggtacccc gaggtgctcc 5460 

gctggggact ctgctcattc tggcggtgcg gttgacggct ggtcgtgatc tttcccgtaa 5520 

tctgtcccct cttacggaac ctagtctccg ttctgtccat ggccttcttc tggacactgc 5580 

taggatccag aagagtatgt tatcaattct caagcctagg agaagtcagg agtggagaac 5640 

agctctgaga agatactgtt gtccaactga tctccaggca ccacggagtc cggtccctcc 5700 

aatcaggaag gtcggaatct ctgatgtcat cgttcatgcc aacctggcaa ccagtttgaa 5760 

aaaaaaacac atgtaactgc caggctgatc tcttgtcctg gagatcctgg gtgaatggta 5820 
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tctcctgcca ctgtcccaac ctcagaccat tgtccaaaag catcttcagg gactccacat 5880 

ccctctgttc cctgtcccag cagaggctgt gtcctctcca ctcaaagcct gaagcatgtt 5940 

ggggtctctt cgtctctgta cgtgcccatt tcagagtcca gtctggtggg agagggaaca 6000 . 

gagtgggaaa gaaaactagg gtaagcagaa acgatgaaac cttataagag tgagattatc 6060 

atgtacaaga gtgagattat catgtacaag agtgagatta tcatgtacaa gagatcccag 6120 

gaatactgac ttgatgaaaa agtcacatca gagcactcag tttggcagag cttttctgct 6180 

gaatgtttac tcacattcac tgtccaagat tctgtactgg gggtacatac gtcctctgcc 6240 

ctaaggcaat tttgagtcca agagacattt tgaggcctaa aaatcatagg aaactgcccc 6300 

tgagctcaca catatttcca atggagctca cacatatttc caatggtgtc cccaatttca 6360 

gggaatccat ggattaccta agccagcccc tccagttcgg ctaagaaact ctagtctata 6420 

tatcaagttt tgtatcatat gtattgctct gaactcagaa atttcccttc catttatgga 6480 

ttctatgaat aaaatatcac atgtacaaaa agactaagtc gaaaaatttc agctgtgcac 6540 

agtggctcat gcttgtaatc ccagcacttt gggtggccaa gggaggaaga ttgcctgagg 6600 

ccagcagttc aagaccagta taggcaacat agcaagagcc catctctaaa aaaacaaaac 6660 

caaaccaaat tagccaggtg tggtggctgg cacctgtgtt ccaactactt gggagactca 6720 

tgtgacagga agatcacttg agcccaggag ttagaagctg cagtgagcca tgatcttgcc 6780 

actgcactcc agtctgggca acacagcaag atactgtgtc aaaaaatttt tttttgataa. 6840 

aaaataaaag agttacatga cattcagaga ccatccaaaa aacctgcggg ttcccggctg 6900 

ggctcagtgg ctcatgcctg taatcccagc actttgggag gccaaagtgg gtggatcact 6960 

tgaggtcagg agtttgagac cagcctggac aacatggtga aaccccatct ctactaaaaa 7020 

tacaaaaaat tagccaggca tggtggtgga tacctgtaat cgcagctact caggagaggg 7080 

cgctggagaa tcacttgaac tcatggtgcg caggttgcag gga"gccaaga tcgcaccatt 7140 

gtgctccagc ctgggcaaca agagcaaaac tccatctcaa aaaaaataaa gaacctgcga 7200 

gtgagttccc acacgttttc ctgatgggct gctgctttcc taggagtctc tcgctcatag 7260 

aaaaggcaca aactgaaaga ggaagcagat cccattgctg tggaagtccc attgttagga 7320 

agctctgctt ttctggagtt caaattcgca ttcatgacgc tttaaaccgt cagagctggg 7380 

tgggtcctcc tacaacaaaa tcgtttgctc tctctctcct agttaacagg ctttcaaata 7440 

ttagaagatc aatgttctga ccccattaaa atttctcttt tgtggaatga aaagctctga 7500 

tttaacccat cttcaagcct ggtttgatgg aggaataggg gctgagtcac ctgcatttcc 7560 

cctccctgca caaagtcctg ggcccagatc tggggtctgt ctctgctgag ggtggggtga 7620 

accaggaagc acctccctct acatctcctt gatgaatggg tataatggtt gccatggaac 7680 

tggggcttgt ttgatgacct ggggctgggt gggcctctga gagcctttat agctgattgc 7740 

cttttgggag agggcaggtg ggagccccac cctgtcttat gagtcacccc aaaggtgcat 7800 

gggcaggcag gtgctgggga atcggctact ccccagagct tggcgtggcc atccctgtgg 7860 

cccctctggg agtctggagc ccattccctc acactggtac tctctgcagc tggggacatc 7920 

tgcactagga agacaggaca cggcatggaa gctggcctct gcccagaagc catgacattc 7980 

tggtcaccag cctgatgcta taaaacgagt gtcacggccg ggcatggtgg ctcacacctg 8040 

taatcccagc actttaggag gccaaggcgg gtggatcatg aggtctggag ttcgagacca 8100 

gcctggccaa catggcgaaa tcccgtctct actaaaaata agaacattag ccaggtgtgg 8160 

tggcacatac ctgtagtccc agctcctctg gaggctgagg caggagaatc acttaaaccc 8220 

aggaggcgga gattgcagtg agccgagacc acggcattgg actccaggct gggcaacaga 8280 

gcacgactcc gtctcaaaaa caaaaaaaaa cgagtgtcac ctggggctac ttggccagac 8340 

acagagagca aggagacatc cctattatct gtcaaaaata attgttgggg ctgagcacag 8400 

tggctcatgc ctgtaatctc agcactttgg gaggtcggag caggaggact tgaggcctag 8460 

agtttgagac cagcctgggc aacatagcga gcaccccatc tccagaaaaa atttaaaaat 8520 

tggctgggcg cagtggctca tgcctgtaat cccagcactt tgggaggccg agggggatgg 8580 

gtcatttgag gtcaggagtt tgagaccagc ctggccaaca tggtgaaacc ccatctctac 8640 

taaaaataca aaaattagcc gggcatggtg gtgggcacct gtaatcctag ctacttggga 8700 

ggctgaggca ggagaatcgc ttgaacccag gaggcggagg ttgtagtgag ctaggatcat 8760 

gccattgtac tccagcctgg acagcaaagc tagactccat ctcaaaaaaa aaaaaaagta 8820 

aaaaatttaa aaattagatg ggcatggtga catgtgcctg taatccaggt actaaggaag 8880 

ctgaggtagg aggatgactt gagtctagga gttcgaggct gcagtgagct ctgatcgcac 8940 

cactgcactc cagcctgagt gacacagcaa gaccctgctt caaaaaaaaa aaaaaa 8996 

<210> 648 

<211> 14976 

<212> DNA 

<213> Homo sapiens 

<400> 648 

gaggggccgc gcgcacggga ggacggagag gcggaaagga tggcgctgtg acagccgggc 60 
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cggagccctc gcgtccccac cccgcgccct ggccgctggg cccggcggtg agtgcgcgcg 120 

ggcgggggcg gcggcggcgc ggcccctccc ccgggtcccg ggcggctggg ccgcggcaga 180 

ggcgggggcg ccgaggctgc gccgccgccg ccgccgcctt tgttgcgggc ccgccgggga 240 

ggcgggcggc gccggctctc gccgcccggc aggtgcgctg gggccgggag ggggccaggt 300 

ggggaggcgg gccgggcgga gggggcagcg gaagcgctcc cggcgcggtt ccgcggagaa 360 

agggaaatgg ggcaagatgg ctggagaaat ccgaacgcgc ccgggggcgc ggcggcggtg 420 

gcgcaggcgg ggcagggcgg gcggcggccc tggttgcggc accttagcgc ggggaggcgg 480 

cccctcccac ctctgccccg gagccggcag ggagaccgcc gccaagttcg ggccccagct 540 

gcaggcatct gaagcctaga cccatccatg ccgcgttcgc ctatcttttt tggaaatatc 600 

gaaggaaccg actttgaact ccagagggat cttttctttg gaggttcccc gatgcagccc 660 

tcaagatgca tccgcactcc acacctagga attgcccgcg cccatagagg gcaggagtgc 720 

ggccctttgt actggccaag agtcagcgaa tagagaattt gcagcccaga ggcaatgggt 780 

tatttccctg tcggggtggg ccccagccga tttctgcacc tttttgggta tccaggcata 840 

accctcacta tgtttcactg actcctgcgt ttgcaaagtc ccttaaggta gaatgaccct 900 

attttctgaa ccagctcgtg gacagaacct ttgaatttgg aaaatccagg caatgggcct 960 

actgagccaa ccgctccgcc catgcttaac tggcttctag cacccattcc agttgagtaa 1020 

gtgtccaatc tgtgttcaga caccttcact gataggaaat cagcaccttt cccctgcctc 1080 

tgattcctgt ttgggtggca ctggctgtga gaaagtactg cctttctcac tgcttaaaac 1140 

cttttccagg agtatttatc cactggccgg cctctgttct gccgtccagt ctctacaaac 1200 

cagtctgggt tctcttcccc agatagcttt ttaaatattt gaaggcagcc agatacgtta 1260 

tctacccaca gcctctctta tcataattaa acatgtcacc ctgtagcttc ctggccgact 1320 

cttgtttcct tacttccctc cagcgggaac agtgcttggg gctaaagagg tttgtatgtg 1380 

gctcagcctc ccccatcata ggctttgggg aggtcacagt gacacctggt ttttctgaac 1440 

ctgaatcttg cttgcagaga tggagtccac agcctaccct ctcaatttga gcctgaaaga 1500 

agaggaagag gaagaagaga ttcagagccg ggaactagag gacggcccgg cagacatgca 1560 

gaaagtacga atctgctctg agggcggatg ggtaagtaag aaggtgtgga gtccacacag 1620 

ccagcagatg caggcctgag agcccagctt gctttaagtc tcctcctcag acagacgtgg 1680 

gaagtaaaaa tagcccagcc cggtgtgttt gcattcctga gaaaacagct ctgtggcttt 1740 

aaaaggcagt gcaaagaatg gaggaggtgt tatggaaaca gtccctcagc catgtagctt 1800 

gtcatcgttc aattggaagt cctagaagcc aggaccttgt ctctctcgtt cacccactta 1860 

tcctgcactg cgtttgatgc caggcacatt gtaggcacca agtaaatgtt tgttgactta 1920 

ctgctggttg aagggaacca ttttaaataa gactcgagtc attgggttga catgagaggt 1980 

taggctagtt ggttctgtgt gagcacctct taccacagtt gacacaggag agctgcctgt 2040 

ccagctgacc tggatagaaa tggtgggcag cagggaatac agagattcag ggtatacacg 2100 

cagaatagta tgatagaggt cggtttatac aaatgcaaat gcagtagata cttatagtta 2160 

cagatagtta ttcaccgtag ttactggact tcgtggtaat taggcatgaa tagtaaacat 2220 

tttaaataag caacaaattg acattccttt tctctgctta ttttccgttg tgtataaaca 2280 

ttgtcagtat acatagtatc ctttgcttcc atgaagagcc cctatccata caagcatggt 2340 

acttctaaat tgttccctac cagaggagag aactgggatg cagaaagatt atctgccttg 2400 

ccctaaggtt acacatgaac actgacattt gttcttcaaa acaggtctcc tgagctgcac 2460 

ccttaaaagt aatgcatttt accatatgta tgctatacct taatttaaaa aaaaaaaaaa 2520 

cagtgcaacc acaggctctc agtttgccag tctagctgtg gacaccgcgc tgcctttgtc 2580 

tcatcagtgt tctttgccca ccacatcctt cttcccaatt ctgatccaag caggcgtcag 2640 

gtctatttag cctggaacaa gaactcaagg cagtcaaaac tcagggcttt gaggctagat 2700 

tggtcaaccc agaacatcta aatggagagc cttggagaac aggtgcaatg aaagtgaatg 2760 

ttgggacagt gaggaaggag caagatgaag accaaccatg cttaggtggg aaaagaagga* 2820 

agtaaccaga agggctggag gagggagtcc agcagctggt ggtgctgggg attgggcatt 2880 

tgtggcctcc aggcaatcag ccatccatcc acagttacca agcacagtgc tgggcgtaaa 2940 

gagagactct tttcctgatg gaatttccag tcttgtgagc aaggcaggaa agtaaacata 3000 

agtaatagta agcacagtgt gcagagtgca gtggtggaag tctgcaagga tgcaccagga 3060 

gccaagatga ggggatgcct cattcaggct gagaactccg gggcttcctg gacaaggcct 3120 

aatcttgagc agtgagcaat tttgccccca cccctggaca tttggcaaca tctagaaaca 3180 

ctaccccctt tccccacccc tggatatttg gcaatatctg gaaacacttt tgttggtctg 3240 

acttggtggg ggtgcgttgg ggactgggag atcctcctgg catctagtgg atagaagcca 3300 

ggggagctgc tagatatcta cagtgcacag gatgcccccc caccaccaca cacacacaca 3360 

catacacaca caacaaagag gtaactggcc caaaatgtca gttgtgctga ggttgagaaa 3420 

ccctgctcta gagcattaga agaaaagaat ctgtttttaa aatatgtatt ttggttatag 3480 

agaattatgt tagggtgatc agtgaaagct tttcaagcat aagaactatg ataatatatt 3540 

aggataaaat tccatagaat tgaaatagaa atacaatttt aagtagaaaa aacaaaaaat 3600 

gtgctttgaa ctgagtctta gtcattatgg aatcagccaa gggaagggaa gggggttgca 3660 

ttccaggcaa gaggacagca caggcaaaag cactgaggag aggagcttgg gtgcagagtt 3720 
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aggagcagct cctgcagctg gaacctgaaa ctcatggcag ggagtggaag gggatcaggt 3780 

ataggaggcc aggccagggg ccagagccag ccaggaggac atgcagtgct acgccagcga 3840 

gtttgagctt tctccttagc tctgtggaat gaggccttag cccatctttc taagggaggc 3900 

ctcaggctct ctacagccac catgctaagg cctgagtggc ctgctaagaa agcaaagctt 3960 

tggcatcaga ctttctagct ctgtgacctt ggacaagtta cttaaccact ctgtttcctt 4020 

acccataaaa gtgaggtagt gatccccatg tccctagttt agcggattca gtgagagaat 4080 

atatttatat gattgagtgt cctatacagt gcttggccca tagcaggcac tccatcagtg 4140 

gatcagtaca aactacccca accccttccc ccgagatccc tgcgtctttg ctgggtggct 4200 

gcaaggcttt agcgaggctg tgcaaggtgt aagcacttta caacatttac cctttgagat 4260 

agcattggaa tgccaacttc ctgagcagag caaggcagtg cacacaagct agaggaagaa 4320 

cacttatcca gtgagccctt gatcttctga ggaaagggag gagggcaggg actgaaggtc 4380 

cctacctgca aggccaagga ggggtgtctg gaggtggtac acagactgcc tgacttctca 4440 

gttttgcccg gggccggccc tgtttacctt ctgctatgtt gagccagata aaaggcctct 4500 

gtggagcgag gggaggcctg cgcagcaggg cctggctgaa ggatcgctcc attctcacag 4560 

aggcttgctg tgaggagaat gtctaagcca ctacactgtg cctctggagc ataatgtggg 4620 

cttgtttgaa gatcctccgt ggctcagtca ttaataggaa gagatgaatc ctgcaccagc 4680 

ccccagcttt cttcctgcag cttcagggct cctggtacct gatgttttta ccaggcctag 4740 

aatatgttac cccccctaag agtattgatg gcccctctga aatgacaggc atagacagtc 4800 

ctgttcccat tttatcagca aagggaaagg gactataaat aggtatgacc tagatctgga 4860 

acttcccagg gagaaaaaaa ggccaaagag acagttcttt aaaagcctgg gaattgggaa 4920 

aggagtaggg tggaaaattc ctgggctgac tggctttcca gggtgaggaa gtctgaggaa 4980 

agcctgaaga cgaatccatt ttacctcctc atatttcaca cttagggatg tttgtccatt 5040 

cttactccaa agagtaggtt tggttttatg acagataaaa tttaatgatt ctttgaaaca 5100 

gttccaagta gaacaagaga agggaaatcc agagatttgc ctagatcgta tattcaaaag 5160 

cacatttttc agtgactact agctacagga agtgtcctcg gcttagggaa gcaaaaagga 5220 

gtaagtacag gtcctgcttt caacaagctc agattctagt ggagggagca gacaagtcaa 5280 

ccaaaagaat tctgcagtgt gatgagtgcc ccgggagtgt ggagcccagg acatttcagg 5340 

gacatgtggg tgtgctcaca gcgtggcaca gagtgggctt cctcgggagc agggattttg 5400 

ggctgagtcc tggaagatgg gtgggtgtca gccagaccaa gaaaggggag gccctccaag 5460 

gccaggggag aatatgggcc aaggtgtggg ggcgttgggg aatggttgtt cagggaactg 5520 

cacacagctg agtacaagtg gagcacagag ctggggagag ttgagggtta gtcatcaggg 5580 

aggcagctgt gctcggtatt gctggatcct tggggtcagg cggtggtggc cccaaaacgc 5640 

aaccccacca tttattggct atgacattag cctggaaatg agacctctcc aagcccatat 5700 

tctccttgat taagtagggg catggtggct tgtgcctcta atcccagcta ctagggaggc 5760 

tgaggcagga gaatggcttg aactcaggag gcagaggttg cagtgagcca agatcgtgcc 5820 

actgcactcc agcctgggca acagagcaag actccatctc aaaaaaaaaa aaaaaaaata 5880 

ggcgtatcta ctcctccaca ggcccattgc agggaataaa tgaagcagtt acaggccctt 5940 

tcctagtacg tgcttcctat gtatgaggtg ctgtggaggt ggggagagaa caagggggct 6000 

tccttgggtc agggaggaaa ggccaatgta tgtaagtgac tgttggcctg acccctgaag 6060 

tggctgcagg ctatatagca tatggacact aattttctaa gctaaggaaa aacagtatca 6120 

taatcaccca gacccaagat acagctttgg gaaatagctt tgaaggcaag aatacaaaat 6180 

aaccagcctg aaacaagcca tccagtgcct tcttatcact aaataggatg agctttggag 6240 

taggattttt aatgactgta ataagaaata cagttaaaaa caaaaagagg aggttggttt 6300 

gggtggcttg gctgccttgg gctactgagg agccccctgg tgctgggcca ggaagcagag 6360 

gaacagggga gggatgcagt cctttgggct ttctgcctac ctggtccagg atcaaagtga 6420 

atgagtttgc tctggcatct cattacatat attaagctgt aattagccct tctttcttcc 6480 

tcttcatgct cttgtggatt ggaaaggcca aatagggccc cttctctcag tggctcctcc 6540 

ctgggcagct gtgggttcca agtctaacac tttggttagt acccttggaa atgcagtgcc 6600 

ctggaaaccc acaaagatcc cacggagggc agtgatccta ttcagattta caaacccaac 6660 

tcaccccgcc cggggccgag ggtctttttc tggggtggga gggaattggg ggtgcttgtg 6720 

ttggtgggac actttgtctg ggtgtgggtg gcttaagtct tgggtcagag gtgagccgtg 6780 

agcagctctc accacagtgc atgagggtgc gatgtcccat gtcctccaac cggtgatcat 6840 

ggtagggagt tagttttctt ctcaaagtgc tgcttacctt atctgaactg agggtggcag 6900 

ctgctgtggt gggcacagaa ggactgtgct cttgcccttg ttggcttcca gcagctgtat 6960 

gaatgtggac agtccgtttt cccctgggtc agggtccctc acctgacctg tgagaggaca 7020 

ggctgtgctg ggctgtctgt aaggtcctgt ccgctctgaa agtctgtctg catatgcctg 7080 

tttgtcattg tgaaagggct gttcactctt ccaggtaccg gccctatttg atgaggtggc 7140 

catatatttt tccgatgagg aatgggaagt tttgacggag caacaaaagg ccctctaccg 7200 

ggaagtcatg aggatgaatt atgaaactgt cctgtccctg ggtaaagctg tgttttcctt 7260 

tgtctcctgg aagctctgag cccagagaat tgtttccatg cctctcttct ctggtatatc 7320 

caatgtccct ctctcctgaa cttagactct gcctcctcaa tgtctgtgcc catttattcc 7380 
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aatttttgat ttgggcagta tatcaatatt gtgtcaccct actcggcaat ccgtgctgcc 7440 

tcagaactgc ccccattcaa cccaggcatt cacatttaag gataaaaacc atgtaacctt 7500 

agaaatgtca ccatggatct gggcctgtgc ctcactgttc cttacgagag aaggcaggcg 7560 

ttaatgttac cctgtcagcc tcagaagttt ggggacatac cctcccaaag actgtttgct 7620 

cctctcccca acttccaccc caaattgata tcacacatct tatcagcaga atgtaatcat 7680 

ttgaagagat ttcccagttt gcttatacag gtaccaaaat gttaattgga tcagcagctg 7740 

cattcaccct tctccctgct aaatgcagtt cttttattgt tggtgccaat accatccctc 7800 

attaggagag atctacccat tcctttagac agacctattt tccattatct tttttcttta 7860 

agtatccctg agcctttttc cccttgtttt . tcacaaagaa ggggagagtg gctcttaagt 7920 

attttgaagt acttattttg gtagtgctcc aggacagtaa cacacagctg ttacctattt 7980 

tatttcctat aaacagaatt cccattccct aagccagaca tgatcacccg tttggaaggg 8040 

gaggaggagt ctcagaattc tgacgagtgg cagctccaag gaggcacctc tgcaggtact 8100 

acaagtaaag cagttcagcg tccatccagc tgggaacatt gtggagtggg attttcagtg 8160 

tacattccac ggggtattta agaggtagaa ggattatggg ttttggagtt ttgaaatctc 8220 

cctctttgca gtgctaaatg tgtgacagtg tgtgacaggc aactgctttt acctcagagc 8280 

ttccattctt cctctatcag ctctgagtaa caagacttga tgtatgacct tgtttgtgga 8340 

tgaagcaaaa tgaggcttgt gcagcatttg gtctagtgca cc ttctggct gaacagggct 8400 

ccctcagtgg cgtaccatgt gcctctgcaa actaagaata gtgagaggaa gcacacttcc 8460 

tgagttacct ggctccctgt ccatctcata ctggatagaa aaagcaatag caatcatagc 8520 

atacatttga gcaaatgtat ctaagaagta cctagtcctt tgtcaagaaa aatggttatt 8580 

gaggccacag ttgaataaat atagcattgc ctccagacgg ccaaaatagt tttatctttt 8640 

ctccggaagt aagagtttaa aatgccatat ttcactgact ctgagacatg tttttctccc 8700 

attgctttac tgtttctgaa attgagaatg cctcttaaaa tcagtggcat gtcatagctt 8760 

aattagtagt gttttctttc ttaatggcac ataatgatgc atcctacaac tgatgacatt 8820 

ttagtttggg tgagataggg taatttcccc aaatcatatt caacttctgt gaccaccttc 8880 

taaatataaa gtgaaacctg gttgcaccat taatgtgcaa gaactaattt tctgccttat 8940 

aagcttaact ttgctaagtg tggtttttgc tgtaggtttt cttggttcta aagggatatg 9000 

attcaggcca tgatgcatct ctgtaatatt atatggtagt atggttaaca tttggcttct 9060 

cattcacata aaaagtattt tgatgcaaac atggccccca aattatttgg tccaagcttg 9120 

gttcctctgt tgagttccca gctctgccac tttctggctt ggctaccttg gtgcattatg 9180 

taacctctgt agcctcccct tcttcatctg caaagaagga acagtgatac ttagagttac 9240 

tctattaaga gaactaggtg tgggtagtcc acataaattg gcatatggaa aaggacttag 9300 

gcaatgttac ctgttattaa agttactact tttggtgaac ttcagattaa ggtattctta 9360 

gcatttcaac ccccaaaaat cagggtataa gccattttac ttactggcat tatttttgcc 9420 

cctcactatt tgtctcccag aaaatgaaga atctgacgta aagcctccag actggccaaa 9480 

cccaatgaat gctacctccc agtttcctca gcctcagcac tttgacagct ttggcctccg 9540 

tctgcctcgg gatatcacag agctgcccga gtggagtgag gggtacccct tctacatggc 9600 

catgggcttc ccagggtatg acctctcggc tgatgacata gctgggaagt ttcagttcag 9660 

ccggggcatg cgccgcagtt acgacgcagg gttcaagctg atggtagtgg aatatgctga 9720 

gagtaccaac aactgccagg ctgccaagca gtttggagta ttggaaaaaa acgttcgaga 9780 

ctggcgcaaa gtgaagccac agcttcaaaa cgcccacgcc atgcggcggg cattccgagg 9840 

ccccaagaat gggaggtttg ctctggtgga ccagcgtgtg gccgaatatg tcagatacat 9900 

gcaggccaaa ggggacccca tcacccggga ggcgatgcag ctgaaagctc tcgaaatcgc 9960 

ccaggaaatg aacattccag agaaagggtt caaggcaagc ttgggttggt gtcgaagaat 10020 

gatgagaagg tatgacctgt ctctgaggca taaagtgccc gtgccccagc acctgccgga 10080 

agacctgact gagaaactcg tcacttacca gcgcagtgtc ctggctctgc gcagggcgca 10140 

tgactatgag gtagctcaga tggggaatgc agatgagacg cccatttgtt tagaggtgcc 10200 

atcacgggta actgttgata accagggcga aaagcctgtc ttggtcaaga caccaggcag 10260 

ggaaaaactg aaaatcacag caatgcttgg tgtcttggct gatgggagga agttaccacc 10320 

gtacatcatt ttgaggggaa catatatccc cccggggaag tttcccagtg ggatggaaat 10380 

tcgctgccac cggtatgggt ggatgactga agacttgatg caggactggt tggaagtggt 10440 

gtggagacgg aggacaggag cagtgcccaa gcagcgaggg atgctgatct tgaatggctt 10500 

ccggggccat gccacagatt ccgtgaagaa ctccatggaa agcatgaaca ctgacatggt 10560 

gatcatccca gggggtctga cctcacagct tcaggtgctg gatgtcgtgg tctacaagcc 10620 

actgaatgac agtgtgcggg cccagtactc caactggctt ctggctggga acctggcgct 10680 

gagcccaacc gggaatgcta agaagccacc cctgggcctc tttctggagt gggtcatggt 10740 

cgcgtggaat agcatctcaa gtgagtccat cgtccaaggg ttcaagaagt gccatatctc 10800 

cagcaacttg gaggaggaag acgatgtcct gtgggaaatc gagagtgagt tgccaggagg 10860 

aggagaacca ccaaaagatt gtgacaccga aagcatggct gagagcaact gaagggaaag 10920 

ggaaaggtaa ccactcagga gtagatactc agtgcctttg ctgacatgtc tgtgttctac 10980 

aaacaccttc tctccattat ttttctgttt ttaagttccc ttagagccta cagtgcagta 11040 
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gtgtagatca 

gaactgggct 

gcccgctgtc 

cacatgtgag 

ttcatcccaa 

gtcttccatg 

gaccattagc 

ttgctctgtg 

ctgccagaac 

gacgttgttc 

aattatgctg 

cttttgctgg 

gggtctcagt 

cttgcttttc 

tcacatgggc 

taactgaatc 

gggctgagga 

aaggaaagta 

acacctccag 

acaattcatt 

cagtgtttta 

tcttcacaca 

tttttctcca 

catctgtgtt 

atattgctga 

ttactattac 

ttgagaaacc 

atatcaaact 

ctccttgtta 

gacatgatta 

gaatgttggc 

ctgcagctcc 

gaccaagtgg 

ttatgtttgt 

tttccagcaa 

tcctgaaagt 

gaacagcccc 

cccacaccat 

agaaaagaaa 

aagaaacctg 

acaccaacga 

caagtaaagt 

ggtcaagac t 

acatttcctc 

cctggtgcca 

aaaataaact 

ggtctcattt 

tatgcaagct 

ctacatggta 

tttatcacgg 

tctcctgccc 

ttcccacata 

atatgccagg 

ggaaacaggc 

actgttcttg 

ttcacctttc 

ctagtatttg 

ttcttggtca 

ccctccaatt 

tttctcttgc 

gagccatagt 



ggggtcccca 

gcacagcagg 

acagcagcgg 

ggatctaggt 

agccatccta 

aaaccagtcc 

attttctcct 

gcttggaccc 

atctgagagc 

agttcacaca 

aacatttagg 

cctgtaggac 

cctggagcta 

cctacagagt 

tatgacaata 

tttacaccaa 

caataggcct 

ggaatggagg 

agaggaggag 

tacctgttcg 

tatgaaggaa 

ttctttttct 

aatgaaatgc 

tcaaagagtg 

atcccattgt 

attcaaaatc 

ccatgaatcc 

gaattacatt 

gataacattt 

aaaacttcca 

ctgtaatatt 

tgaggaagtg 

gcataaaact 

taaaagttat 

atggaactct 

gtggatgcgc 

ggtccagaca 

acaaggtatc 

ctgcttctgc 

taaatatgaa 

gggactcatc 

tttattctgc 

acagagaagg 

ttttggaaaa 

gcagatggct 

atgctgtttt 

aatggccctg 

cttgtttctc 

caaggcccat 

ggcagtggga 

tgcaggaata 

agtttgagtg 

acctcacaga 

tgccataacc 

gcatcttgaa 

tgatcctttt 

aacaactcaa 

tctcaaatgg 

ccacctaatt 

tttgggagct 

taccctgcca 



acccccaggt 

aggtgagcgg 

cattagattc 

tgcaggctcc 

ccacccacta 

ctggtgccag 

ttatatgaaa 

agaaacactt 

tgtaaagtta 

aaattttgtc 

gccagtatgt 

tgtgacatag 

tctacagtat 

tcaggagttt 

atttcctgta 

aaaaagaaaa 

ggaattttct 

cctaactgag 

gatgagcatg 

ttctcatctc 

gaagccacct 

ttcttttaaa 

taatgggagc 

acttatgtct 

aattctggga 

accaaaattg 

agcctccctt 

gagtttccat 

ttgtgcactt 

gaagaggaaa 

ggt cat gage 

atgggagtag 

aegtcttgge 

accaaaaagc 

gatttaaaca 

tggatgegea 

gcagcctgta 

agaaaagtct 

cccatttctt 

agaacaaagg 

atatgggcac 

ctttgaaaat 

ctttgtagaa 

atgagatact 

tttctataga 

atttatcttc 

tttttttcat 

aggctgectg 

tgactcatct 

gggcttgggc 

ggtgggaagt 

ttacgaaaac 

caaagecatt 

acttttcctt 

acacctattt 

caaccagttt 

tatttgaatt 

cgtctaaacc 

tttcatctgt 

taaaagattt 

agaagagtag 



cacggaccat 

caggtgagca 

tcataggagc 

teaatgectg 

ccccgcactt 

aaaagttggg 

catttattat 

gtataatagg 

aaattatttg 

tgaaaageca 

gtaactgaca 

ggagaacact 

gttaccagcg 

taaaaegtea 

gtcagcagat 

taaaacggtg 

gggttgagta 

acaacactag 

tacagcaggg 

cctaattata 

tttttacaga 

gtataattag 

cttattttac 

cctgccttgc 

tccctggaat 

cttttttttc 

ttctgetgae 

cttttcttca 

ttaatttcct 

ggagaaaaag 

ttttgetett 

acagattcac 

ctctatttgg 

caagggttaa 

gctggggatg 

gggaacatca 

catccatccc 

aggacctatc 

gttttggaga 

ttatttcctg 

aactctggtg 

gcttccaaaa 

atgtgtcacc 

tagaataaca 

caaactaggt 

ccaacctgat 

tattatattt 

cagaagaagt 

gatgcttgtt 

ttttagecac 

cattgaattt 

attcctttaa 

gctagaaatg 

cttgtagact 

ctactcaggt 

tcccccaagg 

gttgccccat 

cagctacttt 

cctagttact 

tacaagacct 

aaaatgggtt 



actggtccat 

agcattacca 

acaaaccctg 

aggatctgag 

cctgggtccc 

gactgctggt 

ttgatttatt 

tgaaatgaac 

gacccaaaga 

acttttttct 

tgctgggaca 

aagtgtgtcc 

agtaagaata 

tcttagtctc 

gtctgaagga 

cgtatctagg 

cacaagaaga 

tgaaatgacc 

gggacaccca 

aattaaattg 

ggacctgaaa 

gtatttccat 

ccacttcagc 

ttccaaatta 

tacaatggag 

ccccttgggt 

ccttatattg 

tttacctttt 

aagctcctta 

etatgetaag 

ctagtctgcc 

agtgaacaag 

aaatctgaat 

ttcatttagg 

aaattcctca 

ggaaaaggee 

aggacacagc 

atttcatcag 

ttactccatc 

gagaaaagac 

tccttctatg 

gtagaccctg 

tatgtacacc 

agaaaattaa 

tagtgtggaa 

tggcagctag 

aatgataggg 

cgctataaat 

ttgttaattt 

agctgtttta 

ttacactata 

agggatctgt 

tcattccaat 

cagctcacct 

actcattgtc 

ggggaaattt 

ttgettttae 

gcattccaga 

ggctctttct 

aattttgggt 

caactcctgt 



ggcctgtcag 

cctgagctcc 

ttgtgaactg 

gcggaacagt 

tgcaaaaatt 

atagataaca 

tgtgtctatt 

aaagaatgag 

agcatagtct 

ttccctttta 

gttgtactca 

tccgggcctc 

atagcttcta 

attatgacct 

agatattttt 

actctagtga 

acagctttca 

ccacccacct 

caaagaacaa 

atcctccaga 

ccgtcaacca 

cttccttgct 

tatataagtc 

aattgtactc 

caagaagttc 

cctccttgct 

ectgeattgg 

gcttttccct 

ccacctcagg 

ttgeatttat 

acaattctct 

agatttggga 

gatggaatta 

ttctctctgt 

agatgattat 

aeggggctet 

ccagcccctc 

agacatgatc 

tgtccatcaa 

aatttattca 

gagaaaacct 

tccccacaca 

tgetacttae 

gacatactgg 

gatataggtt 

acttttttag 

caggatttcg 

tatctgttgt 

ctttaatatt 

agacttctga 

gtaatttgea 

gctacacaaa 

gatcagatct 

gtatatttaa 

ctgttactga 

tacttaacct 

ctgtactgta 

agtttccatt 

tcatgtctta 

tccttccttg 

ttcgctccac 



11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13500 

13560 

13620 

13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 
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caacacctct gtgagtctca tcatcagctg agcgatgatg ccttacaggt tgcatagcac 14760 

tggaactttc ctagagtaac ggctctgctg ccagggtttc tctgggctca ttcttccact 14820 

gacttaatta tgatctatgc ctaacagagc cccagtacaa ctattttgca gaatggctgt 14880 

taccctagaa ttactatagc acatattgag atatagttgt actccctagt agataggaac 14940 

tgaccccaac aataaacttt gataataaag acaata 14976 

<210> 649 
<211> 720 
<212> DNA 
<213> Homo sapiens 

<400> 649 

gtcttccagc attctatggt cccaccaggt gggtaggtga ctaggaattc aggccaggtc 60 

ctagagagta gtgagaagtt ttccagtttg cagtttcctc tgtgctgtat gtacagccat 120 

acagcctatg gaacctgaca ttgcagtggg cagcaggaca cggtcaaggc tcctagctgc 180 

ctctccatct atggaactat gaaatttcac acatattccc taggcagcat taaggcccag 240 

gtatattgag tagtccaaac caaaaaacta tattcgagta tcagtttaaa ttcttccaag 300 

ccctaaaaat tcttacgtag tttctcacct aaaactaatg gcttgtgcca agaactgaaa 360 

ctaagctatt gatttttttt taagaagtct taatctatac ataagaaatt acatacctgg 420 

ccaggtgcag tagttcaggc ctgtaatcct aacaaacact ctgggaggcc aaggtgggca 480 

gactgcttga gctcaggagt tcattaccag tctgggcaac atgataaaac cgtgtcttta 540 

caaaaaaaaa aaattttaaa ttagccgggc atggtggtgt gaatctgtag tcccagctac 600 

ttaggaggtt gcgatgggag aatcacctga gcccagaggt ccagtctgca gtgagccatg 660 

atcacaccac agcactccag ctggggtgac agagtgagac cctatctcaa aaagaaaaga 720 

<210> 650 

<211> 3126 

<212> DNA 

<213> Homo sapiens 

<400> 650 

cttggcaatg tattaaacag caggccttgg agactagcac ttgagttaac acagccacca 60 

caaccaccac tgccatcatc accttcccgg aaagcagcca cctgtctggc tcctggcttt 120 

gtccagctgc caacctaagg catgtgccta cgcaggaggc gatgacattt tggctccacg 180 

ttcaaagttg tttttttttt cctttctcat gtgttatttc taaagataac aaaggtcaaa 240 

aggcatccag cgttttctgg tttctcataa gcttctggtc aatatttaat ctggtttatg 300 

gatttttttt aggtcttcta gatgccttct tgaggctgct tgtggccacc cacagacact 360 

tgtaaggagg agagaagtca gcctggcaga gagactctga aatgagggat tagaggtgtt 420 

caaggagcaa gagcttcagc ctgaagacaa gggagcagtc cctgaagacg cttctactga 480 

gaggtctgcc atggcctctc ttggcctcca acttgtgggc tacatcctag gccttctggg 540 

gcttttgggc acactggttg ccatgctgct ccccagctgg aaaacaagtt cttatgtcgg 600 

tgccagcatt gtgacagcag ttggcttctc caagggcctc tggatggaat gtgccacaca 660 

cagcacaggc atcacccagt gtgacatcta tagcaccctt ctgggcctgc ccgctgacat 720 

ccaggctgcc caggccatga tggtgacatc cagtgcaatc tcctccctgg cctgcattat 780 

ctctgtggtg ggcatgagat gcacagtctt ctgccaggaa tcccgagcca aagacagagt 840 

ggcggtagca ggtggagtct ttttcatcct tggaggcctc ctgggattca ttcctgttgc 900 

ctggaatctt catgggatcc tacgggactt ctactcacca ctggtgcctg acagcatgaa 960 

atttgagatt ggagaggctc tttacttggg cattatttct tccctgttct ccctgatagc 1020 

tggaatcatc ctctgctttt cctgctcatc ccagagaaat cgctccaact actacgatgc 1080 

ctaccaagcc caacctcttg ccacaaggag ctctccaagg cctggtcaac ctcccaaagt 1140 

caagagtgag ttcaattcct acagcctgac agggtatgtg tgaagaacca ggggccagag 1200 

ctggggggtg gctgggtctg tgaaaaacag tggacagcac cccgagggcc acaggtgagg 1260 

gacactacca ctggatcgtg tcagaaggtg ctgctgagga tagactgact ttggccattg 1320 

gattgagcaa aggcagaaat gggggctagt gtaacagcat gcaggttgaa ttgccaagga 1380 

tgctcgccat gccagccttt ctgttttcct caccttgctg ctcccctgcc ctaagtcccc 1440 

aaccctcaac ttgaaacccc attcccttaa gccaggactc agaggatccc tttgccctct 1500 

ggtttacctg ggactccatc cccaaaccca ctaatcacat cccactgact gaccctctgt 1560 

gatcaaagac cctctctctg gctgaggttg gctcttagct cattgctggg gatgggaagg 1620 

agaagcagtg gcttttgtgg gcattgctct aacctacttc tcaagcttcc ctccaaagaa 1680 

actgattggc cctggaacct ccatcccact cttgttatga ctccacagtg tccagactaa 1740 

tttgtgcatg aactgaaata aaaccatcct acggtatcca gggaacagaa agcaggatgc 1800 
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aggatgggag gacaggaagg cagcctggga catttaaaaa aataaaaatg aaaaaaaaac 1860 

ccagaaccca tttctcaggg cactttccag aattctctca tatttgtggg ctgggatcaa 1920 

gcctgcagct tgaggaaagc acaaggaaag gaaagaagat ctggtggaaa gctcaggtgg 1980 

cagcggactc tgactccact gaggaactgc ctcagaagct gcgatcacaa ctttggctga 2040 

agcccctgcc tcactctagg gcacctgacc tggcctcttg cctaaaccac aaggctaagg 2100 

gctatagaca atggtttcct taggaacagt aaaccagttt ttctagggat ggcccttggc 2160 

tgggggatga cagtgtggga gctgtggggt actgaggaag acaccattcc ttgacggtgt 2220 

ctaagaagcc aggtggatgt gtgtggtggc tccagtgggt gtttctactc tgccagtgag 2280 

aggcagcccc ctagaaactc ttcaggcgta atggaaaatc agctcaaatg agatcaggcc 2340 

cccccagggt ccacccacag agcactacag agcctctgaa agaccatagc accaagcgag 2400 

ccccttcaga ttcccccact gtccatcgga agatgctcca gagtggctag agggcatcta 2460 

agggctccag catggcatat ccatgcccac ggtgctgtgt- ccatgatctg agtgatagct 2520 

gcactgctgc ctgggattgc agctgaggtg ggagtggaga atggttccca ggaagacagt 2580 

tccacctcta aggtccgaaa atgttccctt taccctggag tgggagtgag gggtcataca 2640 

ccaaaggtat tttccctcac cagtctaggc atgactggct tctgaaaaat tccagcacac 2700 

ctcctcgaac ctcattgtca gcagagaggg cccatctgtt gtctgtaaca tgcctttcac 2760 

atgtccacct tcttgccatg ttccagctgc tctcccaacc tggaaggccg tctcccctta 2820 

gccaagtcct cctcaggctt ggagaacttc ctcagcgtca cctccttcat tgagccttct 2880 

ttccccactc catccctctc ctacccctcc ctcccccaac cctcaatgta taaattgctt 2940 

cttgatgctt agcattcaca atttttgatt gatcgttatt tgtgtgtgtg tgtccgatct 3000 

cacaagtata ttgtaaaccc ttcggtgggt gggggccata tcctagacct ctctgtatcc 3060 

cccagactat ctgtaacagt gccaggcaca cagtaggtga tcaataaaca cttgttgatt 3120 

gagtaa 3126 
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Groups 1167-1455 represent polypeptides of SEQ ID NO: 300-588. The polypeptides lack unity as described above- 
Groups 1456-1744 represent methods of using a polypeptide of SEQ ID NO: 300-588 to identify binding partners for the polypeptide. 
The polypeptides of SEQ ID NO: 300-588 lack unity as described above. Because each method depends on a unique polypeptide 
sequence, the binding partners identified by each method will differ. Therefore unity among the methods is deemed lacking. 

Groups 1745-2033 represent antibodies which bind specifically to polypeptides of SEQ ID NO: 300-588. These antibodies lack unity 
as described above. 

Groups 2034-2332 represent nucleic acid molecules of SEQ ID NO: 1-299. The nucleic acid molecules lack unity as described above. 
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